Detail publikace

Comparison of Keyword Spotting Approaches for Informal Continuous Speech

SZŐKE, I., SCHWARZ, P., BURGET, L., FAPŠO, M., KARAFIÁT, M., ČERNOCKÝ, J., MATĚJKA, P.

Originální název

Comparison of Keyword Spotting Approaches for Informal Continuous Speech

Anglický název

Comparison of Keyword Spotting Approaches for Informal Continuous Speech

Jazyk

en

Originální abstrakt

This paper describes several approaches to  keyword spotting (KWS)  for informal continuous speech. We compare  acoustic keyword spotting, spotting in word lattices generated by large vocabulary continuous speech recognition and a hybrid approach making use of phoneme lattices generated by a phoneme recognizer. The  systems are compared on carefully defined test data extracted from ICSI meeting database. The  acoustic and phoneme-lattice based KWS are based on a phoneme recognizer making use of temporal-pattern (TRAP) feature extraction and posterior estimation using neural nets. We show its superiority over traditional HMM/GMM systems. The advantages and drawbacks of different approaches are discussed.

Anglický abstrakt

This paper describes several approaches to  keyword spotting (KWS)  for informal continuous speech. We compare  acoustic keyword spotting, spotting in word lattices generated by large vocabulary continuous speech recognition and a hybrid approach making use of phoneme lattices generated by a phoneme recognizer. The  systems are compared on carefully defined test data extracted from ICSI meeting database. The  acoustic and phoneme-lattice based KWS are based on a phoneme recognizer making use of temporal-pattern (TRAP) feature extraction and posterior estimation using neural nets. We show its superiority over traditional HMM/GMM systems. The advantages and drawbacks of different approaches are discussed.

Dokumenty

BibTex


@inproceedings{BUT18062,
  author="Igor {Szőke} and Petr {Schwarz} and Lukáš {Burget} and Michal {Fapšo} and Martin {Karafiát} and Jan {Černocký} and Pavel {Matějka}",
  title="Comparison of Keyword Spotting Approaches for Informal Continuous Speech",
  annote="This paper describes several approaches to  keyword spotting
(KWS)  for informal continuous speech. We compare  acoustic
keyword spotting, spotting in word lattices generated by large
vocabulary continuous speech recognition and a hybrid approach making
use of phoneme lattices generated by a phoneme recognizer. The 
systems are compared on carefully defined test data extracted from ICSI
meeting database. The  acoustic and phoneme-lattice based KWS are
based on a phoneme recognizer making use of temporal-pattern (TRAP)
feature extraction and posterior estimation using neural nets. We show
its superiority over traditional HMM/GMM systems. The advantages and
drawbacks of different approaches are discussed.",
  booktitle="Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology",
  chapter="18062",
  journal="5th European Conference EUROSPEECH 97",
  year="2005",
  month="september",
  pages="633--636",
  type="conference paper"
}