Detail publikace
Comparison of Keyword Spotting Approaches for Informal Continuous Speech
SZŐKE, I., SCHWARZ, P., BURGET, L., FAPŠO, M., KARAFIÁT, M., ČERNOCKÝ, J., MATĚJKA, P.
Originální název
Comparison of Keyword Spotting Approaches for Informal Continuous Speech
Anglický název
Comparison of Keyword Spotting Approaches for Informal Continuous Speech
Jazyk
en
Originální abstrakt
This paper describes several approaches to keyword spotting (KWS) for informal continuous speech. We compare acoustic keyword spotting, spotting in word lattices generated by large vocabulary continuous speech recognition and a hybrid approach making use of phoneme lattices generated by a phoneme recognizer. The systems are compared on carefully defined test data extracted from ICSI meeting database. The acoustic and phoneme-lattice based KWS are based on a phoneme recognizer making use of temporal-pattern (TRAP) feature extraction and posterior estimation using neural nets. We show its superiority over traditional HMM/GMM systems. The advantages and drawbacks of different approaches are discussed.
Anglický abstrakt
This paper describes several approaches to keyword spotting (KWS) for informal continuous speech. We compare acoustic keyword spotting, spotting in word lattices generated by large vocabulary continuous speech recognition and a hybrid approach making use of phoneme lattices generated by a phoneme recognizer. The systems are compared on carefully defined test data extracted from ICSI meeting database. The acoustic and phoneme-lattice based KWS are based on a phoneme recognizer making use of temporal-pattern (TRAP) feature extraction and posterior estimation using neural nets. We show its superiority over traditional HMM/GMM systems. The advantages and drawbacks of different approaches are discussed.
Dokumenty
BibTex
@inproceedings{BUT18062,
author="Igor {Szőke} and Petr {Schwarz} and Lukáš {Burget} and Michal {Fapšo} and Martin {Karafiát} and Jan {Černocký} and Pavel {Matějka}",
title="Comparison of Keyword Spotting Approaches for Informal Continuous Speech",
annote="This paper describes several approaches to keyword spotting
(KWS) for informal continuous speech. We compare acoustic
keyword spotting, spotting in word lattices generated by large
vocabulary continuous speech recognition and a hybrid approach making
use of phoneme lattices generated by a phoneme recognizer. The
systems are compared on carefully defined test data extracted from ICSI
meeting database. The acoustic and phoneme-lattice based KWS are
based on a phoneme recognizer making use of temporal-pattern (TRAP)
feature extraction and posterior estimation using neural nets. We show
its superiority over traditional HMM/GMM systems. The advantages and
drawbacks of different approaches are discussed.",
booktitle="Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology",
chapter="18062",
journal="5th European Conference EUROSPEECH 97",
year="2005",
month="september",
pages="633--636",
type="conference paper"
}