Publication detail

Phonotactic Language Identification using High Quality Phoneme Recognition

Pavel Matejka, Petr Schwarz, Jan Cernocky, Pavel Chytil

Original title

Type

Paper in proceedings indexed in WoS or Scopus

Language

English

Original abstract

Phoneme Recognizers followed by Language Modeling (PRLM) have consistently yielded top performance in the language identification (LID) task. Arranging PRLMs in parallel (PPRLM) improves performance further. Since the tokenizer is the most important part of an LID system, a high-quality phoneme recognizer is employed. Two different multilingual databases for training phoneme recognizers are compared, and the amount of training data needed is studied. Results are reported on data from the NIST 2003 LID evaluation. Our four PRLM systems have an Equal Error Rate (EER) of 2.4% on the 12-language task. This result compares favorably with the best known result on this task.

Keywords in English

language identification, phoneme recognition, phonotactic model

Authors

Pavel Matejka, Petr Schwarz, Jan Cernocky, Pavel Chytil

Published

8 April 2005

Place

Lisbon, Portugal

Pages from

Pages to

Number of pages

URL

http://www.fit.vutbr.cz/~matejkap/publi/2005/eurospeech2005.pdf

BibTeX

@inproceedings{BUT17756,
  author="Pavel {Matějka} and Petr {Schwarz} and Jan {Černocký} and Pavel {Chytil}",
  title="Phonotactic Language Identification using High Quality Phoneme Recognition",
  booktitle="submitted to Eurospeech 2005",
  year="2005",
  volume="2005",
  pages="4",
  address="Lisbon, Portugal",
  url="http://www.fit.vutbr.cz/~matejkap/publi/2005/eurospeech2005.pdf"
}
