Publication detail

Automatic Language Identification using Phoneme and Automatically Derived Unit Strings

MATĚJKA, P., SZŐKE, I., SCHWARZ, P., ČERNOCKÝ, J.

Original title

Automatic Language Identification using Phoneme and Automatically Derived Unit Strings

Type

journal article - other, Jost

Language

English

Original abstract

This paper presents language identification (LID) based on phonotactic modeling. Approaches using phoneme strings and strings of units automatically derived by an Ergodic HMM (EHMM) are compared. The phoneme recognizers were trained on 6 languages from the OGI multi-language corpus and on the Czech SpeechDat-E. The LID results are obtained on 4 languages. The results show the superiority of the Czech phoneme recognizer when used for LID, and promising trends for the EHMM-derived units.

Keywords

language identification, phoneme recognizer, speech processing, ergodic hidden Markov model

Authors

MATĚJKA, P., SZŐKE, I., SCHWARZ, P., ČERNOCKÝ, J.

RIV year

2004

Published

8 September 2004

Publisher

Springer

ISSN

0302-9743

Periodical

Lecture Notes in Computer Science

Volume

2004

Number

3206

Country

Federal Republic of Germany

Pages from

147

Pages to

154

Number of pages

8

URL

BibTeX

@article{BUT45377,
  author="Pavel {Matějka} and Igor {Szőke} and Petr {Schwarz} and Jan {Černocký}",
  title="Automatic Language Identification using Phoneme and Automatically Derived Unit Strings",
  journal="Lecture Notes in Computer Science",
  year="2004",
  volume="2004",
  number="3206",
  pages="147--154",
  issn="0302-9743",
  url="http://www.springerlink.com/index/CUFLYEGQA8W1LNBE"
}