Detail publikace

Speaker dependent changes in formants based on normalization of vowel triangle

Originální název

Speaker dependent changes in formants based on normalization of vowel triangle

Anglický název

Speaker dependent changes in formants based on normalization of vowel triangle

Jazyk

en

Originální abstrakt

In smoothed LPC speech signal spectrum, the local peaks are called formants. Their frequency and level depend on three basic aspects, namely on spoken phoneme, individual speaker as well as emotional state of speaker. This paper is focused on speaker analysis based on comparison of vowel formants with their normalized values. The novel method described here uses for formant normalization statistical values of formants presented in the form of formant planes. Formant normalization is made by average formant position given for Czech vowels. For each speaker the center of gravity of two vowel triangles are calculated; for the small vowel triangle created by /a/, /e/, /o/ vowels and for the big vowel triangle created by /a/, /i/, /u/ vowels. The differences between the individual and normalized center of gravity for both vowel triangles are represented as resulting vectors. Parameters of these vectors were analyzed in more details for statistical evaluation for a set of 12 male speakers. Best results were achieved for the /a/, /e/, /o/ triangle normalized by average formant position in the F2-F3 plane, where the vector angle and the vector length were totally different within analyzed speakers.

Anglický abstrakt

In smoothed LPC speech signal spectrum, the local peaks are called formants. Their frequency and level depend on three basic aspects, namely on spoken phoneme, individual speaker as well as emotional state of speaker. This paper is focused on speaker analysis based on comparison of vowel formants with their normalized values. The novel method described here uses for formant normalization statistical values of formants presented in the form of formant planes. Formant normalization is made by average formant position given for Czech vowels. For each speaker the center of gravity of two vowel triangles are calculated; for the small vowel triangle created by /a/, /e/, /o/ vowels and for the big vowel triangle created by /a/, /i/, /u/ vowels. The differences between the individual and normalized center of gravity for both vowel triangles are represented as resulting vectors. Parameters of these vectors were analyzed in more details for statistical evaluation for a set of 12 male speakers. Best results were achieved for the /a/, /e/, /o/ triangle normalized by average formant position in the F2-F3 plane, where the vector angle and the vector length were totally different within analyzed speakers.

BibTex


@inproceedings{BUT98860,
  author="Miroslav {Staněk} and Milan {Sigmund}",
  title="Speaker dependent changes in formants based on normalization of vowel triangle",
  annote="In smoothed LPC speech signal spectrum, the local peaks are called formants. Their frequency and level depend on three basic aspects, namely on spoken phoneme, individual speaker as well as emotional state of speaker. This paper is focused on speaker analysis based on comparison of vowel formants with their normalized values. The novel method described here uses for formant normalization statistical values of formants presented in the form of formant planes. Formant normalization is made by average formant position given for Czech vowels. For each speaker the center of gravity of two vowel triangles are calculated; for the small vowel triangle created by /a/, /e/, /o/ vowels and for the big vowel triangle created by /a/, /i/, /u/ vowels. The differences between the individual and normalized center of gravity for both vowel triangles are represented as resulting vectors. Parameters of these vectors were analyzed in more details for statistical evaluation for a set of 12 male speakers. Best results were achieved for the /a/, /e/, /o/ triangle normalized by average formant position in the F2-F3 plane, where the vector angle and the vector length were totally different within analyzed speakers.",
  booktitle="Proceedings of 23th International Conference RADIOELEKTRONIKA 2013",
  chapter="98860",
  howpublished="print",
  year="2013",
  month="april",
  pages="337--341",
  type="conference paper"
}