Publication detail

Cognitive Role of Speech Pauses and Algorithmic Consideration for their Processing

SMÉKAL, Z.; STEJSKAL, V.; ESPOSITO, A.

Original Title

Cognitive Role of Speech Pauses and Algorithmic Consideration for their Processing

Type

journal article - other

Language

en

Original Abstract

This study investigates pausing strategies, focusing the attention on empty speech pauses. A cross-modal analysis (video and audio) of spontaneous narratives produced by male and female children and adults showed that a remarkable amount of empty speech pauses was used to signal new concepts in the speech flow and to segment discourse units such as clauses and paragraphs. Based on these results, an adaptive mathematical model for pause distribution was suggested, that exploits, as pause features, the absence of signal and/or the changes of energy over different acoustic dimensions strongly related to the auditory perception. These considerations inspired the formulation and the implementation of two pause detection procedures that proved to be more effective than the Likelihood Ratio Test (LRT) and Long-Term Spectral Divergence (LTSD) algorithms recently proposed in literature and applied for Voice Activity Detection (VAD).

Keywords

Speech and empty pauses discrimination, adaptive algorithms

RIV year

2008

Released

01.06.2008

Publisher

World Scientific Publications

Location

Singapore

Pages from

1073

Pages to

1088

Pages count

16

BibTex


@article{BUT47246,
  author="Zdeněk {Smékal} and Vojtěch {Stejskal} and Anna {Esposito}",
  title="Cognitive Role of Speech Pauses and Algorithmic Consideration for their Processing",
  annote="This study investigates pausing strategies, focusing the attention on empty speech
pauses. A cross-modal analysis (video and audio) of spontaneous narratives produced by
male and female children and adults showed that a remarkable amount of empty speech
pauses was used to signal new concepts in the speech flow and to segment discourse
units such as clauses and paragraphs. Based on these results, an adaptive mathematical
model for pause distribution was suggested, that exploits, as pause features, the absence
of signal and/or the changes of energy over different acoustic dimensions strongly related
to the auditory perception. These considerations inspired the formulation and the implementation
of two pause detection procedures that proved to be more effective than the
Likelihood Ratio Test (LRT) and Long-Term Spectral Divergence (LTSD) algorithms
recently proposed in literature and applied for Voice Activity Detection (VAD).",
  address="Singapore",
  journal="International Journal of Pattern Recognition and Artificial Intelligence",
  number="5",
  volume="22",
  year="2008",
  month=jun,
  pages="1073--1088",
  publisher="World Scientific Publications",
  type="journal article - other"
}