Detail publikace

BUT system for DIHARD Speech Diarization Challenge 2018

DIEZ SÁNCHEZ, M. LANDINI, F. BURGET, L. ROHDIN, J. SILNOVA, A. ŽMOLÍKOVÁ, K. NOVOTNÝ, O. VESELÝ, K. GLEMBEK, O. PLCHOT, O. MOŠNER, L. MATĚJKA, P.

Originální název

BUT system for DIHARD Speech Diarization Challenge 2018

Typ

článek ve sborníku ve WoS nebo Scopus

Jazyk

angličtina

Originální abstrakt

This paper presents the approach developed by the BUT team for the first DIHARD speech diarization challenge, which is based on our Bayesian Hidden Markov Model with eigenvoice priors system. Besides the description of the approach, we provide a brief analysis of different techniques and data processing methods tested on the development set. We also introduce a simple attempt for overlapped speech detection that we used for attaining cleaner speaker models and reassigning overlapped speech to multiple speakers. Finally, we present results obtained on the evaluation set and discuss findings we made during the development phase and with the help of the DIHARD leaderboard feedback.

Klíčová slova

Speaker Diarization, Variational Bayes, HMM, i-vector, x-vector, Overlapped speech, DIHARD

Autoři

DIEZ SÁNCHEZ, M.; LANDINI, F.; BURGET, L.; ROHDIN, J.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; NOVOTNÝ, O.; VESELÝ, K.; GLEMBEK, O.; PLCHOT, O.; MOŠNER, L.; MATĚJKA, P.

Vydáno

2. 9. 2018

Nakladatel

International Speech Communication Association

Místo

Hyderabad

ISSN

1990-9772

Periodikum

Proceedings of Interspeech

Ročník

2018

Číslo

9

Stát

Francouzská republika

Strany od

2798

Strany do

2802

Strany počet

5

URL

BibTex

@inproceedings{BUT155100,
  author="Mireia {Diez Sánchez} and Federico Nicolás {Landini} and Lukáš {Burget} and Johan Andréas {Rohdin} and Anna {Silnova} and Kateřina {Žmolíková} and Ondřej {Novotný} and Karel {Veselý} and Ondřej {Glembek} and Oldřich {Plchot} and Ladislav {Mošner} and Pavel {Matějka}",
  title="BUT system for DIHARD Speech Diarization Challenge 2018",
  booktitle="Proceedings of Interspeech 2018",
  year="2018",
  journal="Proceedings of Interspeech",
  volume="2018",
  number="9",
  pages="2798--2802",
  publisher="International Speech Communication Association",
  address="Hyderabad",
  doi="10.21437/Interspeech.2018-1749",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1749.html"
}