Publication detail

BUT System Description for The Third DIHARD Speech Diarization Challenge

LANDINI, F.; LOZANO DÍEZ, A.; BURGET, L.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; GLEMBEK, O.; MATĚJKA, P.; STAFYLAKIS, T.; BRUMMER, J.

Original Title

BUT System Description for The Third DIHARD Speech Diarization Challenge

Type

article in a collection not indexed in WoS or Scopus

Language

English

Original Abstract

This is the system description corresponding to the systems developed by the BUT team for The Third DIHARD Speech Diarization Challenge. The systems for both tracks consist of a DOVER-Lap fusion of an end-to-end NN system with x-vector-based clustering systems in the form of spectral clustering and VBx. Given that the x-vector clustering systems do not provide overlapping speakers, overlapped speech is detected by a TasNet-based detector before the final fusion with the end-to-end approach.

Keywords

Speaker Diarization, DIHARD, VBx diarization, end-to-end diarization, overlapped speech detection

Authors

LANDINI, F.; LOZANO DÍEZ, A.; BURGET, L.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; GLEMBEK, O.; MATĚJKA, P.; STAFYLAKIS, T.; BRUMMER, J.

Released

23. 1. 2021

Location

on-line by LDC and University of Pennsylvania

Pages from

1

Pages to

5

Pages count

5

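URL

https://dihardchallenge.github.io/dihard3/system_descriptions/dihard3_system_description_team55.pdf
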
BibTeX

@inproceedings{BUT170909,
  author="Federico Nicolás {Landini} and Alicia {Lozano Díez} and Lukáš {Burget} and Mireia {Diez Sánchez} and Anna {Silnova} and Kateřina {Žmolíková} and Ondřej {Glembek} and Pavel {Matějka} and Themos {Stafylakis} and Johan Nikolaas Langenhoven {Brummer}",
  title="BUT System Description for The Third DIHARD Speech Diarization Challenge",
  booktitle="Proceedings available at Dihard Challenge Github",
  year="2021",
  pages="1--5",
  address="on-line by LDC and University of Pennsylvania",
  url="https://dihardchallenge.github.io/dihard3/system_descriptions/dihard3_system_description_team55.pdf"
}