Publication detail

Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge

ALAM, J.; BOULIANNE, G.; BURGET, L.; DAHMANE, M.; DIEZ SÁNCHEZ, M.; GLEMBEK, O.; LALONDE, M.; LOZANO DÍEZ, A.; MATĚJKA, P.; MIZERA, P.; MOŠNER, L.; NOISEUX, C.; MONTEIRO, J.; NOVOTNÝ, O.; PLCHOT, O.; ROHDIN, J.; SILNOVA, A.; SLAVÍČEK, J.; STAFYLAKIS, T.; ST-CHARLES, P.; WANG, S.; ZEINALI, H.

Original Title

Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge

English Title

Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge

Type

conference paper

Language

en

Original Abstract

We present a condensed description and analysis of the joint submission of the ABC team (BUT, CRIM, Phonexia, Omilia and UAM) to NIST SRE 2019. We concentrate on the challenges that arose during development and analyze the results obtained on the evaluation data and on our development sets. The conversational telephone speech (CMN2) condition is challenging for current state-of-the-art systems, mainly due to the language mismatch between training and test data. We show that a combination of adversarial domain adaptation, backend adaptation and score normalization can mitigate this mismatch. On the VAST condition, we demonstrate the importance of deploying diarization when dealing with multi-speaker utterances and the drastic improvements that can be obtained by combining audio and visual modalities.

English abstract

We present a condensed description and analysis of the joint submission of the ABC team (BUT, CRIM, Phonexia, Omilia and UAM) to NIST SRE 2019. We concentrate on the challenges that arose during development and analyze the results obtained on the evaluation data and on our development sets. The conversational telephone speech (CMN2) condition is challenging for current state-of-the-art systems, mainly due to the language mismatch between training and test data. We show that a combination of adversarial domain adaptation, backend adaptation and score normalization can mitigate this mismatch. On the VAST condition, we demonstrate the importance of deploying diarization when dealing with multi-speaker utterances and the drastic improvements that can be obtained by combining audio and visual modalities.
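The abstract names score normalization as one of the ingredients that mitigates the train/test language mismatch. As a minimal, hedged sketch of the general idea (not necessarily the exact variant used in the ABC submission), symmetric score normalization (S-norm) rescales a raw trial score using the statistics of the enrollment and test sides against a cohort of impostor utterances; the function and cohort names below are illustrative:

```python
import statistics


def s_norm(raw_score, enroll_cohort_scores, test_cohort_scores):
    """Symmetric score normalization (S-norm) sketch.

    raw_score            -- verification score of the trial (enroll vs. test)
    enroll_cohort_scores -- scores of the enrollment model against a cohort
    test_cohort_scores   -- scores of the test utterance against a cohort

    The raw score is z-normalized separately with the enrollment-side and
    test-side cohort statistics, and the two normalized scores are averaged.
    """
    mu_e = statistics.mean(enroll_cohort_scores)
    sd_e = statistics.stdev(enroll_cohort_scores)
    mu_t = statistics.mean(test_cohort_scores)
    sd_t = statistics.stdev(test_cohort_scores)
    return 0.5 * ((raw_score - mu_e) / sd_e + (raw_score - mu_t) / sd_t)
```

In practice the cohort is often restricted to the top-scoring impostors per trial side (adaptive S-norm), which makes the normalization more robust to domain shift; this sketch uses the full cohort for simplicity.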

Keywords

speaker verification, NIST SRE, CMN, VAST, system fusion

Released

01.11.2020

Publisher

International Speech Communication Association

Location

Tokyo

Pages from

289

Pages to

295

Pages count

7

URL

BibTeX


@inproceedings{BUT164070,
  author="Jahangir {Alam} and Gilles {Boulianne} and Lukáš {Burget} and Mireia {Diez Sánchez} and Ondřej {Glembek} and Alicia {Lozano Díez} and Pavel {Matějka} and Ladislav {Mošner} and Ondřej {Novotný} and Oldřich {Plchot} and Johan Andréas {Rohdin} and Anna {Silnova} and Themos {Stafylakis} and Shuai {Wang} and Hossein {Zeinali}",
  title="Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge",
  annote="We present a condensed description and analysis of the joint submission of the
ABC team (BUT, CRIM, Phonexia, Omilia and UAM) to NIST SRE 2019. We concentrate
on the challenges that arose during development and analyze the results obtained
on the evaluation data and on our development sets. The conversational telephone
speech (CMN2) condition is challenging for current state-of-the-art systems,
mainly due to the language mismatch between training and test data. We show that
a combination of adversarial domain adaptation, backend adaptation and score
normalization can mitigate this mismatch. On the VAST condition, we demonstrate
the importance of deploying diarization when dealing with multi-speaker
utterances and the drastic improvements that can be obtained by combining audio
and visual modalities.",
  address="Tokyo",
  booktitle="Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop",
  chapter="164070",
  doi="10.21437/Odyssey.2020-41",
  howpublished="online",
  institution="International Speech Communication Association",
  year="2020",
  month="November",
  pages="289--295",
  publisher="International Speech Communication Association",
  type="conference paper"
}