Fast Variational Bayes for Heavy-tailed PLDA Applied to i-vectors and x-vectors

Anna Silnova; Niko Brümmer; Daniel Garcia-Romero; David Snyder; Lukas Burget

2018 INTERSPEECH INTERSPEECH 2018

Fast Variational Bayes for Heavy-tailed PLDA Applied to i-vectors and x-vectors

Abstract

The standard state-of-the-art backend for text-independent speaker recognizers that use i-vectors or x-vectors is Gaussian PLDA (G-PLDA), assisted by a Gaussianization step involving length normalization. G-PLDA can be trained with both gener- ative or discriminative methods. It has long been known that heavy-tailed PLDA (HT-PLDA), applied without length nor- malization, gives similar accuracy, but at considerable extra computational cost. We have recently introduced a fast scor- ing algorithm for a discriminatively trained HT-PLDA back- end. This paper extends that work by introducing a fast, vari- ational Bayes, generative training algorithm. We compare old and new backends, with and without length-normalization, with i-vectors and x-vectors, on SRE’10, SRE’16 and SITW.

🐣 Hot Topic Early Bird — representation learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

🌉 Interdisciplinary Bridge — Machine Learning and Speech & Audio

Authors

Anna Silnova , Niko Brümmer , Daniel Garcia-Romero , David Snyder , Lukas Burget

Topics

Machine Learning > Core Methods > Classification Machine Learning > Core Methods > Embedding Learning Machine Learning > Optimization & Theory > Bayesian Inference Speech & Audio > Recognition > Speaker Recognition Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling Machine Learning > Bayesian & Probabilistic > Variational Inference

Keywords

representation learning variational inference speaker verification speaker recognition variational baye probabilistic linear discriminant analysis

Download PDF

Related papers

HoloCompanion: An MR Friend for EveryOne 2018

Estimation of the Vocal Tract Length of Vowel Sounds Based on the Frequency of the Significant Spectral Valley 2018

Deep Learning Techniques for Koala Activity Detection 2018

An Exploration of Local Speaking Rate Variations in Mandarin Read Speech 2018

Acoustic Analysis of Whispery Voice Disguise in Mandarin Chinese 2018