Gain Compensation for Fast i-Vector Extraction Over Short Duration

Kong Aik Lee; Haizhou Li

INTERSPEECH 2017

Abstract

The i-vector is widely described as a compact and effective representation of speech utterances for speaker recognition. Standard i-vector extraction can be expensive for applications where computing resources are limited, for instance, on handheld devices. Fast approximate inference of i-vectors aims to reduce the computational cost of i-vector extraction where the run-time requirement is critical. Most fast approaches hinge on certain assumptions to approximate the i-vector inference formulae with little loss of accuracy. In this paper, we analyze the uniform assumption that we proposed earlier. We show that the assumption generally holds for long utterances but is inadequate for utterances of short duration. We then propose to compensate for the negative effects by applying a simple gain factor to the i-vectors estimated from short utterances. The assertion is confirmed through analysis and experiments conducted on the NIST SRE’08 and SRE’10 datasets.

Topics

Machine Learning > Core Methods > Representation Learning

Machine Learning > Optimization & Theory > Bayesian Inference

Speech & Audio > Recognition > Speaker Recognition

Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling

Keywords

probabilistic modeling, probabilistic inference, speaker verification, speaker recognition, i-vector extraction, short duration speech, gain compensation, short utterance
