Towards a Fault-Tolerant Speaker Verification System: A Regularization Approach to Reduce the Condition Number

Siqi Zheng; Gang Liu; Hongbin Suo; Yun Lei

INTERSPEECH 2019

Abstract

Large-scale deployment of speech interaction devices makes it possible to harvest large amounts of data quickly, but it also introduces the problem of wrong labels during data mining. Mislabeled training data has a substantial negative effect on the performance of a speaker verification system. This study aims to enhance the generalization ability and robustness of the model when the training data is contaminated by wrong labels. Several regularization approaches are proposed to reduce the condition number of the speaker verification problem, making the model less sensitive to errors in the inputs. They are validated on both the NIST SRE corpus and far-field smart speaker data. The results suggest that the performance deterioration caused by mislabeled training data can be significantly ameliorated by proper regularization.
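
The link between regularization and the condition number can be illustrated with a toy example. The sketch below is not the method proposed in the paper, whose regularizers are not described in this abstract. It assumes a 2x2 symmetric positive definite matrix standing in for an ill-conditioned problem and applies a generic ridge (Tikhonov) shift, adding `ridge` times the identity. The function names and the numeric values are hypothetical and chosen only for illustration.

```python
import math


def sym2x2_eigenvalues(a, b, d):
    """Eigenvalues of the symmetric matrix [[a, b], [b, d]], smallest first."""
    mean = (a + d) / 2.0
    radius = math.hypot((a - d) / 2.0, b)
    return mean - radius, mean + radius


def condition_number(a, b, d, ridge=0.0):
    """2-norm condition number of [[a, b], [b, d]] + ridge * I.

    Assumes the shifted matrix is positive definite, so the condition
    number is the ratio of the largest to the smallest eigenvalue.
    """
    lo, hi = sym2x2_eigenvalues(a + ridge, b, d + ridge)
    if lo <= 0:
        raise ValueError("matrix must be positive definite")
    return hi / lo


# Hypothetical, nearly singular matrix (two strongly correlated directions).
a, b, d = 1.0, 0.99, 1.0

plain = condition_number(a, b, d)              # no regularization
ridged = condition_number(a, b, d, ridge=0.1)  # ridge shift of 0.1

print(f"condition number without ridge: {plain:.2f}")
print(f"condition number with ridge:    {ridged:.2f}")
```

With these illustrative values the ridge shift lowers the condition number from about 199 to about 19, because the shift raises the smallest eigenvalue proportionally far more than the largest one. A lower condition number means that a perturbation of the inputs, such as the effect of a wrong label, is amplified less in the solution.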

Authors

Siqi Zheng, Gang Liu, Hongbin Suo, Yun Lei

Topics

Machine Learning > Core Methods > Classification
Machine Learning > Core Methods > Metric Learning
Machine Learning > Optimization & Theory > Optimization

Keywords

robust optimization, speaker verification, fault tolerance, condition number, mislabeled data
