Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network

Zhengyang Chen; Shuai Wang; Yanmin Qian

2020 INTERSPEECH INTERSPEECH 2020

Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network

Abstract

Speaker verification systems usually suffer from large performance degradation when applied to a new dataset from a different domain. In this work, we will study the domain adaption strategy between datasets with different languages using domain adversarial training. We introduce a partially shared network based domain adversarial training architecture to learn an asymmetric mapping for source and target domain embedding extractor. This architecture can help the embedding extractor learn domain invariant feature without sacrificing the ability on speaker discrimination. When doing the evaluation on cross-lingual domain adaption, the source domain data is in English from NIST SRE04-10 and Switchboard, and the target domain data is in Cantonese and Tagalog from NIST SRE16. Our results show that the usual adversarial training mode will indeed harm the speaker discrimination when the source and target domain embedding extractors are fully shared, and in contrast the newly proposed architecture solves this problem and achieves ~25.0% relative average Equal Error Rate (EER) improvement on SRE16 Cantonese and Tagalog evaluation.

🌉 Interdisciplinary Bridge — Computer Vision and Machine Learning

🧭 Keyword Pioneer — speaker discrimination

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Zhengyang Chen , Shuai Wang , Yanmin Qian

Topics

Machine Learning > Learning Types > Adversarial Learning Machine Learning > Application Areas > Domain Adaptation Computer Vision > Analysis > Biometrics Machine Learning > Learning Types > Domain Adaptation

Keywords

adversarial learning domain adaptation adversarial training speaker verification equal error rate domain adversarial training speaker discrimination domain invariant feature

Download PDF

Related papers

Memory Controlled Sequential Self Attention for Sound Recognition 2020

Dual Attention in Time and Frequency Domain for Voice Activity Detection 2020

Automatic Prediction of Speech Intelligibility Based on X-Vectors in the Context of Head and Neck Cancer 2020

A Noise Robust Technique for Detecting Vowels in Speech Signals 2020

Joint Detection of Sentence Stress and Phrase Boundary for Prosody 2020