Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification

Xiaoyang Qu; Jianzong Wang; Jing Xiao

INTERSPEECH 2020

Abstract

State-of-the-art speaker verification models are based on deep learning and depend heavily on neural architectures hand-designed by experts or engineers. We apply neural architecture search (NAS) to the text-independent speaker verification task. Because NAS can learn deep network structures automatically, we introduce the NAS concept into the well-known x-vector network. Furthermore, we propose an evolutionary-algorithm-enhanced NAS method, called Auto-Vector, to automatically discover promising networks for speaker verification. Experimental results show that our NAS-based model outperforms state-of-the-art speaker verification models.
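To make the idea of an evolutionary architecture search concrete, the following is a minimal sketch of a generic tournament-selection search loop. It is not the Auto-Vector method itself, whose details are not given in the abstract. The search space (per-layer context width and channel count), the names `random_architecture`, `mutate`, `toy_fitness`, and `evolve`, and all hyperparameters are assumptions made for illustration. In a real setting the fitness function would train each candidate network and score it on a development set, for example by negative equal error rate.

```python
import random

# Hypothetical search space (an assumption, not taken from the paper):
# each layer chooses a temporal context width and a channel count.
CONTEXTS = [1, 3, 5, 7]
CHANNELS = [128, 256, 512]
NUM_LAYERS = 5


def random_architecture(rng):
    """Sample one candidate: a list of (context, channels) pairs."""
    return [(rng.choice(CONTEXTS), rng.choice(CHANNELS)) for _ in range(NUM_LAYERS)]


def mutate(arch, rng):
    """Return a copy of arch with one randomly chosen layer resampled."""
    child = list(arch)
    i = rng.randrange(len(child))
    child[i] = (rng.choice(CONTEXTS), rng.choice(CHANNELS))
    return child


def toy_fitness(arch):
    """Placeholder score (higher is better).

    Stands in for "train the candidate and measure verification quality";
    here it simply prefers context 5 and 256 channels in every layer.
    """
    return -sum(abs(ctx - 5) + abs(ch - 256) / 128 for ctx, ch in arch)


def evolve(fitness, generations=50, population_size=10, sample_size=3, seed=0):
    """Tournament-selection evolutionary search; returns (best_score, best_arch)."""
    rng = random.Random(seed)
    scored = []
    for _ in range(population_size):
        arch = random_architecture(rng)
        scored.append((fitness(arch), arch))

    for _ in range(generations):
        # Pick the fittest of a small random sample as the parent.
        tournament = rng.sample(scored, sample_size)
        parent = max(tournament, key=lambda pair: pair[0])[1]
        child = mutate(parent, rng)
        child_score = fitness(child)
        # Replace the current worst member if the child is at least as good.
        worst = min(range(len(scored)), key=lambda i: scored[i][0])
        if child_score >= scored[worst][0]:
            scored[worst] = (child_score, child)

    return max(scored, key=lambda pair: pair[0])


if __name__ == "__main__":
    best_score, best_arch = evolve(toy_fitness)
    print(best_score, best_arch)
```

Because the worst member is replaced only by a child that scores at least as well, the best score in the population never decreases from one generation to the next. Other replacement rules, such as removing the oldest member, are also common in evolutionary NAS.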

Authors

Xiaoyang Qu, Jianzong Wang, Jing Xiao

Topics

Deep Learning > Techniques > Model Architecture
Computer Vision > Analysis > Biometrics
Speech & Audio > Recognition > Speaker Recognition
Machine Learning > Learning Types > Meta-Learning
Deep Learning > Optimization & Theory > Neural Network Optimization

Keywords

network architecture, neural architecture search, speaker verification, evolutionary algorithm, x-vector network

Related papers

Memory Controlled Sequential Self Attention for Sound Recognition 2020

Dual Attention in Time and Frequency Domain for Voice Activity Detection 2020

Automatic Prediction of Speech Intelligibility Based on X-Vectors in the Context of Head and Neck Cancer 2020

A Noise Robust Technique for Detecting Vowels in Speech Signals 2020

Joint Detection of Sentence Stress and Phrase Boundary for Prosody 2020