2020
INTERSPEECH
INTERSPEECH 2020
Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification
Abstract
State-of-the-art speaker verification models are based on deep learning techniques, which heavily depend on the hand-designed neural architectures from experts or engineers. We borrow the idea of neural architecture search (NAS) for the text-independent speaker verification task. As NAS can learn deep network structures automatically, we introduce the NAS conception into the well-known x-vector network. Furthermore, this paper proposes an evolutionary algorithm enhanced neural architecture search method called Auto-Vector to automatically discover promising networks for the speaker verification task. The experimental results demonstrate our NAS-based model outperforms state-of-the-art speaker verification models.
🌉
Interdisciplinary Bridge
— Computer Vision and Deep Learning and Machine Learning and Speech & Audio
🧭
Keyword Pioneer
— x-vector network
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio