INTERSPEECH 2020
VoiceID on the Fly: A Speaker Recognition System that Learns from Scratch
Abstract
We propose a novel AI framework for real-time multi-speaker recognition that requires no prior registration or pretraining: the system learns to identify speakers on the fly. We consider the practical problem of online learning with episodically revealed rewards and introduce a solution based on semi-supervised and self-supervised learning methods, implemented in a web-based application (https://www.baihan.nyc/viz/VoiceID/).
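To make the setting concrete, here is a minimal sketch of online learning where feedback is revealed only on some steps. It is an illustration under stated assumptions, not the method described in the paper: the class `OnlineCentroidSpeakerID`, the nearest-centroid rule, the 2-D toy embeddings, and the `reveal_prob` parameter are all hypothetical. When a label is revealed, the model updates that speaker's centroid; when it is not, the model self-trains on its own prediction.

```python
import numpy as np


class OnlineCentroidSpeakerID:
    """Hypothetical online nearest-centroid classifier (not the paper's model)."""

    def __init__(self):
        self.centroids = {}  # speaker label -> running mean embedding
        self.counts = {}     # speaker label -> number of updates so far

    def predict(self, x):
        """Return the label of the nearest centroid, or None if nothing is learned."""
        if not self.centroids:
            return None
        return min(self.centroids,
                   key=lambda s: np.linalg.norm(x - self.centroids[s]))

    def _update(self, label, x):
        # Incremental mean update for one speaker's centroid.
        n = self.counts.get(label, 0)
        mean = self.centroids.get(label, np.zeros_like(x))
        self.centroids[label] = (n * mean + x) / (n + 1)
        self.counts[label] = n + 1

    def step(self, x, revealed_label=None):
        """Predict first, then learn from revealed feedback or from the guess."""
        x = np.asarray(x, dtype=float)
        guess = self.predict(x)
        if revealed_label is not None:
            self._update(revealed_label, x)   # supervised step
        elif guess is not None:
            self._update(guess, x)            # self-training step
        return guess


def run_demo(seed=0, steps=200, reveal_prob=0.2):
    """Simulate two toy speakers; labels are revealed with probability reveal_prob."""
    rng = np.random.default_rng(seed)
    true_means = {"alice": np.array([0.0, 0.0]), "bob": np.array([5.0, 5.0])}
    model = OnlineCentroidSpeakerID()
    correct = 0
    for _ in range(steps):
        speaker = str(rng.choice(list(true_means)))
        x = true_means[speaker] + rng.normal(scale=0.5, size=2)
        label = speaker if rng.random() < reveal_prob else None
        correct += int(model.step(x, label) == speaker)
    return correct / steps  # fraction of steps predicted correctly
```

Calling `run_demo()` returns the running accuracy on the simulated stream. The model starts with no speakers at all, so early predictions are `None` until the first label is revealed.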
Interdisciplinary Bridge: Artificial Intelligence, Machine Learning, and Mathematics & Optimization
Cross-Pollinator: Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Artificial Intelligence > Learning Paradigms > Few-Shot Learning
Machine Learning > Learning Types > Self-Supervised Learning
Machine Learning > Learning Types > Semi-Supervised Learning
Machine Learning > Learning Types > Unsupervised Learning
Mathematics & Optimization > Optimization > Online Algorithms