INTERSPEECH 2022

NeMo Open Source Speaker Diarization System

Abstract

We introduce an open-source speaker diarization system that is part of the NeMo conversational AI toolkit. During the Show and Tell session, we will present an interactive system that demonstrates both online and offline speaker diarization. Attendees will be able to test the system by recording their own voices. We believe the demo session will be an excellent opportunity to learn and experience how a speaker diarization system can be implemented for real-life applications using an open-source toolkit.
