Certification of Speaker Recognition Models to Additive Perturbations

Dmitrii Korzh; Elvir Karimov; Mikhail Pautov; Oleg Y. Rogov; Ivan Oseledets

2025 AAAI AAAI 2025

Certification of Speaker Recognition Models to Additive Perturbations

Abstract

Abstract Speaker recognition technology is applied to various tasks, from personal virtual assistants to secure access systems. However, the robustness of these systems against adversarial attacks, particularly to additive perturbations, remains a significant challenge. In this paper, we pioneer applying robustness certification techniques to speaker recognition, initially developed for the image domain. Our work covers this gap by transferring and improving randomized smoothing certification techniques against norm-bounded additive perturbations for classification and few-shot learning tasks to speaker recognition. We demonstrate the effectiveness of these methods on VoxCeleb 1 and 2 datasets for several models. We expect this work to improve the robustness of voice biometrics and accelerate the research of certification methods in the audio domain.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Speech & Audio

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Dmitrii Korzh , Elvir Karimov , Mikhail Pautov , Oleg Y. Rogov , Ivan Oseledets

Topics

Artificial Intelligence > Core AI > Causal Inference Machine Learning > Learning Types > Adversarial Learning Machine Learning > Optimization & Theory > Learning Theory Speech & Audio > Recognition > Speaker Recognition Speech & Audio > Analysis > Speaker Verification Artificial Intelligence > Core AI > Adversarial Learning Artificial Intelligence > Core AI > Safety

Keywords

adversarial robustness speaker recognition randomized smoothing robustness certification voice biometrics additive perturbation

Download PDF

Related papers

BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving 2025

APIRL: Deep Reinforcement Learning for REST API Fuzzing 2025

Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation 2025

3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Detection 2025

Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics 2025