Privacy-Preserving Siamese Feature Extraction for Gender Recognition versus Speaker Identification

Alexandru Nelus; Silas Rech; Timm Koppelmann; Henrik Biermann; Rainer Martin

2019 INTERSPEECH INTERSPEECH 2019

Privacy-Preserving Siamese Feature Extraction for Gender Recognition versus Speaker Identification

Abstract

In this paper we propose a deep neural-network-based feature extraction scheme with the purpose of reducing the privacy risks encountered in speaker classification tasks. For this we choose a challenging scenario where we wish to perform gender recognition but at the same time prevent an attacker who has intercepted the features to perform speaker identification. Our approach is to employ Siamese training in order to obtain a feature representation that minimizes the Euclidean distance between same gender speakers while maximizing it for different gender speakers. It is experimentally shown that the obtained effect is of anonymizing speakers from the same gender class and thus drastically reducing privacy risks while still permitting class discrimination with a higher accuracy than other previously investigated methods.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Alexandru Nelus , Silas Rech , Timm Koppelmann , Henrik Biermann , Rainer Martin

Topics

Machine Learning > Core Methods > Metric Learning Machine Learning > Application Areas > Privacy

Keywords

metric learning feature extraction siamese network speaker identification gender recognition

Download PDF

Related papers

Using Real-Time Visual Biofeedback for Second Language Instruction 2019

VAE-Based Regularization for Deep Speaker Embedding 2019

End-to-End SpeakerBeam for Single Channel Target Speech Recognition 2019

Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition 2019

Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile 2019