Continual Learning for Fake Audio Detection

Haoxin Ma; Jiangyan Yi; Jianhua Tao; Ye Bai; Zhengkun Tian; Chenglong Wang

2021 INTERSPEECH INTERSPEECH 2021

Continual Learning for Fake Audio Detection

Abstract

Fake audio attack becomes a major threat to the speaker verification system. Although current detection approaches have achieved promising results on dataset-specific scenarios, they encounter difficulties on unseen spoofing data. Fine-tuning and retraining from scratch have been applied to incorporate new data. However, fine-tuning leads to performance degradation on previous data. Retraining takes a lot of time and computation resources. Besides, previous data are unavailable due to privacy in some situations. To solve the above problems, this paper proposes detecting fake without forgetting, a continual-learning-based method, to make the model learn new spoofing attacks incrementally. A knowledge distillation loss is introduced to loss function to preserve the memory of original model. Supposing the distribution of genuine voice is consistent among different scenarios, an extra embedding similarity loss is used as another constraint to further do a positive sample alignment. Experiments are conducted on the ASVspoof2019 dataset. The results show that our proposed method outperforms fine-tuning by the relative reduction of average equal error rate up to 81.62%.

🧭 Keyword Pioneer — fake audio detection

🐣 Hot Topic Early Bird — continual learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

🌉 Interdisciplinary Bridge — Machine Learning and Speech & Audio

Authors

Haoxin Ma , Jiangyan Yi , Jianhua Tao , Ye Bai , Zhengkun Tian , Chenglong Wang

Topics

Machine Learning > Learning Types > Continual Learning Machine Learning > Application Areas > Knowledge Distillation Speech & Audio > Analysis > Speaker Verification Machine Learning > Learning Types > Knowledge Distillation Machine Learning > Learning Paradigms > Continual Learning

Keywords

continual learning catastrophic forgetting knowledge distillation spoofing detection speaker verification equal error rate fake audio detection

Download PDF

Related papers

Energy-Friendly Keyword Spotting System Using Add-Based Convolution 2021

Dialogue Situation Recognition for Everyday Conversation Using Multimodal Information 2021

Using Games to Augment Corpora for Language Recognition and Confusability 2021

A Psychology-Driven Computational Analysis of Political Interviews 2021

The 2020 Personalized Voice Trigger Challenge: Open Datasets, Evaluation Metrics, Baseline System and Results 2021