Robust Continuous On-Device Personalization for Automatic Speech Recognition

Khe Chai Sim; Angad Chandorkar; Fan Gao; Mason Chua; Tsendsuren Munkhdalai; Françoise Beaufays

INTERSPEECH 2021

Abstract

On-device personalization of an all-neural automatic speech recognition (ASR) model can be achieved efficiently by fine-tuning the last few layers of the model. This approach has been shown to be effective for adapting the model to recognize rare named entities using only a small amount of data. To reliably perform continuous on-device learning, it is important for the training process to be completely autonomous without manual intervention. Our simulation studies show that training over many rounds may eventually lead to a significant model drift if the personalized model is indiscriminately accepted at the end of each training round. It is important to have appropriate acceptance criteria in place to guard the model against drifting. Moreover, for storage efficiency, it is desirable to persist the model weights in quantized form. We found that quantizing and dequantizing the model weights in between training rounds can prevent the model from learning effectively. This issue can be circumvented by adding noise to the quantized weights at the start of each training round.
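The quantization issue and the acceptance criteria described in the abstract can be illustrated with a small sketch. It is not the authors' implementation. The int8 grid with a fixed `scale`, the uniform noise spanning one quantization step, the error-based gate, and the names `run_round` and `accept_round` are all assumptions made for illustration. The sketch shows that a weight update smaller than half a quantization step is rounded away when the weights are persisted, whereas noise added at the start of the round lets such an update survive on average.

```python
import random

INT8_MIN, INT8_MAX = -128, 127


def quantize(weights, scale):
    """Round each float weight to the nearest int8 code on a fixed grid."""
    return [max(INT8_MIN, min(INT8_MAX, round(w / scale))) for w in weights]


def dequantize(codes, scale):
    """Map int8 codes back to float weights."""
    return [c * scale for c in codes]


def run_round(codes, scale, update, rng=None):
    """Simulate one round: load weights, optionally add noise, train, persist.

    `update` is a stand-in for the net weight change produced by training.
    Pass a `random.Random` instance as `rng` to enable the noise step.
    """
    weights = dequantize(codes, scale)
    if rng is not None:
        # Assumption: uniform noise spanning one quantization step.
        half_step = 0.5 * scale
        weights = [w + rng.uniform(-half_step, half_step) for w in weights]
    weights = [w + update for w in weights]
    return quantize(weights, scale)


def accept_round(baseline_error, candidate_error, max_regression=0.0):
    """Hypothetical acceptance gate: keep the personalized model only if its
    error on a check set has not regressed beyond the allowed tolerance."""
    return candidate_error <= baseline_error + max_regression


if __name__ == "__main__":
    scale = 0.01
    codes = [10, -20, 30]
    small_update = 0.3 * scale  # smaller than half a quantization step

    # Without noise, the update is erased by the quantize/dequantize round trip.
    print(run_round(codes, scale, small_update) == codes)

    # With noise, the stored code moves in a fraction of rounds, so the
    # average change tracks the update (about 0.3 of a step here).
    rng = random.Random(0)
    trials = 10_000
    moved = sum(run_round([10], scale, small_update, rng)[0] - 10
                for _ in range(trials))
    print(moved / trials)

    # Accept a round only when the candidate is no worse than the baseline.
    print(accept_round(baseline_error=0.20, candidate_error=0.19))
```

In this toy setting the noise behaves like stochastic rounding: each round is noisy, but small updates accumulate over many rounds instead of being discarded. The gate would be evaluated at the end of each round, and a rejected round would leave the previously stored codes in place.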

Topics

Machine Learning > Learning Types > Continual Learning
Machine Learning > Application Areas > Efficient Computing
Deep Learning > Architectures > Neural Networks
Speech & Audio > Recognition > Automatic Speech Recognition

Keywords

model quantization; continual learning; automatic speech recognition; model fine-tuning; on-device personalization; on-device learning; model personalization; continuous learning
