Fine-Tuning ASR models for Very Low-Resource Languages: A Study on Mvskoke

Julia Mainzinger; Gina-Anne Levow

2024 ACL ACL 2024

Fine-Tuning ASR models for Very Low-Resource Languages: A Study on Mvskoke

Abstract

AbstractRecent advancements in multilingual models for automatic speech recognition (ASR) have been able to achieve a high accuracy for languages with extremely limited resources. This study examines ASR modeling for the Mvskoke language, an indigenous language of America. The parameter efficiency of adapter training is contrasted with training entire models, and it is demonstrated how performance varies with different amounts of data. Additionally, the models are evaluated with trigram language model decoding, and the outputs are compared across different types of speech recordings. Results show that training an adapter is both parameter efficient and gives higher accuracy for a relatively small amount of data.

🌉 Interdisciplinary Bridge — Natural Language Processing and Speech & Audio

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Julia Mainzinger , Gina-Anne Levow

Topics

Natural Language Processing > Resources & Methods > Multilingual NLP Speech & Audio > Recognition > Automatic Speech Recognition Machine Learning > Learning Paradigms > Few-Shot Learning Machine Learning > Learning Paradigms > Transfer Learning Machine Learning > Learning Types > Transfer Learning Artificial Intelligence > Core AI > Speech Recognition

Keywords

speech recognition language model adaptation automatic speech recognition low-resource language language model parameter efficiency multilingual model adapter training

Download PDF

Related papers

Reinforcement Learning-Driven LLM Agent for Automated Attacks on LLMs 2024

EtymoLink: A Structured English Etymology Dataset 2024

Turkish Delights: A Dataset on Turkish Euphemisms 2024

Subjectivity Detection in English News using Large Language Models 2024

Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better 2024