Requirements and Motivations of Low-Resource Speech Synthesis for Language Revitalization

Aidan Pine; Dan Wells; Nathan Brinklow; Patrick Littell; Korin Richmond

2022 ACL ACL 2022

Requirements and Motivations of Low-Resource Speech Synthesis for Language Revitalization

Abstract

AbstractThis paper describes the motivation and development of speech synthesis systems for the purposes of language revitalization. By building speech synthesis systems for three Indigenous languages spoken in Canada, Kanien’kéha, Gitksan & SENĆOŦEN, we re-evaluate the question of how much data is required to build low-resource speech synthesis systems featuring state-of-the-art neural models. For example, preliminary results with English data show that a FastSpeech2 model trained with 1 hour of training data can produce speech with comparable naturalness to a Tacotron2 model trained with 10 hours of data. Finally, we motivate future research in evaluation and classroom integration in the field of speech synthesis for language revitalization.

🌉 Interdisciplinary Bridge — Deep Learning and Interdisciplinary and Speech & Audio

🧭 Keyword Pioneer — low-resource speech synthesis

🐣 Hot Topic Early Bird — indigenous language

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio