Towards Building a Language-Independent Speech Scoring Assessment

Shreyansh Gupta; Abhishek Unnam; Kuldeep Yadav; Varun Aggarwal

AAAI 2024

Abstract

Automatic speech scoring is crucial in language learning: it gives learners targeted feedback by assessing pronunciation, fluency, and other speech qualities. However, the scarcity of human-labeled data for languages other than English makes such systems difficult to build. In this work, we propose a Language-Independent scoring approach that evaluates speech without relying on labeled data in the target language. We introduce a multilingual speech scoring system that uses representations from the wav2vec 2.0 XLSR model and a forced-alignment technique based on CTC-Segmentation to construct speech features. These features are used to train a machine learning model that predicts pronunciation and fluency scores. We demonstrate the potential of our method by predicting expert ratings on a speech dataset spanning five languages (English, French, Spanish, German, and Portuguese) and comparing its performance against Language-Specific models trained individually on each language, as well as against a single model trained jointly on all languages. Results indicate that our approach shows promise as an initial step towards universal, language-independent speech scoring.
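To make the idea of constructing speech features from a forced alignment more concrete, here is a minimal sketch. It assumes an aligner has already produced word-level timestamps and a per-word confidence value. The `AlignedWord` type, the `alignment_features` function, the 0.25 s pause threshold, and the specific features are all hypothetical illustrations; the abstract does not say which features the authors actually compute.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class AlignedWord:
    """One word from a forced alignment (hypothetical structure)."""
    word: str
    start: float       # start time in seconds
    end: float         # end time in seconds
    confidence: float  # assumed alignment confidence in [0, 1]


def alignment_features(words, total_duration, pause_threshold=0.25):
    """Summarise an alignment as simple fluency/pronunciation proxies.

    All feature definitions are illustrative assumptions, not the
    feature set used in the paper.
    """
    if not words or total_duration <= 0:
        raise ValueError("need at least one word and a positive duration")

    words = sorted(words, key=lambda w: w.start)
    # Silence between consecutive words; overlaps are clamped to zero.
    gaps = [max(0.0, nxt.start - cur.end) for cur, nxt in zip(words, words[1:])]
    pauses = [g for g in gaps if g >= pause_threshold]
    speech_time = sum(w.end - w.start for w in words)

    return {
        "words_per_second": len(words) / total_duration,
        "articulation_ratio": speech_time / total_duration,
        "pause_count": len(pauses),
        "mean_pause_duration": mean(pauses) if pauses else 0.0,
        "mean_confidence": mean(w.confidence for w in words),
        "min_confidence": min(w.confidence for w in words),
    }


if __name__ == "__main__":
    demo = [
        AlignedWord("the", 0.0, 0.5, 0.9),
        AlignedWord("cat", 1.0, 1.5, 0.7),
    ]
    print(alignment_features(demo, total_duration=2.0))
```

A feature dictionary of this kind could then be passed, one row per utterance, to any standard regressor that predicts expert ratings. None of these features depends on a language-specific label, which is the property a language-independent scorer needs.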

Topics

Natural Language Processing > Resources & Methods > Multilingual NLP
Speech & Audio > Recognition > Speech Recognition
Artificial Intelligence > Core AI > Language
Machine Learning > Learning Types > Multi-Lingual Learning

Keywords

language learning, multilingual speech, speech assessment, pronunciation assessment, fluency evaluation, automatic speech scoring, language independent approach, multilingual speech system
