Language-specific Effects on Automatic Speech Recognition Errors for World Englishes

June Choe; Yiran Chen; May Pik Yu Chan; Aini Li; Xin Gao; Nicole Holliday

COLING 2022

Abstract

Despite recent advancements in automated speech recognition (ASR) technologies, reports of unequal performance across speakers of different demographic groups abound. At the same time, the focus on performance metrics such as the Word Error Rate (WER) in prior studies limits the specificity and scope of recommendations that can be offered for system engineering to overcome these challenges. The current study bridges this gap by investigating the performance of Otter’s automatic captioning system on native and non-native English speakers of different language backgrounds through a linguistic analysis of segment-level errors. By examining language-specific error profiles for vowels and consonants motivated by linguistic theory, we find that certain categories of errors can be predicted from the phonological structure of a speaker’s native language.

Topics

Speech & Audio > Recognition > Automatic Speech Recognition

Keywords

automatic speech recognition, linguistic analysis, speech recognition error, phonological structure, world english
