Word Error Rate Estimation for Speech Recognition: e-WER

Ahmed Ali; Steve Renals

ACL 2018

Abstract

Measuring the performance of automatic speech recognition (ASR) systems requires manually transcribed data to compute the word error rate (WER), and producing such transcriptions is often time-consuming and expensive. In this paper, we propose a novel approach to estimating WER, called e-WER, which does not require a gold-standard transcription of the test set. Our e-WER framework uses a comprehensive set of features: ASR recognised text, character recognition results to complement the recognition output, and internal decoder features. We report results for two feature sets, black-box and glass-box, on 24 unseen Arabic broadcast programs. Our system achieves a WER root mean squared error (RMSE) of 16.9% across 1,400 sentences. The estimated overall WER (e-WER) was 25.3% for the three-hour test set, while the actual WER was 28.5%.
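The two metrics named in the abstract can be illustrated with a minimal sketch. It uses only the standard definitions: WER as the word-level edit distance divided by the reference length, and RMSE between per-sentence estimated and actual WER values. It does not reproduce the e-WER estimator itself, and the function names `word_errors`, `wer`, and `rmse` are hypothetical names chosen for illustration.

```python
from math import sqrt


def word_errors(reference, hypothesis):
    """Return (word-level edit distance, number of reference words)."""
    ref, hyp = reference.split(), hypothesis.split()
    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1], len(ref)


def wer(reference, hypothesis):
    """Standard WER: (substitutions + deletions + insertions) / reference length."""
    errors, n_ref = word_errors(reference, hypothesis)
    if n_ref == 0:
        raise ValueError("reference must contain at least one word")
    return errors / n_ref


def rmse(estimated, actual):
    """Root mean squared error between two equal-length sequences."""
    if len(estimated) != len(actual) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(e - a) ** 2 for e, a in zip(estimated, actual)]
    return sqrt(sum(squared) / len(squared))
```

In this sketch, `wer` needs a gold-standard reference, which is exactly the requirement an estimator such as e-WER aims to remove; `rmse` would then compare the estimator's per-sentence predictions against WER values computed from references on a held-out set.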

Topics

Machine Learning > Application Areas > Risk Management

Speech & Audio > Recognition > Speech Recognition

Natural Language Processing > Applications > Speech Recognition

Keywords

feature extraction, automatic speech recognition, evaluation metric, word error rate, error estimation, performance estimation, root mean squared error
