Improving the Lwazi ASR Baseline

Charl van Heerden; Neil Kleynhans; Marelie Davel

2016 INTERSPEECH INTERSPEECH 2016

Improving the Lwazi ASR Baseline

Abstract

We investigate the impact of recent advances in speech recognition techniques for under-resourced languages. Specifically, we review earlier results published on the Lwazi ASR corpus of South African languages, and experiment with additional acoustic modeling approaches. We demonstrate large gains by applying current state-of-the-art techniques, even if the data itself is neither extended nor improved. We analyze the various performance improvements observed, report on comparative performance per technique — across all eleven languages in the corpus — and discuss the implications of our findings for under-resourced languages in general.

🚀 Conference Pioneer — INTERSPEECH 2016

🌉 Interdisciplinary Bridge — Deep Learning and Speech & Audio

📈 Trend Setter — Pretraining

🧭 Keyword Pioneer — state-of-the-art technique

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Interdisciplinary, Machine Learning, Natural Language Processing, Speech & Audio