2016 INTERSPEECH INTERSPEECH 2016

Improving the Lwazi ASR Baseline

Abstract

We investigate the impact of recent advances in speech recognition techniques for under-resourced languages. Specifically, we review earlier results published on the Lwazi ASR corpus of South African languages, and experiment with additional acoustic modeling approaches. We demonstrate large gains by applying current state-of-the-art techniques, even if the data itself is neither extended nor improved. We analyze the various performance improvements observed, report on comparative performance per technique β€” across all eleven languages in the corpus β€” and discuss the implications of our findings for under-resourced languages in general.

πŸš€ Conference Pioneer β€” INTERSPEECH 2016
πŸŒ‰ Interdisciplinary Bridge β€” Deep Learning and Speech & Audio
πŸ“ˆ Trend Setter β€” Pretraining
🧭 Keyword Pioneer β€” state-of-the-art technique
🐝 Cross-Pollinator β€” Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Interdisciplinary, Machine Learning, Natural Language Processing, Speech & Audio