Liulishuo's System for the Spoken CALL Shared Task 2018

Huy Nguyen; Lei Chen; Ramon Prieto; Chuan Wang; Yang Liu

2018 INTERSPEECH INTERSPEECH 2018

Liulishuo's System for the Spoken CALL Shared Task 2018

Abstract

The Spoken CALL (Computer-Assisted Language Learning) 2018 shared task requires systems to automatically accept or reject each single-sentence spoken response depending on whether the response is correct given a prompt. Spoken responses are first recognized into texts and then classified as ‘accept’ or ‘reject’ based on their language and meaning. This paper describes our system for the shared task. We focused on improving speech recognition performance, developing a rich set of features to capture the linguistic and semantic meaning of the responses and optimizing classification results for various factors (training set, n-best hypotheses of speech recognition, decision threshold, model ensemble). Our system achieves the best performance among the participating teams.

🌉 Interdisciplinary Bridge — Machine Learning and Speech & Audio

🧭 Keyword Pioneer — n-best hypothesis

🐣 Hot Topic Early Bird — model ensemble

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Huy Nguyen , Lei Chen , Ramon Prieto , Chuan Wang , Yang Liu

Topics

Machine Learning > Core Methods > Classification Machine Learning > Application Areas > Domain Adaptation Speech & Audio > Recognition > Speech Recognition

Keywords

automatic speech recognition model ensemble n-best hypothesis computer-assisted language learning accept-reject classification speech recognition feature

Download PDF

Related papers

HoloCompanion: An MR Friend for EveryOne 2018

Estimation of the Vocal Tract Length of Vowel Sounds Based on the Frequency of the Significant Spectral Valley 2018

Deep Learning Techniques for Koala Activity Detection 2018

An Exploration of Local Speaking Rate Variations in Mandarin Read Speech 2018

Acoustic Analysis of Whispery Voice Disguise in Mandarin Chinese 2018