Mixtures of Deep Neural Experts for Automated Speech Scoring

Sara Papi; Edmondo Trentin; Roberto Gretter; Marco Matassoni; Daniele Falavigna

2020 INTERSPEECH INTERSPEECH 2020

Mixtures of Deep Neural Experts for Automated Speech Scoring

Abstract

The paper copes with the task of automatic assessment of second language proficiency from the language learners’ spoken responses to test prompts. The task has significant relevance to the field of computer assisted language learning. The approach presented in the paper relies on two separate modules: (1) an automatic speech recognition system that yields text transcripts of the spoken interactions involved, and (2) a multiple classifier system based on deep learners that ranks the transcripts into proficiency classes. Different deep neural network architectures (both feed-forward and recurrent) are specialized over diverse representations of the texts in terms of: a reference grammar, the outcome of probabilistic language models, several word embeddings, and two bag-of-word models. Combination of the individual classifiers is realized either via a probabilistic pseudo-joint model, or via a neural mixture of experts. Using the data of the third Spoken CALL Shared Task challenge, the highest values to date were obtained in terms of three popular evaluation metrics.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Speech & Audio

🧭 Keyword Pioneer — speech scoring

🐣 Hot Topic Early Bird — mixture of expert

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Sara Papi , Edmondo Trentin , Roberto Gretter , Marco Matassoni , Daniele Falavigna

Topics

Machine Learning > Core Methods > Classification Deep Learning > Architectures > Neural Networks Speech & Audio > Recognition > Speech Recognition Speech & Audio > Analysis > Speech Analysis Machine Learning > Core Methods > Ensemble Methods

Keywords

automatic speech recognition deep neural network mixture of expert classifier ensemble language learner speech assessment language proficiency speech scoring automated speech scoring

Download PDF

Related papers

Memory Controlled Sequential Self Attention for Sound Recognition 2020

Dual Attention in Time and Frequency Domain for Voice Activity Detection 2020

Automatic Prediction of Speech Intelligibility Based on X-Vectors in the Context of Head and Neck Cancer 2020

A Noise Robust Technique for Detecting Vowels in Speech Signals 2020

Joint Detection of Sentence Stress and Phrase Boundary for Prosody 2020