Cross-Database Models for the Classification of Dysarthria Presence

Stephanie Gillespie; Yash-yee Logan; Elliot Moore; Jacqueline Laures-Gore; Scott Russell; Rupal Patel

2017 INTERSPEECH INTERSPEECH 2017

Cross-Database Models for the Classification of Dysarthria Presence

Abstract

Dysarthria is a motor speech disorder that impacts verbal articulation and co-ordination, resulting in slow, slurred and imprecise speech. Automated classification of dysarthria subtypes and severities could provide a useful clinical tool in assessing the onset and progress in treatment. This study represents a pilot project to train models to detect the presence of dysarthria in continuous speech. Subsets of the Universal Access Research Dataset (UA-Speech) and the Atlanta Motor Speech Disorders Corpus (AMSDC) database were utilized in a cross-database training strategy (training on UA-Speech / testing on AMSDC) to distinguish speech with and without dysarthria. In addition to traditional spectral and prosodic features, the current study also includes features based on the Teager Energy Operator (TEO) and the glottal waveform. Baseline results on the UA-Speech dataset maximize word- and participant-level accuracies at 75.3% and 92.9% using prosodic features. However, the cross-training of UA-Speech tested on the AMSDC maximize word- and participant-level accuracies at 71.3% and 90% based on a TEO feature. The results of this pilot study reinforce consideration of dysarthria subtypes in cross-dataset training as well as highlight additional features that may be sensitive to the presence of dysarthria in continuous speech.

🧭 Keyword Pioneer — speech disorder classification

🐝 Cross-Pollinator — Artificial Intelligence, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Speech & Audio

Authors

Stephanie Gillespie , Yash-yee Logan , Elliot Moore , Jacqueline Laures-Gore , Scott Russell , Rupal Patel

Topics

Machine Learning > Core Methods > Classification Machine Learning > Application Areas > Domain Adaptation

Keywords

prosodic feature dysarthria detection speech disorder classification motor speech disorder teager energy operator cross-database training

Download PDF

Related papers

Description of the Munich-Passau Snore Sound Corpus (MPSSC) 2017

A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification 2017

Binaural Reverberant Speech Separation Based on Deep Neural Networks 2017

Building Audio-Visual Phonetically Annotated Arabic Corpus for Expressive Text to Speech 2017

A Comparison of Danish Listeners’ Processing Cost in Judging the Truth Value of Norwegian, Swedish, and English Sentences 2017