Comparison of Speech Tasks and Recording Devices for Voice Based Automatic Classification of Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis

Suhas B.N.; Deep Patel; Nithin Rao; Yamini Belur; Pradeep Reddy; Nalini Atchayaram; Ravi Yadav; Dipanjan Gope; Prasanta Kumar Ghosh

2019 INTERSPEECH INTERSPEECH 2019

Comparison of Speech Tasks and Recording Devices for Voice Based Automatic Classification of Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis

Abstract

We consider the task of speech based automatic classification of patients with amyotrophic lateral sclerosis (ALS) and healthy subjects. The role of different speech tasks and recording devices on classification accuracy is examined. Sustained phoneme production (PHON), diadochokinetic task (DDK) and spontaneous speech (SPON) have been used as speech tasks. The chosen five recording devices include a high quality microphone and built-in smartphone microphones at various price ranges. Experiments are performed using speech data from 25 ALS patients and 25 healthy subjects using support vector machines and deep neural networks as classifiers and suprasegmental features based on mel frequency cepstral coefficients. Results reveal that DDK consistently performs better than SPON and PHON across all devices for discriminating ALS patients and healthy subjects. Considering DDK, the best classification accuracy of 92.2% is obtained using a high quality microphone but the accuracy drops if there is a mismatch between the microphones for training and test. However, a classifier trained with recordings from all devices together performs more uniformly across all devices. The findings from this study could aid in determining the choice of the task and device in developing an assistive tool for detection and monitoring of ALS.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

🧭 Keyword Pioneer — amyotrophic lateral sclerosis

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Suhas B.N. , Deep Patel , Nithin Rao , Yamini Belur , Pradeep Reddy , Nalini Atchayaram , Ravi Yadav , Dipanjan Gope , Prasanta Kumar Ghosh

Topics

Machine Learning > Core Methods > Classification Machine Learning > Core Methods > Representation Learning Deep Learning > Architectures > Neural Networks Speech & Audio > Analysis > Clinical Speech Analysis Healthcare & Medicine > Clinical > Medical AI

Keywords

speech analysis speech classification support vector machine deep neural network mel frequency cepstral coefficient amyotrophic lateral sclerosis

Download PDF

Related papers

Using Real-Time Visual Biofeedback for Second Language Instruction 2019

VAE-Based Regularization for Deep Speaker Embedding 2019

End-to-End SpeakerBeam for Single Channel Target Speech Recognition 2019

Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition 2019

Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile 2019