Investigating Objective Intelligibility in Real-Time EMG-to-Speech Conversion

Lorenz Diener; Tanja Schultz

2018 INTERSPEECH INTERSPEECH 2018

Investigating Objective Intelligibility in Real-Time EMG-to-Speech Conversion

Abstract

This paper presents an analysis of the influence of various system parameters on the output quality of our neural network based real-time EMG-to-Speech conversion system. This EMG-to-Speech system allows for the direct conversion of facial surface electromyographic signals into audible speech in real time, allowing for a closed-loop setup where users get direct audio feedback. Such a setup opens new avenues for research and applications through co-adaptation approaches. In this paper, we evaluate the influence of several parameters on the output quality, such as time context, EMG-Audio delay, network-, training data- and Mel spectrogram size. The resulting output quality is evaluated based on the objective output quality measure STOI.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Healthcare & Medicine

🧭 Keyword Pioneer — emg-to-speech conversion

🐣 Hot Topic Early Bird — real-time processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Lorenz Diener , Tanja Schultz

Topics

Artificial Intelligence > Core AI > Multimodal Learning Healthcare & Medicine > Research > Biosignal Processing

Keywords

real-time processing speech intelligibility mel spectrogram neural network emg-to-speech conversion

Download PDF

Related papers

HoloCompanion: An MR Friend for EveryOne 2018

Estimation of the Vocal Tract Length of Vowel Sounds Based on the Frequency of the Significant Spectral Valley 2018

Deep Learning Techniques for Koala Activity Detection 2018

An Exploration of Local Speaking Rate Variations in Mandarin Read Speech 2018

Acoustic Analysis of Whispery Voice Disguise in Mandarin Chinese 2018