The INTERSPEECH 2020 Computational Paralinguistics Challenge: Elderly Emotion, Breathing & Masks

Bjorn W. Schuller; Anton Batliner; Christian Bergler; Eva-Maria Messner; Antonia Hamilton; Shahin Amiriparian; Alice Baird; Georgios Rizos; Maximilian Schmitt; Lukas Stappen; Harald Baumeister; Alexis Deighton MacIntyre; Simone Hantke

2020 INTERSPEECH INTERSPEECH 2020

The INTERSPEECH 2020 Computational Paralinguistics Challenge: Elderly Emotion, Breathing & Masks

Abstract

The INTERSPEECH 2020 Computational Paralinguistics Challenge addresses three different problems for the first time in a research competition under well-defined conditions: In the Elderly Emotion Sub-Challenge, arousal and valence in the speech of elderly individuals have to be modelled as a 3-class problem; in the Breathing Sub-Challenge, breathing has to be assessed as a regression problem; and in the Mask Sub-Challenge, speech without and with a surgical mask has to be told apart. We describe the Sub-Challenges, baseline feature extraction, and classifiers based on the ‘usual’ ComParE and BoAW features as well as deep unsupervised representation learning using the auDeep toolkit, and deep feature extraction from pre-trained CNNs using the Deep Spectrum toolkit; in addition, we partially add deep end-to-end sequential modelling, and, for the first time in the challenge, linguistic analysis.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Speech & Audio

🧭 Keyword Pioneer — mask detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Bjorn W. Schuller , Anton Batliner , Christian Bergler , Eva-Maria Messner , Antonia Hamilton , Shahin Amiriparian , Alice Baird , Georgios Rizos , Maximilian Schmitt , Lukas Stappen , Harald Baumeister , Alexis Deighton MacIntyre , Simone Hantke

Topics

Machine Learning > Core Methods > Classification Machine Learning > Learning Types > Unsupervised Learning Speech & Audio > Analysis > Clinical Speech Analysis Speech & Audio > Analysis > Speech Analysis Deep Learning > Techniques > Self-Supervised Learning

Keywords

emotion recognition speech emotion recognition mask detection computational paralinguistics breathing analysis deep representation learning elderly emotion

Download PDF

Related papers

Memory Controlled Sequential Self Attention for Sound Recognition 2020

Dual Attention in Time and Frequency Domain for Voice Activity Detection 2020

Automatic Prediction of Speech Intelligibility Based on X-Vectors in the Context of Head and Neck Cancer 2020

A Noise Robust Technique for Detecting Vowels in Speech Signals 2020

Joint Detection of Sentence Stress and Phrase Boundary for Prosody 2020