2020 INTERSPEECH INTERSPEECH 2020

Detecting and Analysing Spontaneous Oral Cancer Speech in the Wild

Abstract

Oral cancer speech is a disease which impacts more than half a million people worldwide every year. Analysis of oral cancer speech has so far focused on read speech. In this paper, we 1) present and 2) analyse a three-hour long spontaneous oral cancer speech dataset collected from YouTube. 3) We set baselines for an oral cancer speech detection task on this dataset. The analysis of these explainable machine learning baselines shows that sibilants and stop consonants are the most important indicators for spontaneous oral cancer speech detection.

🌉 Interdisciplinary Bridge — Machine Learning and Speech & Audio
🧭 Keyword Pioneer — oral cancer detection
🐝 Cross-Pollinator — Artificial Intelligence, Deep Learning, Healthcare & Medicine, Machine Learning, Natural Language Processing, Speech & Audio