Automatic Children Speech Sound Disorder Detection with Age and Speaker Bias Mitigation

Gahye Kim; Yunjung Eom; Selina S. Sung; Seunghee Ha; Tae-Jin Yoon; Jungmin So

2024 INTERSPEECH INTERSPEECH 2024

Automatic Children Speech Sound Disorder Detection with Age and Speaker Bias Mitigation

Abstract

Addressing speech sound disorders (SSD) in early childhood is pivotal for mitigating cognitive and communicative impediments. Previous works on automatic SSD detection rely on audio features without considering the age and speaker bias which results in degraded performance. In this paper, we propose an SSD detection system in which debiasing techniques are applied to mitigate the biases. For the age bias, we use a multi-head model where the feature extractor is shared across different age groups but the final decision is made using the age-dependent classifier. For the speaker bias, we augment the dataset by mixing the audios of the multiple speakers in the same age group. When evaluated with our Korean SSD dataset, the proposed method showed significant improvements over previous approaches.

🌉 Interdisciplinary Bridge — Data Science & Analytics and Machine Learning

🧭 Keyword Pioneer — age bias mitigation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Machine Learning, Natural Language Processing, Speech & Audio

Authors

Gahye Kim , Yunjung Eom , Selina S. Sung , Seunghee Ha , Tae-Jin Yoon , Jungmin So

Topics

Machine Learning > Application Areas > Fairness Data Science & Analytics > Applications > Disease Surveillance

Keywords

speech sound disorder multi-head model age bias mitigation speaker bias mitigation age-dependent classifier

Download PDF

Related papers

Reshape Dimensions Network for Speaker Recognition 2024

RevRIR: Joint Reverberant Speech and Room Impulse Response Embedding using Contrastive Learning with Application to Room Shape Classification 2024

Mixed Children/Adult/Childrenized Fine-Tuning for Children’s ASR: How to Reduce Age Mismatch and Speaking Style Mismatch 2024

Exploring Speech Foundation Models for Speaker Diarization in Child-Adult Dyadic Interactions 2024

K-means and hierarchical clustering of f0 contours 2024