INTERSPEECH 2022
PERCEPT-R: An Open-Access American English Child/Clinical Speech Corpus Specialized for the Audio Classification of /ɹ/
Abstract
We present the PERCEPT-R corpus, a labeled corpus of child speakers of American English with typical speech and residual speech sound disorders affecting rhotics. We demonstrate the utility of age- and gender-normalized formants extracted from PERCEPT-R in training support vector classifiers to predict ground-truth perceptual judgments of "rhotic" (i.e., dialect-typical) and "derhotic" phones for novel speakers (mean of participant-specific f-metrics = .83; SD = .18, N = 281).
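To make two of the bookkeeping steps named in the abstract concrete, here is a minimal sketch in Python. It is not the authors' implementation. It assumes that "age-and-gender normalized" means z-scoring each formant against a reference mean and standard deviation for the speaker's age and gender group, that the "f-metric" is an F1 score computed per participant with "rhotic" as the positive class, and that the reported SD is taken across participants (population SD is used here). The function names, the record layout, and the choice of F2 and F3 are hypothetical, and the support vector classifier itself is left out.

```python
from collections import defaultdict
from statistics import mean, pstdev


def normalize_formants(tokens, reference, formants=("F2", "F3")):
    """Add z-scored formants to each token (assumed normalization scheme).

    tokens: list of dicts with "age", "gender", and raw formant values in Hz.
    reference: {(age, gender): {formant: (mean_hz, sd_hz)}}; caller-supplied,
    hypothetical reference values, not taken from the corpus.
    """
    normalized = []
    for tok in tokens:
        group_ref = reference[(tok["age"], tok["gender"])]
        z_scores = {}
        for name in formants:
            ref_mean, ref_sd = group_ref[name]
            z_scores[f"{name}_z"] = (tok[name] - ref_mean) / ref_sd
        normalized.append({**tok, **z_scores})
    return normalized


def f1(y_true, y_pred, positive="rhotic"):
    """F1 for one participant; returns 0.0 when there are no true positives."""
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def participant_f1_summary(records):
    """records: iterable of (speaker_id, true_label, predicted_label).

    Returns (per-speaker F1 dict, mean across speakers, population SD).
    """
    by_speaker = defaultdict(lambda: ([], []))
    for speaker, truth, pred in records:
        by_speaker[speaker][0].append(truth)
        by_speaker[speaker][1].append(pred)
    scores = {s: f1(t, p) for s, (t, p) in by_speaker.items()}
    values = list(scores.values())
    return scores, mean(values), pstdev(values)
```

Scoring each participant separately and then averaging keeps speakers with many tokens from dominating the summary, which matches the abstract's framing of the result as a mean of participant-specific scores over novel speakers.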