Automatic Phone Alignment of Code-switched Urum–Russian Field Data

Emily Ahn; Eleanor Chodroff; Gina-Anne Levow

2025 ACL ACL 2025

Automatic Phone Alignment of Code-switched Urum–Russian Field Data

Abstract

AbstractCode-switching, using multiple languages in a single utterance, is a common means of communication.In the language documentation process, speakers may code-switch between the target language and a language of broader communication; however, how to handle this mixed speech data is not always clearly addressed for speech research and specifically for a corpus phonetics pipeline.This paper investigates best practices for conducting phone-level forced alignment of code-switched field data using the Urum speech dataset from DoReCo. This dataset comprises 117 minutes of narrative utterances, of which 42% contain code-switched Urum–Russian speech.We demonstrate that the inclusion of Russian speech and Russian pretrained acoustic models can aid the alignment of Urum phones.Beyond using boundary alignment precision and accuracy metrics, we also discovered that the method of acoustic modeling impacted a downstream corpus phonetics investigation of code-switched Urum–Russian.

🧭 Keyword Pioneer — phone alignment

🐝 Cross-Pollinator — Data Science & Analytics, Deep Learning, Machine Learning, Natural Language Processing, Speech & Audio

🌉 Interdisciplinary Bridge — Interdisciplinary and Natural Language Processing and Speech & Audio