An Uyghur Extension to the MASSIVE Multi-lingual Spoken Language Understanding Corpus with Comprehensive Evaluations

Ainikaerjiang Aimaiti; Di Wu; Liting Jiang; Gulinigeer Abudouwaili; Hao Huang; Wushour Silamu

2024 INTERSPEECH INTERSPEECH 2024

An Uyghur Extension to the MASSIVE Multi-lingual Spoken Language Understanding Corpus with Comprehensive Evaluations

Abstract

Spoken Language Understanding (SLU) plays a crucial role in task-oriented dialogues, and the development of SLU in various languages has been rapid. However, progress in Uyghur SLU research has been slow due to the lack of publicly available datasets. To address this issue, we extend the MASSIVE dataset to include Uyghur language, thus creating the first Uyghur SLU dataset, MASSIVE-UG. After incorporating MASSIVE-UG, the average overall accuracy of the other 51 languages has improved, demonstrating the reliability of the dataset constructed in this paper. Considering the agglutinative nature of Uyghur, we segmented it into stem and affix and conducted experiments using different embedding methods and multiple baselines. The experimental results indicate that the performance of Uyghur SLU is influenced by several factors, including representation, embedding, and modeling approach. The dataset and code are available at https://github.com/xjuspeech/MASSIVE-UG.

🧭 Keyword Pioneer — uyghur language

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Ainikaerjiang Aimaiti , Di Wu , Liting Jiang , Gulinigeer Abudouwaili , Hao Huang , Wushour Silamu

Topics

Natural Language Processing > Applications > Intent Classification Natural Language Processing > Resources & Methods > Multilingual NLP

Keywords

multilingual nlp speech recognition intent classification spoken language understanding uyghur language

Download PDF

Related papers

Reshape Dimensions Network for Speaker Recognition 2024

RevRIR: Joint Reverberant Speech and Room Impulse Response Embedding using Contrastive Learning with Application to Room Shape Classification 2024

Mixed Children/Adult/Childrenized Fine-Tuning for Children’s ASR: How to Reduce Age Mismatch and Speaking Style Mismatch 2024

Exploring Speech Foundation Models for Speaker Diarization in Child-Adult Dyadic Interactions 2024

K-means and hierarchical clustering of f0 contours 2024