MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts

Xuxin Cheng; Zhihong Zhu; Xianwei Zhuang; Zhanpeng Chen; Zhiqi Huang; Yuexian Zou

ACL 2024

Abstract

As a crucial task in task-oriented dialogue systems, spoken language understanding (SLU) has garnered increasing attention. However, errors from automatic speech recognition (ASR) often degrade understanding performance. To tackle this problem, we propose MoE-SLU, an ASR-robust SLU framework based on the mixture-of-experts technique. Specifically, we first introduce three strategies to generate additional transcripts from clean transcripts. Then, we employ the mixture-of-experts technique to weigh the representations of the generated transcripts, the ASR transcripts, and the corresponding clean manual transcripts. We also regularize the weighted average of predictions against the predictions for the ASR transcripts by minimizing the Jensen-Shannon Divergence (JSD) between these two output distributions. Experimental results on three benchmark SLU datasets demonstrate that MoE-SLU achieves state-of-the-art performance. Further model analysis also verifies the superiority of our method.
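
The regularization step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the gating network or the exact loss, so the softmax gate, the function names (`mix_predictions`, `jsd`), and the toy intent distributions below are all assumptions chosen only to show a gate-weighted average of predictions being compared with the ASR-transcript prediction via JSD.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of gate scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def mix_predictions(gate_scores, expert_probs):
    # Weighted average of per-transcript class distributions.
    # Assumption: the gate is a plain softmax; the paper's gate may differ.
    weights = softmax(gate_scores)
    n_classes = len(expert_probs[0])
    return [sum(w * p[c] for w, p in zip(weights, expert_probs))
            for c in range(n_classes)]

def kl(p, q):
    # KL(p || q), skipping terms where p is zero (they contribute nothing).
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def jsd(p, q):
    # Jensen-Shannon Divergence: symmetric, bounded above by log(2).
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# Hypothetical intent distributions for one utterance (toy numbers).
clean = [0.7, 0.2, 0.1]      # prediction from the clean manual transcript
generated = [0.6, 0.3, 0.1]  # prediction from a generated transcript
asr = [0.4, 0.4, 0.2]        # prediction from the ASR transcript

mixed = mix_predictions([0.0, 0.0, 0.0], [clean, generated, asr])
reg_loss = jsd(mixed, asr)   # term to minimize alongside the task loss
```

With equal gate scores the mixture reduces to a plain average of the three distributions. In training, the gate scores would be learned and `reg_loss` would be added to the intent and slot losses with some weighting, which this sketch leaves unspecified.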

Topics

Natural Language Processing > Applications > Intent Classification
Speech & Audio > Recognition > Automatic Speech Recognition

Keywords

automatic speech recognition, intent classification, spoken language understanding, mixture of experts, slot filling
