Contextual Language Model Adaptation for Conversational Agents

Anirudh Raju; Behnam Hedayatnia; Linda Liu; Ankur Gandhe; Chandra Khatri; Angeliki Metallinou; Anu Venkatesh; Ariya Rastrow

2018 INTERSPEECH INTERSPEECH 2018

Contextual Language Model Adaptation for Conversational Agents

Abstract

Statistical language models (LM) play a key role in Automatic Speech Recognition (ASR) systems used by conversational agents. These ASR systems should provide a high accuracy under a variety of speaking styles, domains, vocabulary and argots. In this paper, we present a DNN-based method to adapt the LM to each user-agent interaction based on generalized contextual information, by predicting an optimal, context-dependent set of LM interpolation weights. We show that this framework for contextual adaptation provides accuracy improvements under different possible mixture LM partitions that are relevant for both (1) Goal-oriented conversational agents where it’s natural to partition the data by the requested application and for (2) Non-goal oriented conversational agents where the data can be partitioned using topic labels that come from predictions of a topic classifier. We obtain a relative WER reduction of 3% with a 1-pass decoding strategy and 6% in a 2-pass decoding framework, over an unadapted model. We also show up to a 15% relative WER reduction in recognizing named entities which is of significant value for conversational ASR systems.

🌉 Interdisciplinary Bridge — Natural Language Processing and Speech & Audio

🐣 Hot Topic Early Bird — conversational agent

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Anirudh Raju , Behnam Hedayatnia , Linda Liu , Ankur Gandhe , Chandra Khatri , Angeliki Metallinou , Anu Venkatesh , Ariya Rastrow

Topics

Natural Language Processing > Generation > Language Modeling Speech & Audio > Recognition > Automatic Speech Recognition

Keywords

language model adaptation automatic speech recognition conversational agent word error rate contextual adaptation neural network

Download PDF

Related papers

HoloCompanion: An MR Friend for EveryOne 2018

Estimation of the Vocal Tract Length of Vowel Sounds Based on the Frequency of the Significant Spectral Valley 2018

Deep Learning Techniques for Koala Activity Detection 2018

An Exploration of Local Speaking Rate Variations in Mandarin Read Speech 2018

Acoustic Analysis of Whispery Voice Disguise in Mandarin Chinese 2018