A Comparison of Deep Learning Methods for Language Understanding

Mandy Korpusik; Zoe Liu; James Glass

2019 INTERSPEECH INTERSPEECH 2019

A Comparison of Deep Learning Methods for Language Understanding

Abstract

In this paper, we compare a suite of neural networks (recurrent, convolutional, and the recently proposed BERT model) to a CRF with hand-crafted features on three semantic tagging corpora: the Air Travel Information System (ATIS) benchmark, restaurant queries, and written and spoken meal descriptions. Our motivation is to investigate pre-trained BERT’s transferability to the domains we are interested in. We demonstrate that neural networks without feature engineering outperform state-of-the-art statistical and deep learning approaches on all three tasks (except written meal descriptions, where the CRF is slightly better) and that deep, attention-based BERT, in particular, surpasses state-of-the-art results on these tasks. Error analysis shows the models are less confident when making errors, enabling the system to follow up with the user when uncertain.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning

📈 Trend Setter — Foundation Models

🧭 Keyword Pioneer — transformer models

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Machine Learning, Natural Language Processing, Reinforcement Learning, Speech & Audio

🐣 Hot Topic Early Bird — language understanding

Authors

Mandy Korpusik , Zoe Liu , James Glass

Topics

Artificial Intelligence > Core AI > Foundation Models Deep Learning > Architectures > Transformers Natural Language Processing > Understanding > Semantic Analysis Machine Learning > Learning Types > Transfer Learning

Keywords

transfer learning speech recognition natural language understanding language understanding recurrent neural network deep neural network pre-trained model semantic tagging transformer model

Download PDF

Related papers

Using Real-Time Visual Biofeedback for Second Language Instruction 2019

VAE-Based Regularization for Deep Speaker Embedding 2019

End-to-End SpeakerBeam for Single Channel Target Speech Recognition 2019

Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition 2019

Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile 2019