2018 ACL ACL 2018

Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning

Abstract

AbstractLack of text data has been the major issue on code-switching language modeling. In this paper, we introduce multi-task learning based language model which shares syntax representation of languages to leverage linguistic information and tackle the low resource data issue. Our model jointly learns both language modeling and Part-of-Speech tagging on code-switched utterances. In this way, the model is able to identify the location of code-switching points and improves the prediction of next word. Our approach outperforms standard LSTM based language model, with an improvement of 9.7% and 7.4% in perplexity on SEAME Phase I and Phase II dataset respectively.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing
📈 Trend Setter — Language Modeling
🧭 Keyword Pioneer — code-switching language
🐣 Hot Topic Early Bird — multi-task learning
🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio