Preserving Cross-Linguality of Pre-trained Models via Continual Learning

Zihan Liu; Genta Indra Winata; Andrea Madotto; Pascale Fung

2021 ACL ACL 2021

Preserving Cross-Linguality of Pre-trained Models via Continual Learning

Abstract

AbstractRecently, fine-tuning pre-trained language models (e.g., multilingual BERT) to downstream cross-lingual tasks has shown promising results. However, the fine-tuning process inevitably changes the parameters of the pre-trained model and weakens its cross-lingual ability, which leads to sub-optimal performance. To alleviate this problem, we leverage continual learning to preserve the original cross-lingual ability of the pre-trained model when we fine-tune it to downstream tasks. The experimental result shows that our fine-tuning methods can better preserve the cross-lingual ability of the pre-trained model in a sentence retrieval task. Our methods also achieve better performance than other fine-tuning baselines on the zero-shot cross-lingual part-of-speech tagging and named entity recognition tasks.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

📈 Trend Setter — Continual Learning

🧭 Keyword Pioneer — cross-lingual ability

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Zihan Liu , Genta Indra Winata , Andrea Madotto , Pascale Fung

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning Machine Learning > Learning Types > Continual Learning Machine Learning > Application Areas > Domain Adaptation Natural Language Processing > Resources & Methods > Multilingual NLP Natural Language Processing > Applications > Named Entity Recognition Artificial Intelligence > Learning Paradigms > Continual Learning Natural Language Processing > Applications > Part-of-Speech Tagging

Keywords

continual learning transfer learning cross-lingual transfer named entity recognition part-of-speech tagging pre-trained language model pretrained language model multilingual bert sentence retrieval zero-shot cross-lingual cross-lingual ability

Download PDF

Related papers

Out-of-Scope Intent Detection with Self-Supervision and Discriminative Training 2021

A Non-Autoregressive Edit-Based Approach to Controllable Text Simplification 2021

How Did This Get Funded?! Automatically Identifying Quirky Scientific Achievements 2021

Exploring Discourse Structures for Argument Impact Classification 2021

Language Embeddings for Typology and Cross-lingual Transfer Learning 2021