Iterative Dual Domain Adaptation for Neural Machine Translation

Jiali Zeng; Yang Liu; Jinsong Su; Yubing Ge; Yaojie Lu; Yongjing Yin; Jiebo Luo

2019 EMNLP EMNLP 2019

Iterative Dual Domain Adaptation for Neural Machine Translation

Abstract

AbstractPrevious studies on the domain adaptation for neural machine translation (NMT) mainly focus on the one-pass transferring out-of-domain translation knowledge to in-domain NMT model. In this paper, we argue that such a strategy fails to fully extract the domain-shared translation knowledge, and repeatedly utilizing corpora of different domains can lead to better distillation of domain-shared translation knowledge. To this end, we propose an iterative dual domain adaptation framework for NMT. Specifically, we first pretrain in-domain and out-of-domain NMT models using their own training corpora respectively, and then iteratively perform bidirectional translation knowledge transfer (from in-domain to out-of-domain and then vice versa) based on knowledge distillation until the in-domain NMT model convergences. Furthermore, we extend the proposed framework to the scenario of multiple out-of-domain training corpora, where the above-mentioned transfer is performed sequentially between the in-domain and each out-of-domain NMT models in the ascending order of their domain similarities. Empirical results on Chinese-English and English-German translation tasks demonstrate the effectiveness of our framework.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — translation knowledge

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Jiali Zeng , Yang Liu , Jinsong Su , Yubing Ge , Yaojie Lu , Yongjing Yin , Jiebo Luo

Topics

Machine Learning > Application Areas > Domain Adaptation Machine Learning > Application Areas > Knowledge Distillation Natural Language Processing > Applications > Machine Translation Deep Learning > Learning Types > Knowledge Distillation Deep Learning > Learning Types > Transfer Learning

Keywords

domain adaptation knowledge distillation neural machine translation iterative learning bidirectional translation translation knowledge bidirectional transfer

Download PDF

Related papers

Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation 2019

Chains-of-Reasoning at TextGraphs 2019 Shared Task: Reasoning over Chains of Facts for Explainable Multi-hop Inference 2019

A Boundary-aware Neural Model for Nested Named Entity Recognition 2019

A Multi-Pairwise Extension of Procrustes Analysis for Multilingual Word Translation 2019

Improving Distantly-Supervised Relation Extraction with Joint Label Embedding 2019