Learning to Copy for Automatic Post-Editing

Xuancheng Huang; Yang Liu; Huanbo Luan; Jingfang Xu; Maosong Sun

IJCNLP 2019

Abstract

Automatic post-editing (APE), which aims to correct errors in the output of machine translation systems in a post-processing step, is an important task in natural language processing. While recent work has achieved considerable performance gains by using neural networks, how to model the copying mechanism for APE remains a challenge. In this work, we propose a new method for modeling copying for APE. To better identify translation errors, our method learns the representations of source sentences and system outputs in an interactive way. These representations are used to explicitly indicate which words in the system outputs should be copied. Finally, CopyNet (Gu et al., 2016) can be combined with our method to place the copied words in the correct positions in post-edited translations. Experiments on the datasets of the WMT 2016-2017 APE shared tasks show that our approach outperforms the best published results.
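To make the idea of copying from the system output concrete, the following is a minimal sketch of a generic copy/generate mixture in the spirit of CopyNet-style decoders. It is not the authors' model: the function name `mix_copy_and_generate`, the scalar `copy_gate`, and all scores are hypothetical values chosen for illustration, and the abstract does not specify how the paper combines the two distributions.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mix_copy_and_generate(vocab, gen_scores, mt_tokens, copy_scores, copy_gate):
    """Combine a generation distribution with a copy distribution.

    vocab       : output vocabulary tokens (hypothetical toy vocabulary)
    gen_scores  : one unnormalized score per vocabulary token
    mt_tokens   : tokens of the machine translation output to copy from
    copy_scores : one unnormalized score per MT token (e.g. attention logits)
    copy_gate   : assumed scalar in [0, 1], the probability of copying
    Returns a dict mapping token -> probability; it sums to 1.
    """
    if not 0.0 <= copy_gate <= 1.0:
        raise ValueError("copy_gate must lie in [0, 1]")
    if len(vocab) != len(gen_scores) or len(mt_tokens) != len(copy_scores):
        raise ValueError("scores must align with their tokens")

    p_gen = softmax(gen_scores)
    p_copy = softmax(copy_scores)

    # Start from the generation distribution, scaled by (1 - gate).
    final = {tok: (1.0 - copy_gate) * p for tok, p in zip(vocab, p_gen)}
    # Add copy mass; repeated MT tokens accumulate, and tokens that are
    # outside the vocabulary become reachable through copying alone.
    for tok, p in zip(mt_tokens, p_copy):
        final[tok] = final.get(tok, 0.0) + copy_gate * p
    return final


# Toy example: "Katze" is out of vocabulary but appears in the MT output.
vocab = ["the", "cat", "sat", "<unk>"]
gen_scores = [1.0, 0.5, 0.2, 0.0]
mt_tokens = ["the", "Katze", "sat"]
copy_scores = [0.1, 2.0, 0.3]

dist = mix_copy_and_generate(vocab, gen_scores, mt_tokens, copy_scores, copy_gate=0.6)
best = max(dist, key=dist.get)
print(best, round(dist[best], 3))
```

In this toy setting the out-of-vocabulary MT token receives the most probability mass, which illustrates why copying helps APE: most words in a system output are already correct and can be carried over unchanged. In the approach described above, the decision of which words to copy is informed by jointly learned representations of the source sentence and the system output, rather than by a fixed gate as assumed here.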

Topics

Artificial Intelligence > Core AI > Multimodal Learning
Deep Learning > Models > Generative Models
Natural Language Processing > Generation > Text Generation
Natural Language Processing > Applications > Machine Translation

Keywords

machine translation, neural machine translation, text generation, copy mechanism, automatic post-editing, neural network, copying mechanism
