HW-TSC’s Submissions to the WMT22 Word-Level Auto Completion Task

Hao Yang; Hengchao Shang; Zongyao Li; Daimeng Wei; Xianghui He; Xiaoyu Chen; Zhengzhe Yu; Jiaxin Guo; Jinlong Yang; Shaojun Li; Yuanchang Luo; Yuhao Xie; Lizhi Lei; Ying Qin

EMNLP 2022

Abstract

This paper presents the submissions of Huawei Translation Services Center (HW-TSC) to the WMT 2022 Word-Level Auto Completion Task. We propose an end-to-end autoregressive model with bi-context, based on the Transformer, for this task. The model uses a mixture of subword and character encoding units to jointly encode the human input, the target-side context, and the decoded sequence, so that all available information is used. We use a single model to handle the four types of data structures in the task. During training, we experiment with using a machine translation model as the pre-trained model and fine-tuning it for the task. We also add BERT-style MLM data at the fine-tuning stage to improve model performance. We participate in the zh→en, en→de, and de→en directions and rank first in all three tracks. In particular, we outperform the second-place system by more than 5% in accuracy on the zh→en and en→de tracks. Human evaluation supports these results, demonstrating the effectiveness of our model.

Authors

Hao Yang; Hengchao Shang; Zongyao Li; Daimeng Wei; Xianghui He; Xiaoyu Chen; Zhengzhe Yu; Jiaxin Guo; Jinlong Yang; Shaojun Li; Yuanchang Luo; Yuhao Xie; Lizhi Lei; Ying Qin

Topics

Natural Language Processing > Applications > Machine Translation
Artificial Intelligence > Core AI > Natural Language Processing
Machine Learning > Learning Types > Machine Translation
Deep Learning > Learning Types > Sequence Modeling

Keywords

machine translation; autoregressive model; masked language modeling; character encoding; subword encoding; word-level auto completion; bi-context transformer
