HW-TSC’s Submissions to the WMT 2022 General Machine Translation Shared Task

Daimeng Wei; Zhiqiang Rao; Zhanglin Wu; Shaojun Li; Yuanchang Luo; Yuhao Xie; Xiaoyu Chen; Hengchao Shang; Zongyao Li; Zhengzhe Yu; Jinlong Yang; Miaomiao Ma; Lizhi Lei; Hao Yang; Ying Qin

EMNLP 2022

Abstract

This paper presents the submissions of Huawei Translate Services Center (HW-TSC) to the WMT 2022 General Machine Translation Shared Task. We participate in 6 language pairs: Zh↔En, Ru↔En, Uk↔En, Hr↔En, Uk↔Cs and Liv↔En. We use the Transformer architecture and obtain the best performance from multiple variants with larger parameter sizes. We perform fine-grained pre-processing and filtering on the provided large-scale bilingual and monolingual datasets. For medium- and high-resource languages, we mainly use data augmentation strategies, including Back Translation, Self Training, Ensemble Knowledge Distillation, multilingual training, etc. For low-resource languages such as Liv, we start from pre-trained machine translation models and continue training with Regularization Dropout (R-Drop), together with the previously mentioned data augmentation methods. Our submissions obtain competitive results in the final evaluation.
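As a rough illustration of the R-Drop idea named in the abstract, the sketch below computes the loss for a single target token: the same input is passed through the model twice with dropout active, the two negative log-likelihoods are summed, and a symmetric KL term penalises disagreement between the two predicted distributions. The function names, the toy logits and the weight `alpha` are assumptions made for illustration; the abstract does not state the settings used in the submissions.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def kl_divergence(log_p, log_q):
    """KL(P || Q) for two distributions given as log-probabilities."""
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))


def r_drop_loss(logits_a, logits_b, target, alpha=1.0):
    """R-Drop style loss for one token.

    logits_a and logits_b stand for the outputs of two forward passes of the
    same model on the same input, each with a different dropout mask.
    alpha (hypothetical default) weights the consistency term.
    """
    log_p = log_softmax(logits_a)
    log_q = log_softmax(logits_b)
    # Negative log-likelihood of the reference token, summed over both passes.
    nll = -(log_p[target] + log_q[target])
    # Symmetric KL divergence between the two predictive distributions.
    sym_kl = 0.5 * (kl_divergence(log_p, log_q) + kl_divergence(log_q, log_p))
    return nll + alpha * sym_kl


if __name__ == "__main__":
    # Toy logits over a 3-word vocabulary; the reference token has index 0.
    print(r_drop_loss([2.0, 0.5, -1.0], [1.5, 0.7, -0.5], target=0))
```

When the two passes agree exactly, the KL term vanishes and the loss reduces to twice the ordinary negative log-likelihood; the larger the disagreement between passes, the more the consistency term contributes.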

Topics

Machine Learning > Application Areas > Data Augmentation
Machine Learning > Application Areas > Knowledge Distillation
Deep Learning > Architectures > Transformers
Natural Language Processing > Generation > Machine Translation
Artificial Intelligence > Core AI > Natural Language Processing

Keywords

knowledge distillation, machine translation, multilingual translation, neural machine translation, back translation, self training, ensemble knowledge distillation
