Tencent submission for WMT20 Quality Estimation Shared Task

Haijiang Wu; Zixuan Wang; Qingsong Ma; Xinjie Wen; Ruichen Wang; Xiaoli Wang; Yulin Zhang; Zhipeng Yao; Siyao Peng

2020 EMNLP EMNLP 2020

Tencent submission for WMT20 Quality Estimation Shared Task

Abstract

AbstractThis paper presents Tencent’s submission to the WMT20 Quality Estimation (QE) Shared Task: Sentence-Level Post-editing Effort for English-Chinese in Task 2. Our system ensembles two architectures, XLM-based and Transformer-based Predictor-Estimator models. For the XLM-based Predictor-Estimator architecture, the predictor produces two types of contextualized token representations, i.e., masked XLM and non-masked XLM; the LSTM-estimator and Transformer-estimator employ two effective strategies, top-K and multi-head attention, to enhance the sentence feature representation. For Transformer-based Predictor-Estimator architecture, we improve a top-performing model by conducting three modifications: using multi-decoding in machine translation module, creating a new model by replacing the transformer-based predictor with XLM-based predictor, and finally integrating two models by a weighted average. Our submission achieves a Pearson correlation of 0.664, ranking first (tied) on English-Chinese.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Haijiang Wu , Zixuan Wang , Qingsong Ma , Xinjie Wen , Ruichen Wang , Xiaoli Wang , Yulin Zhang , Zhipeng Yao , Siyao Peng

Topics

Machine Learning > Core Methods > Regression Deep Learning > Architectures > Transformers Natural Language Processing > Applications > Machine Translation Natural Language Processing > Resources & Methods > Large Language Models Natural Language Processing > Generation > Machine Translation Machine Learning > Learning Types > Ensemble Learning Deep Learning > Learning Types > Ensemble Learning

Keywords

ensemble learning attention mechanism machine translation cross-lingual transfer neural machine translation quality estimation model ensemble multi-head attention cross-lingual model

Download PDF

Related papers

Fast semantic parsing with well-typedness guarantees 2020

Detecting Objectifying Language in Online Professor Reviews 2020

Analogous Process Structure Induction for Sub-event Sequence Prediction 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans 2020

Robust and Interpretable Grounding of Spatial References with Relation Networks 2020