Unbabel’s Participation in the WMT20 Metrics Shared Task

Ricardo Rei; Craig Stewart; Ana C Farinha; Alon Lavie

EMNLP 2020

Abstract

We present the contribution of the Unbabel team to the WMT 2020 Shared Task on Metrics. We intend to participate in the segment-level, document-level, and system-level tracks on all language pairs, as well as the “QE as a Metric” track. Accordingly, we illustrate the results of our models in these tracks with reference to test sets from the previous year. Our submissions build upon the recently proposed COMET framework: we train several estimator models to regress on different human-generated quality scores, and a novel ranking model trained on relative ranks obtained from Direct Assessments. We also propose a simple technique for converting segment-level predictions into a document-level score. Overall, our systems achieve strong results for all language pairs on previous test sets and in many cases set a new state of the art.
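The abstract does not say how segment-level predictions are combined into a document-level score. As a minimal sketch only, one plausible reading is a length-weighted mean of the segment scores. The function name `document_score`, its parameters, and the weighting by segment length are all assumptions made for illustration, not the authors' confirmed method.

```python
from typing import Sequence


def document_score(
    segment_scores: Sequence[float],
    segment_lengths: Sequence[int],
) -> float:
    """Combine segment-level scores into one document-level score.

    Assumption (not stated in the abstract): each segment is weighted
    by its length, e.g. its token count, so longer segments contribute
    more to the document score.
    """
    if len(segment_scores) != len(segment_lengths):
        raise ValueError("scores and lengths must have the same size")
    total_length = sum(segment_lengths)
    if total_length <= 0:
        raise ValueError("total segment length must be positive")
    weighted_sum = sum(
        score * length
        for score, length in zip(segment_scores, segment_lengths)
    )
    return weighted_sum / total_length
```

With equal segment lengths this reduces to a plain average, which may be a reasonable baseline when length information is unavailable.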

Topics

Machine Learning > Core Methods > Regression; Natural Language Processing > Applications > Machine Translation; Computer Science > Applications > Information Retrieval; Machine Learning > Learning Types > Regression

Keywords

machine translation; quality estimation; ranking model; human evaluation; regression model; machine translation evaluation; segment-level scoring; human-generated quality score; document-level scoring
