Regressive Ensemble for Machine Translation Quality Evaluation

Michal Štefánik; Vít Novotný; Petr Sojka

EMNLP 2021

Abstract

This work introduces a simple regressive ensemble for evaluating machine translation quality, based on a set of novel and established metrics. We evaluate the ensemble by its correlation with the expert-based MQM scores of the WMT 2021 Metrics workshop. In both monolingual and zero-shot cross-lingual settings, we show a significant performance improvement over single metrics. In the cross-lingual setting, we also demonstrate that the ensemble approach applies well to unseen languages. Furthermore, we identify a strong reference-free baseline that consistently outperforms the commonly used BLEU and METEOR measures and significantly improves our ensemble's performance.
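The general idea of a regressive ensemble can be sketched as follows: several metric scores per translation are combined by a regressor fitted to human judgements, and the combined score is then compared with each single metric by its correlation with held-out human scores. This is only an illustrative sketch, not the authors' implementation. The data is synthetic, ordinary least squares stands in for whichever regressor the paper uses, Pearson correlation is an assumed choice of correlation measure, and the function names `fit_ensemble`, `predict`, and `pearson` are hypothetical.

```python
import numpy as np


def fit_ensemble(metric_scores, human_scores):
    """Fit least-squares weights (plus an intercept) mapping metric scores to human scores."""
    X = np.column_stack([metric_scores, np.ones(len(metric_scores))])
    weights, *_ = np.linalg.lstsq(X, human_scores, rcond=None)
    return weights


def predict(weights, metric_scores):
    """Combine metric scores into one ensemble score per translation."""
    X = np.column_stack([metric_scores, np.ones(len(metric_scores))])
    return X @ weights


def pearson(a, b):
    """Pearson correlation between two score vectors."""
    return float(np.corrcoef(a, b)[0, 1])


# Synthetic stand-in data (assumption): a latent quality value per translation,
# three noisy "metrics" that each observe it, and a less noisy "human" score.
rng = np.random.default_rng(0)
n = 2000
quality = rng.normal(size=n)
metrics = quality[:, None] + rng.normal(size=(n, 3))
human = quality + 0.1 * rng.normal(size=n)

# Fit on the first half, evaluate correlation on the held-out second half.
train, test = slice(0, 1000), slice(1000, n)
weights = fit_ensemble(metrics[train], human[train])
ensemble_r = pearson(predict(weights, metrics[test]), human[test])
single_r = [pearson(metrics[test, i], human[test]) for i in range(3)]

print("single-metric correlations:", single_r)
print("ensemble correlation:", ensemble_r)
```

In this toy setup the ensemble benefits because the metrics carry independent noise around the same underlying quality; how much real metrics complement each other is an empirical question that the paper's experiments address.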

Topics

Machine Learning > Core Methods > Regression
Natural Language Processing > Applications > Machine Translation
Machine Learning > Learning Types > Multi-Task Learning
Machine Learning > Learning Types > Ensemble Learning
Machine Learning > Core Methods > Ensemble Methods
Machine Learning > Learning Types > Evaluation

Keywords

ensemble learning, machine translation, cross-lingual transfer, regression analysis, zero-shot evaluation, machine translation evaluation, quality evaluation, cross-lingual evaluation, machine translation quality, regressive ensemble, MQM score correlation
