2021
EMNLP
EMNLP 2021
Regressive Ensemble for Machine Translation Quality Evaluation
Abstract
AbstractThis work introduces a simple regressive ensemble for evaluating machine translation quality based on a set of novel and established metrics. We evaluate the ensemble using a correlation to expert-based MQM scores of the WMT 2021 Metrics workshop. In both monolingual and zero-shot cross-lingual settings, we show a significant performance improvement over single metrics. In the cross-lingual settings, we also demonstrate that an ensemble approach is well-applicable to unseen languages. Furthermore, we identify a strong reference-free baseline that consistently outperforms the commonly-used BLEU and METEOR measures and significantly improves our ensemble’s performance.
🌉
Interdisciplinary Bridge
— Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— regressive ensemble
🐣
Hot Topic Early Bird
— zero-shot evaluation
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Core Methods > Regression
Natural Language Processing > Applications > Machine Translation
Machine Learning > Learning Types > Multi-Task Learning
Machine Learning > Learning Types > Ensemble Learning
Machine Learning > Core Methods > Ensemble Methods
Machine Learning > Learning Types > Evaluation