Exploring Context-Aware Evaluation Metrics for Machine Translation

Xinyu Hu; Xunjian Yin; Xiaojun Wan

2023 EMNLP EMNLP 2023

Exploring Context-Aware Evaluation Metrics for Machine Translation

Abstract

AbstractPrevious studies on machine translation evaluation mostly focused on the quality of individual sentences, while overlooking the important role of contextual information. Although WMT Metrics Shared Tasks have introduced context content into the human annotations of translation evaluation since 2019, the relevant metrics and methods still did not take advantage of the corresponding context. In this paper, we propose a context-aware machine translation evaluation metric called Cont-COMET, built upon the effective COMET framework. Our approach simultaneously considers the preceding and subsequent contexts of the sentence to be evaluated and trains our metric to be aligned with the setting during human annotation. We also introduce a content selection method to extract and utilize the most relevant information. The experiments and evaluation of Cont-COMET on the official test framework from WMT show improvements in both system-level and segment-level assessments.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Data Science & Analytics and Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — translation quality assessment

🐣 Hot Topic Early Bird — quality assessment

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Xinyu Hu , Xunjian Yin , Xiaojun Wan

Topics

Artificial Intelligence > Core AI > Multimodal Learning Machine Learning > Core Methods > Regression Natural Language Processing > Applications > Machine Translation Data Science & Analytics > Methods > Time Series Natural Language Processing > Applications > Text Generation Machine Learning > Optimization & Theory > Evaluation Machine Learning > Core Methods > Evaluation Machine Learning > Learning Types > Machine Translation Deep Learning > Learning Types > Evaluation

Keywords

machine translation evaluation metric quality assessment translation quality machine translation evaluation content selection context-aware metric translation quality assessment sentence-level evaluation segment-level assessment wmt metrics

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023