Challenging the State-of-the-art Machine Translation Metrics from a Linguistic Perspective

Eleftherios Avramidis; Shushen Manakhimova; Vivien Macketanz; Sebastian Möller

2023 EMNLP EMNLP 2023

Challenging the State-of-the-art Machine Translation Metrics from a Linguistic Perspective

Abstract

AbstractWe employ a linguistically motivated challenge set in order to evaluate the state-of-the-art machine translation metrics submitted to the Metrics Shared Task of the 8th Conference for Machine Translation. The challenge set includes about 21,000 items extracted from 155 machine translation systems for three language directions, covering more than 100 linguistically-motivated phenomena organized in 14 categories. The metrics that have the best performance with regard to our linguistically motivated analysis are the Cometoid22-wmt23 (a trained metric based on distillation) for German-English and MetricX-23-c (based on a fine-tuned mT5 encoder-decoder language model) for English-German and English-Russian. Some of the most difficult phenomena are passive voice for German-English, named entities, terminology and measurement units for English-German, and focus particles, adverbial clause and stripping for English-Russian.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Eleftherios Avramidis , Shushen Manakhimova , Vivien Macketanz , Sebastian Möller

Topics

Natural Language Processing > Applications > Information Retrieval Natural Language Processing > Applications > Machine Translation

Keywords

machine translation evaluation metric linguistic analysis machine translation metric translation error challenge set trained metric

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023