KoBE: Knowledge-Based Machine Translation Evaluation

Zorik Gekhman; Roee Aharoni; Genady Beryozkin; Markus Freitag; Wolfgang Macherey

EMNLP 2020

Abstract

We propose a simple and effective method for machine translation evaluation which does not require reference translations. Our approach is based on (1) grounding the entity mentions found in each source sentence and candidate translation against a large-scale multilingual knowledge base, and (2) measuring the recall of the grounded entities found in the candidate vs. those found in the source. Our approach achieves the highest correlation with human judgements on 9 out of the 18 language pairs from the WMT19 benchmark for evaluation without references, which is the largest number of wins for a single evaluation method on this task. On 4 language pairs, we also achieve higher correlation with human judgements than BLEU. To foster further research, we release a dataset containing 1.8 million grounded entity mentions across 18 language pairs from the WMT19 metrics track data.
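The recall step in (2) can be illustrated with a minimal sketch. It assumes the grounding step in (1) has already run and produced a list of knowledge-base entity IDs for the source sentence and for the candidate translation. The function name `entity_recall`, the placeholder IDs, the multiset matching, and the handling of entity-free sources are illustrative assumptions; the abstract does not specify them.

```python
from collections import Counter


def entity_recall(source_entities, candidate_entities):
    """Fraction of source entity mentions whose ID also appears in the candidate.

    Both arguments are lists of knowledge-base entity IDs (hypothetical
    placeholders here). Counting repeated mentions as a multiset is an
    assumption of this sketch, not a detail taken from the abstract.
    """
    src = Counter(source_entities)
    cand = Counter(candidate_entities)
    total = sum(src.values())
    if total == 0:
        # No entities were grounded in the source, so recall is undefined.
        return None
    # Each source mention is matched at most once by a candidate mention.
    matched = sum(min(count, cand[entity]) for entity, count in src.items())
    return matched / total


# Hypothetical example: the source mentions two entities, the candidate keeps one.
score = entity_recall(["E1", "E2"], ["E1"])
```

Because the IDs come from a multilingual knowledge base, the same entity gets the same ID regardless of the language of the mention, which is what allows a source sentence and its translation to be compared without a reference.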

Topics

Natural Language Processing > Applications > Machine Translation

Artificial Intelligence > Core AI > Knowledge Editing

Keywords

machine translation, knowledge base, entity extraction
