Detecting Post-Edited References and Their Effect on Human Evaluation

Věra Kloudová; Ondřej Bojar; Martin Popel

EACL 2021

Abstract

This paper gives a brief overview of methods for detecting that reference translations were in fact created by post-editing the output of an MT system. Two methods based on automatic metrics are presented: the BLEU difference between the suspected MT system and another good MT system, and the BLEU difference computed using additional references. Both methods raised the suspicion that the WMT 2020 Czech reference is based on MT. A manual analysis confirmed this suspicion by finding concrete evidence of post-editing in particular sentences. Finally, the paper presents a typology of post-editing changes that classifies typical errors or changes made by the post-editor, as well as errors adopted from the MT output.
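
The two BLEU-based checks can be sketched roughly as follows. This is a minimal illustration and not the authors' setup: it uses a simplified single-reference corpus BLEU with whitespace tokenization, and the function names (corpus_bleu, bleu_gap, reference_gap) and the toy sentences are hypothetical. How the second check is computed here is one plausible reading of "BLEU difference using additional references", not a confirmed definition.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hypotheses, references, max_n=4):
    """Simplified corpus-level BLEU on a 0-100 scale.

    Assumptions: one reference per segment, whitespace tokenization,
    no smoothing. Real evaluations would use a standard BLEU tool.
    """
    matches = [0] * max_n
    totals = [0] * max_n
    hyp_len = ref_len = 0
    for hyp, ref in zip(hypotheses, references):
        h, r = hyp.split(), ref.split()
        hyp_len += len(h)
        ref_len += len(r)
        for n in range(1, max_n + 1):
            h_counts, r_counts = ngrams(h, n), ngrams(r, n)
            # Clipped matches: each n-gram counts at most as often as in the reference.
            matches[n - 1] += sum((h_counts & r_counts).values())
            totals[n - 1] += sum(h_counts.values())
    if hyp_len == 0 or min(matches) == 0:
        return 0.0
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    brevity_penalty = min(1.0, math.exp(1 - ref_len / hyp_len))
    return 100 * brevity_penalty * math.exp(log_precision)


def bleu_gap(suspected_mt, other_mt, reference):
    """Check 1 (hypothetical helper): BLEU of the suspected MT minus BLEU of
    another good MT, both scored against the reference under suspicion."""
    return corpus_bleu(suspected_mt, reference) - corpus_bleu(other_mt, reference)


def reference_gap(mt, suspected_ref, additional_ref):
    """Check 2 (hypothetical helper): BLEU of one MT output against the
    suspected reference minus its BLEU against an independent reference."""
    return corpus_bleu(mt, suspected_ref) - corpus_bleu(mt, additional_ref)


# Toy data, invented for illustration only.
suspected_ref = ["the committee approved the new budget proposal on monday"]
additional_ref = ["on monday the committee gave its approval to the new budget plan"]
suspected_mt = ["the committee approved the new budget proposal on monday ."]
other_mt = ["the committee has approved a new budget proposal monday"]

print("gap between systems:", bleu_gap(suspected_mt, other_mt, suspected_ref))
print("gap between references:", reference_gap(suspected_mt, suspected_ref, additional_ref))
```

A large positive gap in either check is only a signal that warrants a closer look, which is why the abstract pairs the automatic checks with a manual analysis of individual sentences.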

Topics

Natural Language Processing > Applications > Machine Translation
Natural Language Processing > Generation > Machine Translation
Machine Learning > Learning Types > Evaluation

Keywords

machine translation, BLEU score, human evaluation, translation evaluation, post-editing detection, BLEU metric
