Findings of the WMT 2024 Shared Task on Chat Translation

Wafaa Mohammed; Sweta Agrawal; Amin Farajian; Vera Cabarrão; Bryan Eikema; Ana C Farinha; José G. C. de Souza

2024 EMNLP EMNLP 2024

Findings of the WMT 2024 Shared Task on Chat Translation

Abstract

AbstractThis paper presents the findings from the third edition of the Chat Translation Shared Task. As with previous editions, the task involved translating bilingual customer support conversations, specifically focusing on the impact of conversation context in translation quality and evaluation. We also include two new language pairs: English-Korean and English-Dutch, in addition to the set of language pairs from previous editions: English-German, English-French, and English-Brazilian Portuguese.We received 22 primary submissions and 32 contrastive submissions from eight teams, with each language pair having participation from at least three teams. We evaluated the systems comprehensively using both automatic metrics and human judgments via a direct assessment framework.The official rankings for each language pair were determined based on human evaluation scores, considering performance in both translation directions—agent and customer. Our analysis shows that while the systems excelled at translating individual turns, there is room for improvement in overall conversation-level translation quality.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Wafaa Mohammed , Sweta Agrawal , Amin Farajian , Vera Cabarrão , Bryan Eikema , Ana C Farinha , José G. C. de Souza

Topics

Machine Learning > Application Areas > Domain Adaptation Natural Language Processing > Applications > Machine Translation Natural Language Processing > Resources & Methods > Multilingual NLP Mathematics & Optimization > Optimization > Stochastic Methods Natural Language Processing > Generation > Machine Translation Natural Language Processing > Applications > Dialogue Systems

Keywords

machine translation human evaluation dialogue system translation quality context-aware translation conversation context dialogue translation bilingual translation chat translation conversational translation bilingual conversation conversation translation

Download PDF

Related papers

EmbodiedBERT: Cognitively Informed Metaphor Detection Incorporating Sensorimotor Information 2024

Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation 2024

Learning to Extract Structured Entities Using Language Models 2024

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis 2024

CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages 2024