Document-level Translation with LLM Reranking: Team-J at WMT 2024 General Translation Task

Keito Kudo; Hiroyuki Deguchi; Makoto Morishita; Ryo Fujii; Takumi Ito; Shintaro Ozaki; Koki Natsumi; Kai Sato; Kazuki Yano; Ryosuke Takahashi; Subaru Kimura; Tomomasa Hara; Yusuke Sakai; Jun Suzuki

EMNLP 2024

Abstract

We participated in the constrained track for English-Japanese and Japanese-Chinese translation at the WMT 2024 General Machine Translation Task. Our approach was to generate a large number of sentence-level translation candidates and select the most probable translation using minimum Bayes risk (MBR) decoding and document-level large language model (LLM) reranking. We first generated hundreds of translation candidates from multiple translation models and retained the top 30 candidates using MBR decoding. In addition, we continually pre-trained LLMs on target-language corpora to leverage document-level information. We then used the LLMs to select the most probable candidate for each sentence sequentially, in context, from the beginning of the document.
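
The two selection stages described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state which utility metric or LLM scoring function was used, so `overlap_f1` (a toy unigram F1) and the `score_fn` callable (a stand-in for an LLM's in-context score) are assumptions introduced here purely for illustration, as are the function names `mbr_top_k` and `rerank_document`.

```python
from collections import Counter
from typing import Callable, List, Sequence


def overlap_f1(hyp: str, ref: str) -> float:
    """Toy utility (assumption): unigram F1 over whitespace tokens."""
    h, r = Counter(hyp.split()), Counter(ref.split())
    common = sum((h & r).values())
    if common == 0:
        return 0.0
    precision = common / sum(h.values())
    recall = common / sum(r.values())
    return 2 * precision * recall / (precision + recall)


def mbr_top_k(
    candidates: Sequence[str],
    k: int,
    utility: Callable[[str, str], float] = overlap_f1,
) -> List[str]:
    """Keep the k candidates with the highest mean utility against the others.

    Each candidate is scored by treating every other candidate as a
    pseudo-reference, which is the usual sampling-based form of MBR.
    """
    scored = []
    for i, hyp in enumerate(candidates):
        others = [c for j, c in enumerate(candidates) if j != i]
        expected = sum(utility(hyp, ref) for ref in others) / max(len(others), 1)
        scored.append((expected, hyp))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [hyp for _, hyp in scored[:k]]


def rerank_document(
    candidates_per_sentence: Sequence[Sequence[str]],
    score_fn: Callable[[List[str], str], float],
) -> List[str]:
    """Greedy left-to-right selection of one candidate per sentence.

    score_fn(context, candidate) is a hypothetical hook: in practice it could
    return an LLM's log-probability of the candidate given the translations
    already chosen for the preceding sentences.
    """
    chosen: List[str] = []
    for candidates in candidates_per_sentence:
        best = max(candidates, key=lambda cand: score_fn(chosen, cand))
        chosen.append(best)
    return chosen
```

In a pipeline following the abstract, `mbr_top_k(candidates, k=30)` would be applied to each source sentence's candidate pool, and the resulting shortlists passed to `rerank_document` together with a scoring function backed by the continually pre-trained LLM.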

Topics

Natural Language Processing > Applications > Machine Translation

Keywords

document-level translation; sentence-level translation; large language model; translation reranking; minimum Bayesian risk
