Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles

Weiting Tan; Haoran Xu; Lingfeng Shen; Shuyue Stella Li; Kenton Murray; Philipp Koehn; Benjamin Van Durme; Yunmo Chen

2024 NAACL NAACL 2024

Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles

Abstract

AbstractLarge language models trained primarily in a monolingual setting have demonstrated their ability to generalize to machine translation using zero- and few-shot examples with in-context learning. However, even though zero-shot translations are relatively good, there remains a discernible gap comparing their performance with the few-shot setting. In this paper, we investigate the factors contributing to this gap and find that this gap can largely be closed (for about 70%) by matching the writing styles of the target corpus. Additionally, we explore potential approaches to enhance zero-shot baselines without the need for parallel demonstration examples, providing valuable insights into how these methods contribute to improving translation metrics.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Weiting Tan , Haoran Xu , Lingfeng Shen , Shuyue Stella Li , Kenton Murray , Philipp Koehn , Benjamin Van Durme , Yunmo Chen

Topics

Artificial Intelligence > Learning Paradigms > Few-Shot Learning Natural Language Processing > Applications > Machine Translation

Keywords

few-shot learning machine translation in-context learning zero-shot translation style matching

Download PDF

Related papers

Working Alliance Transformer for Psychotherapy Dialogue Classification 2024

Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences 2024

Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study 2024

TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation 2024

Extractive Summarization with Text Generator 2024