Are Small Language Models the Silver Bullet to Low-Resource Languages Machine Translation?

Yewei Song; Lujun Li; Cedric Lothritz; Saad Ezzini; Lama Sleem; Niccolo' Gentile; Radu State; Tegawendé F. Bissyandé; Jacques Klein

EACL 2026

Abstract

Small language models (SLMs) offer computationally efficient alternatives to large language models, yet their translation quality for low-resource languages (LRLs) remains severely limited. This work presents the first large-scale evaluation of SLMs across 200 languages, revealing systematic underperformance in LRLs and identifying key sources of linguistic disparity. We show that knowledge distillation from strong teacher models using predominantly monolingual LRL data substantially boosts SLM translation quality, often enabling 2B–3B models to match or surpass systems up to 70B parameters. Our study highlights three core findings: (1) a comprehensive benchmark exposing the limitations of SLMs on 200 languages; (2) evidence that LRL-focused distillation improves translation without inducing catastrophic forgetting, with full-parameter fine-tuning and decoder-only teachers outperforming LoRA and encoder–decoder approaches; and (3) consistent cross-lingual gains demonstrating the scalability and robustness of the method. These results establish an effective, low-cost pathway for improving LRL translation and provide practical guidance for deploying SLMs in truly low-resource settings.
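One common way to distill from a teacher using only monolingual text is sequence-level distillation: the teacher translates each monolingual sentence, and the resulting pairs become fine-tuning data for the small model. The sketch below illustrates that data-construction step only. It is not the authors' pipeline: the function names, the prompt template, and the toy teacher are all assumptions made for illustration, and a real setup would call an actual teacher model in place of `toy_teacher`.

```python
from typing import Callable, Dict, Iterable, List


def build_distillation_pairs(
    monolingual_sentences: Iterable[str],
    teacher_translate: Callable[[str], str],
    source_lang: str,
    target_lang: str,
) -> List[Dict[str, str]]:
    """Turn monolingual sentences into (prompt, completion) training pairs.

    The prompt template here is hypothetical, not taken from the paper.
    """
    pairs: List[Dict[str, str]] = []
    for sentence in monolingual_sentences:
        sentence = sentence.strip()
        if not sentence:
            continue  # skip blank input lines
        translation = teacher_translate(sentence).strip()
        if not translation:
            continue  # drop examples where the teacher returned nothing
        prompt = f"Translate from {source_lang} to {target_lang}:\n{sentence}\n"
        pairs.append({"prompt": prompt, "completion": translation})
    return pairs


def toy_teacher(sentence: str) -> str:
    """Stand-in for a real teacher model so the sketch runs without one."""
    return sentence.upper()


pairs = build_distillation_pairs(
    ["first sentence", "   ", "second sentence"],
    toy_teacher,
    source_lang="LRL",
    target_lang="English",
)
```

The resulting `pairs` list could then be passed to whichever fine-tuning routine is used for the student model; the abstract reports that full-parameter fine-tuning outperformed LoRA in the authors' experiments.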

Topics

Machine Learning > Application Areas > Knowledge Distillation
Natural Language Processing > Applications > Machine Translation

Keywords

knowledge distillation, machine translation, cross-lingual transfer, low-resource language, small language model, parameter-efficient translation
