Findings of the JUST-NLP 2025 Shared Task on English-to-Hindi Legal Machine Translation

Kshetrimayum Boynao Singh; Sandeep Kumar; Debtanu Datta; Abhinav Joshi; Shivani Mishra; Shounak Paul; Pawan Goyal; Sarika Jain; Saptarshi Ghosh; Ashutosh Modi; Asif Ekbal

IJCNLP 2025

Abstract

This paper provides an overview of the Shared Task on Legal Machine Translation (L-MT), organized as part of the JUST-NLP 2025 Workshop at IJCNLP-AACL 2025. The task aims to improve the translation of legal texts, a domain where precision, structural faithfulness, and terminology preservation are essential. The training set comprises 50,000 sentences, and the validation and test sets comprise 5,000 sentences each. Submissions employed strategies such as domain-adaptive fine-tuning of multilingual models, QLoRA-based parameter-efficient adaptation, curriculum-guided supervised training, reinforcement learning with verifiable MT metrics, and from-scratch Transformer training. The systems are evaluated using BLEU, METEOR, TER, chrF++, BERTScore, and COMET. We also combine these metric scores into a single average score (AutoRank). The top-performing system is based on a fine-tuned distilled NLLB-200 model and achieved the highest AutoRank score of 72.1. Domain adaptation consistently yielded substantial improvements over baseline models, and precision-focused rewards proved especially effective for legal MT. The findings also highlight that large multilingual Transformers can deliver accurate and reliable English-to-Hindi legal translations when carefully fine-tuned on legal data, advancing the broader goal of improving access to justice in multilingual settings.
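The abstract describes AutoRank only as an average over the six metric scores, so the exact formula is not given here. The following is a minimal sketch of one plausible way to average such scores, under two assumptions that are not from the source: every metric is reported on a 0-100 scale, and TER (where lower is better) is inverted as 100 - TER before averaging. The function name `combined_score` and the example numbers are hypothetical.

```python
from statistics import mean

# Assumption: TER is the only lower-is-better metric and is inverted before averaging.
LOWER_IS_BETTER = {"TER"}


def combined_score(scores: dict[str, float]) -> float:
    """Average metric scores given on a 0-100 scale, inverting lower-is-better metrics."""
    adjusted = [
        100.0 - value if name in LOWER_IS_BETTER else value
        for name, value in scores.items()
    ]
    return mean(adjusted)


# Hypothetical scores for illustration only; these are not results from the shared task.
example = {
    "BLEU": 50.0,
    "METEOR": 70.0,
    "TER": 40.0,
    "chrF++": 70.0,
    "BERTScore": 90.0,
    "COMET": 80.0,
}

if __name__ == "__main__":
    print(round(combined_score(example), 1))
```

Whether the organizers invert TER, rescale any metric, or weight the metrics differently is not stated in the abstract, so this sketch should be read as an illustration of metric averaging rather than as the task's scoring procedure.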

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning; Machine Learning > Application Areas > Domain Adaptation

Keywords

reinforcement learning; domain adaptation; parameter-efficient adaptation; multilingual model; legal machine translation
