Penalizing Divergence: Multi-Parallel Translation for Low-Resource Languages of North America

Garrett Nicolai; Changbing Yang; Miikka Silfverberg

COLING 2022

Abstract

This paper explores a special case in multilingual machine translation: so-called multi-parallel translation, where the target data for all language pairs are identical. While multi-parallelism offers benefits that are not available in a standard translation setting, translation models can easily overfit when training data are limited. We introduce a regularizer, the divergence penalty, which penalizes the translation model when it represents source sentences with identical target translations in divergent ways. Experiments on very low-resourced Indigenous North American languages show that an initially deficient multilingual translator can improve by 4.9 BLEU points through mBART pre-training and by 5.5 BLEU points with the strategic addition of monolingual data, and that a divergence penalty leads to a further increase of 0.4 BLEU. Further experiments on Germanic languages demonstrate an improvement of 0.5 BLEU when applying the divergence penalty. An investigation of the neural encoder representations learned by our translation models shows that the divergence penalty encourages models to learn a unified neural interlingua.
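The abstract does not give the exact form of the divergence penalty, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that each source sentence is reduced to one vector by mean-pooling the encoder states, and that divergence is measured as the average pairwise squared Euclidean distance between the vectors of sources sharing a target. The names `mean_pool`, `divergence_penalty`, `regularized_loss`, and the `weight` parameter are hypothetical.

```python
from itertools import combinations
from typing import List, Sequence

Vector = Sequence[float]


def mean_pool(token_states: Sequence[Vector]) -> List[float]:
    """Average token-level encoder states into one sentence vector.

    Assumption: mean-pooling is used here for illustration only.
    """
    n = len(token_states)
    dim = len(token_states[0])
    return [sum(tok[d] for tok in token_states) / n for d in range(dim)]


def squared_distance(u: Vector, v: Vector) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def divergence_penalty(sentence_vectors: Sequence[Vector]) -> float:
    """Mean pairwise squared distance between the encodings of source
    sentences (in different languages) that share one target translation.

    Assumption: the distance measure is illustrative, not taken from the paper.
    """
    pairs = list(combinations(sentence_vectors, 2))
    if not pairs:
        return 0.0  # a single source has nothing to diverge from
    return sum(squared_distance(u, v) for u, v in pairs) / len(pairs)


def regularized_loss(translation_loss: float,
                     sentence_vectors: Sequence[Vector],
                     weight: float = 0.1) -> float:
    """Translation loss plus the weighted penalty (hypothetical weighting)."""
    return translation_loss + weight * divergence_penalty(sentence_vectors)
```

In this sketch, identical encodings for all sources of a shared target give a penalty of zero, so the regularizer only affects training when the encoder represents those sources differently.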

Topics

Deep Learning > Architectures > Transformers; Machine Learning > Learning Types > Multi-Task Learning; Machine Learning > Learning Types > Transfer Learning; Mathematics & Optimization > Optimization > Optimization; Natural Language Processing > Generation > Machine Translation; Machine Learning > Learning Types > Multi-Modal Learning

Keywords

machine translation; multilingual translation; neural machine translation; low-resource language; multilingual model; indigenous language; neural interlingua; divergence penalty
