Targeted Multilingual Adaptation for Low-resource Language Families

C. M. Downey; Terra Blevins; Dhwani Serai; Dwija Parikh; Shane Steinert-Threlkeld

2024 EMNLP EMNLP 2024

Targeted Multilingual Adaptation for Low-resource Language Families

Abstract

AbstractMassively multilingual models are known to have limited utility in any one language, and to perform particularly poorly on low-resource languages. By contrast, targeted multinguality has been shown to benefit low-resource languages. To test this approach more rigorously, we systematically study best practices for adapting a pre-trained model to a language family. Focusing on the Uralic family as a test case, we adapt XLM-R under various configurations to model 15 languages; we then evaluate the performance of each experimental setting on two downstream tasks and 11 evaluation languages. Our adapted models significantly outperform mono- and multilingual baselines. A regression analysis reveals that adapted vocabulary size is relatively unimportant for low-resource languages, and that low-resource languages can be aggressively up-sampled during training at little detriment to performance in high-resource languages. These results introduce new best practices for performing language adaptation in a targeted setting.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — targeted multilingual adaptation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

C. M. Downey , Terra Blevins , Dhwani Serai , Dwija Parikh , Shane Steinert-Threlkeld

Topics

Natural Language Processing > Resources & Methods > Multilingual NLP Machine Learning > Learning Paradigms > Transfer Learning Artificial Intelligence > Core AI > Language Deep Learning > Learning Types > Transfer Learning

Keywords

low-resource language multilingual model language family vocabulary adaptation crosslingual transfer uralic language targeted multilingual adaptation language family adaptation

Download PDF

Related papers

EmbodiedBERT: Cognitively Informed Metaphor Detection Incorporating Sensorimotor Information 2024

Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation 2024

Learning to Extract Structured Entities Using Language Models 2024

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis 2024

CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages 2024