2020 EMNLP EMNLP 2020

The TALP-UPC System Description for WMT20 News Translation Task: Multilingual Adaptation for Low Resource MT

Abstract

AbstractIn this article, we describe the TALP-UPC participation in the WMT20 news translation shared task for Tamil-English. Given the low amount of parallel training data, we resort to adapt the task to a multilingual system to benefit from the positive transfer from high resource languages. We use iterative backtranslation to fine-tune the system and benefit from the monolingual data available. In order to measure the effectivity of such methods, we compare our results to a bilingual baseline system.

🧭 Keyword Pioneer — iterative fine-tuning
🐣 Hot Topic Early Bird — multilingual machine translation
🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Deep Learning, Interdisciplinary, Machine Learning, Natural Language Processing, Speech & Audio