Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models

Chris Hokamp; John Glover; Demian Gholipour Ghalandari

2019 ACL ACL 2019

Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models

Abstract

AbstractWe study several methods for full or partial sharing of the decoder parameters of multi-lingual NMT models. Using only the WMT 2019 shared task parallel datasets for training, we evaluate both fully supervised and zero-shot translation performance in 110 unique translation directions. We use additional test sets and re-purpose evaluation methods recently used for unsupervised MT in order to evaluate zero-shot translation performance for language pairs where no gold-standard parallel data is available. To our knowledge, this is the largest evaluation of multi-lingual translation yet conducted in terms of the total size of the training data we use, and in terms of the number of zero-shot translation pairs we evaluate. We conduct an in-depth evaluation of the translation performance of different models, highlighting the trade-offs between methods of sharing decoder parameters. We find that models which have task-specific decoder parameters outperform models where decoder parameters are fully shared across all tasks.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — multi-lingual translation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Chris Hokamp , John Glover , Demian Gholipour Ghalandari

Topics

Machine Learning > Core Methods > Embedding Learning Machine Learning > Learning Types > Zero-Shot Learning Natural Language Processing > Generation > Machine Translation Machine Learning > Learning Types > Multi-Modal Learning

Keywords

zero-shot learning multilingual translation neural machine translation parameter sharing parallel datum zero-shot translation decoder parameter multi-lingual translation decoder parameter sharing

Download PDF

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019