EMNLP 2017

Knowledge Distillation for Bilingual Dictionary Induction

Abstract

Leveraging zero-shot learning to learn mapping functions between the vector spaces of different languages is a promising approach to bilingual dictionary induction. However, methods using this approach have not yet achieved high accuracy on the task. In this paper, we propose a bridging approach whose main contribution is a knowledge distillation training objective. Translation paths between rich-resource languages serve as teachers, and translation paths involving low-resource languages learn from them as learners. Our training objective allows seamless addition of teacher translation paths for any given low-resource pair. Since our approach relies on the quality of monolingual word embeddings, we also propose to enhance the vector representations of both the source and target languages with linguistic information. Our experiments on various languages show large performance gains from our distillation training objective, with accuracy improvements of up to 17%.
