Knowledge Distillation for Bilingual Dictionary Induction

Ndapandula Nakashole; Raphael Flauger

2017 EMNLP EMNLP 2017

Knowledge Distillation for Bilingual Dictionary Induction

Abstract

AbstractLeveraging zero-shot learning to learn mapping functions between vector spaces of different languages is a promising approach to bilingual dictionary induction. However, methods using this approach have not yet achieved high accuracy on the task. In this paper, we propose a bridging approach, where our main contribution is a knowledge distillation training objective. As teachers, rich resource translation paths are exploited in this role. And as learners, translation paths involving low resource languages learn from the teachers. Our training objective allows seamless addition of teacher translation paths for any given low resource pair. Since our approach relies on the quality of monolingual word embeddings, we also propose to enhance vector representations of both the source and target language with linguistic information. Our experiments on various languages show large performance gains from our distillation training objective, obtaining as high as 17% accuracy improvements.

🌱 Topic Pioneer — Knowledge Distillation

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

📈 Trend Setter — Zero-Shot Learning

🧭 Keyword Pioneer — bilingual dictionary induction

🐣 Hot Topic Early Bird — zero-shot learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Ndapandula Nakashole , Raphael Flauger

Topics

Machine Learning > Core Methods > Embedding Learning Machine Learning > Learning Types > Zero-Shot Learning Machine Learning > Application Areas > Knowledge Distillation Artificial Intelligence > Learning Paradigms > Zero-Shot Learning Machine Learning > Learning Types > Transfer Learning Machine Learning > Learning Types > Knowledge Distillation Deep Learning > Techniques > Knowledge Distillation

Keywords

zero-shot learning knowledge distillation cross-lingual transfer low-resource language word embedding cross-lingual mapping bilingual dictionary induction monolingual word embedding

Download PDF

Related papers

Reinforced Video Captioning with Entailment Rewards 2017

Cross-lingual Character-Level Neural Morphological Tagging 2017

Inter-Weighted Alignment Network for Sentence Pair Modeling 2017

Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings 2017

An Empirical Analysis of Edit Importance between Document Versions 2017