2020
EMNLP
EMNLP 2020
Itβs not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT
Abstract
AbstractRecent works have demonstrated that multilingual BERT (mBERT) learns rich cross-lingual representations, that allow for transfer across languages. We study the word-level translation information embedded in mBERT and present two simple methods that expose remarkable translation capabilities with no fine-tuning. The results suggest that most of this information is encoded in a non-linear way, while some of it can also be recovered with purely linear tools. As part of our analysis, we test the hypothesis that mBERT learns representations which contain both a language-encoding component and an abstract, cross-lingual component, and explicitly identify an empirical language-identity subspace within mBERT representations.
π
Interdisciplinary Bridge
β Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing
π§
Keyword Pioneer
β language identity
π
Cross-Pollinator
β Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Multimodal Learning
Machine Learning > Core Methods > Embedding Learning
Natural Language Processing > Resources & Methods > Multilingual NLP
Deep Learning > Learning Types > Transfer Learning
Deep Learning > Models > Language Models
Artificial Intelligence > Core AI > Multi-Lingual Learning