2019
ACL
ACL 2019
Treat the Word As a Whole or Look Inside? Subword Embeddings Model Language Change and Typology
Abstract
AbstractWe use a variant of word embedding model that incorporates subword information to characterize the degree of compositionality in lexical semantics. Our models reveal some interesting yet contrastive patterns of long-term change in multiple languages: Indo-European languages put more weight on subword units in newer words, while conversely Chinese puts less weights on the subwords, but more weight on the word as a whole. Our method provides novel evidence and methodology that enriches existing theories in evolutionary linguistics. The resulting word vectors also has decent performance in NLP-related tasks.
❓
The Questioner
🌉
Interdisciplinary Bridge
— Deep Learning and Interdisciplinary and Natural Language Processing
🧭
Keyword Pioneer
— evolutionary linguistics
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Speech & Audio
Authors
Topics
Natural Language Processing > Resources & Methods > Text Representation
Interdisciplinary > Linguistics
Interdisciplinary > Linguistics > Computational Linguistics
Interdisciplinary > Linguistics > Morphology
Interdisciplinary > Linguistics > Semantics
Deep Learning > Learning Types > Representation Learning