2022
NAACL
NAACL 2022
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem
Abstract
AbstractWord embeddings are one of the most fundamental technologies used in natural language processing. Existing word embeddings are high-dimensional and consume considerable computational resources. In this study, we propose WordTour, unsupervised one-dimensional word embeddings. To achieve the challenging goal, we propose a decomposition of the desiderata of word embeddings into two parts, completeness and soundness, and focus on soundness in this paper. Owing to the single dimensionality, WordTour is extremely efficient and provides a minimal means to handle word embeddings. We experimentally confirmed the effectiveness of the proposed method via user study and document classification.
🌉
Interdisciplinary Bridge
— Machine Learning and Mathematics & Optimization and Natural Language Processing
🧭
Keyword Pioneer
— one-dimensional embedding
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Core Methods > Embedding Learning
Machine Learning > Learning Types > Unsupervised Learning
Natural Language Processing > Resources & Methods > Text Representation
Mathematics & Optimization > Optimization > Combinatorial Optimization
Natural Language Processing > Resources & Methods > Language Modeling