2016
COLING
COLING 2016
Robust Text Classification for Sparsely Labelled Data Using Multi-level Embeddings
Abstract
AbstractThe conventional solution for handling sparsely labelled data is extensive feature engineering. This is time consuming and task and domain specific. We present a novel approach for learning embedded features that aims to alleviate this problem. Our approach jointly learns embeddings at different levels of granularity (word, sentence and document) along with the class labels. The intuition is that topic semantics represented by embeddings at multiple levels results in better classification. We evaluate this approach in unsupervised and semi-supervised settings on two sparsely labelled classification tasks, outperforming the handcrafted models and several embedding baselines.
🌉
Interdisciplinary Bridge
— Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— document embedding
🐣
Hot Topic Early Bird
— feature engineering
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Learning Types > Semi-Supervised Learning
Machine Learning > Learning Types > Unsupervised Learning
Natural Language Processing > Applications > Text Classification
Machine Learning > Learning Types > Representation Learning
Machine Learning > Learning Paradigms > Semi-Supervised Learning