2007 NIPS NeurIPS 2007

A Bayesian LDA-based model for semi-supervised part-of-speech tagging

Abstract

We present a novel Bayesian model for semi-supervised part-of-speech tagging. Our model extends the Latent Dirichlet Allocation model and incorporates the intuition that words’ distributions over tags, p(t|w), are sparse. In addition we in- troduce a model for determining the set of possible tags of a word which captures important dependencies in the ambiguity classes of words. Our model outper- forms the best previously proposed model for this task on a standard dataset.

🌱 Topic Pioneer β€” Part-of-Speech Tagging
πŸŒ‰ Interdisciplinary Bridge β€” Artificial Intelligence and Machine Learning and Natural Language Processing
🧭 Keyword Pioneer β€” part-of-speech tagging
🐣 Hot Topic Early Bird β€” semi-supervised learning
🐝 Cross-Pollinator β€” Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio
πŸ“ˆ Trend Setter β€” Part-of-Speech Tagging