A Bayesian LDA-based model for semi-supervised part-of-speech tagging

Kristina Toutanova; Mark Johnson

2007 NIPS NeurIPS 2007

A Bayesian LDA-based model for semi-supervised part-of-speech tagging

Abstract

We present a novel Bayesian model for semi-supervised part-of-speech tagging. Our model extends the Latent Dirichlet Allocation model and incorporates the intuition that words’ distributions over tags, p(t|w), are sparse. In addition we in- troduce a model for determining the set of possible tags of a word which captures important dependencies in the ambiguity classes of words. Our model outper- forms the best previously proposed model for this task on a standard dataset.

🌱 Topic Pioneer — Part-of-Speech Tagging

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — part-of-speech tagging

🐣 Hot Topic Early Bird — semi-supervised learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

📈 Trend Setter — Part-of-Speech Tagging

Authors

Kristina Toutanova , Mark Johnson

Topics

Artificial Intelligence > Bayesian & Probabilistic > Bayesian Learning Machine Learning > Learning Types > Semi-Supervised Learning Natural Language Processing > Understanding > Part-of-Speech Tagging Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling Machine Learning > Bayesian & Probabilistic > Bayesian Inference

Keywords

semi-supervised learning probabilistic modeling bayesian learning bayesian inference latent dirichlet allocation part-of-speech tagging sparse distribution topic model bayesian lda

Download PDF

Related papers

Exponential Family Predictive Representations of State 2007

Privacy-Preserving Belief Propagation and Sampling 2007

Efficient Principled Learning of Thin Junction Trees 2007

How SVMs can estimate quantiles and the median 2007

Rapid Inference on a Novel AND/OR graph for Object Detection, Segmentation and Parsing 2007