2007
NIPS
NeurIPS 2007
Supervised Topic Models
Abstract
We introduce supervised latent Dirichlet allocation (sLDA), a statistical model of labelled documents. The model accommodates a variety of response types. We derive a maximum-likelihood procedure for parameter estimation, which relies on variational approximations to handle intractable posterior expectations. Prediction problems motivate this research: we use the fitted model to predict response values for new documents. We test sLDA on two real-world problems: movie ratings predicted from reviews, and web page popularity predicted from text descriptions. We illustrate the benefits of sLDA versus modern regularized regression, as well as versus an unsupervised LDA analysis followed by a separate regression.
🌉
Interdisciplinary Bridge
— Machine Learning and Natural Language Processing
📈
Trend Setter
— Language Modeling
🧭
Keyword Pioneer
— supervised topic models
🐣
Hot Topic Early Bird
— variational inference
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio
🌱
Topic Pioneer
— Fine-Tuning
Authors
Topics
Machine Learning > Core Methods > Classification
Machine Learning > Core Methods > Regression
Machine Learning > Core Methods > Representation Learning
Natural Language Processing > Generation > Language Modeling
Natural Language Processing > Applications > Text Classification
Natural Language Processing > Resources & Methods > Text Representation
Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling
Machine Learning > Learning Types > Supervised Learning
Artificial Intelligence > Core AI > Large Language Models
Machine Learning > Learning Types > Fine-Tuning
Machine Learning > Core Methods > Topic Modeling