2017
NIPS
NeurIPS 2017
Conic Scan-and-Cover algorithms for nonparametric topic modeling
Abstract
We propose new algorithms for topic modeling when the number of topics is unknown. Our approach relies on an analysis of the concentration of mass and angular geometry of the topic simplex, a convex polytope constructed by taking the convex hull of vertices representing the latent topics. Our algorithms are shown in practice to have accuracy comparable to a Gibbs sampler in terms of topic estimation, which requires the number of topics be given. Moreover, they are one of the fastest among several state of the art parametric techniques. Statistical consistency of our estimator is established under some conditions.
🌉
Interdisciplinary Bridge
— Machine Learning and Mathematics & Optimization and Natural Language Processing
📈
Trend Setter
— Topic Modeling
🧭
Keyword Pioneer
— nonparametric topic modeling
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Core Methods > Clustering
Machine Learning > Learning Types > Unsupervised Learning
Machine Learning > Optimization & Theory > Statistical Learning
Mathematics & Optimization > Optimization > Combinatorial Optimization
Machine Learning > Core Methods > Topic Modeling
Natural Language Processing > Applications > Topic Modeling