Prior-aware Composition Inference for Spectral Topic Models

Moontae Lee; David Bindel; David Mimno

AISTATS 2020

Abstract

Spectral algorithms operate on matrices or tensors of word co-occurrence to learn latent topics. These approaches remove the dependence on the original documents and yield substantial efficiency gains with provable inference, but at a cost: the models can no longer infer any information about individual documents. Thresholded Linear Inverse was developed to learn document-specific topic compositions, but its linear nature limits inference quality because it does not consider any prior information on topic distributions. We propose two novel estimation methods that respect the previously unclear prior structures of spectral topic models. Experiments on a variety of collections, from synthetic to real, demonstrate that our Prior-Aware Dual Decomposition outperforms the baseline method, whereas our Prior-Aware Manifold Iteration performs even better on short realistic data.
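
To make the trade-off described in the abstract concrete, here is a minimal sketch of how a corpus might be compressed into a word co-occurrence matrix. It assumes bag-of-words documents given as lists of integer word ids. The function name `cooccurrence`, the toy corpus, and the normalization (each document contributes its pairs of distinct token positions with equal total weight) are illustrative assumptions, not necessarily the construction used in the paper.

```python
def cooccurrence(docs, vocab_size):
    """Build a joint word-word co-occurrence matrix from bag-of-words docs.

    docs: list of documents, each a list of integer word ids (assumed format).
    Returns a vocab_size x vocab_size list of lists whose entries sum to 1.
    """
    C = [[0.0] * vocab_size for _ in range(vocab_size)]
    used = 0
    for doc in docs:
        n = len(doc)
        if n < 2:
            continue  # a single token yields no word pairs
        counts = [0] * vocab_size
        for w in doc:
            counts[w] += 1
        pairs = n * (n - 1)  # ordered pairs of distinct token positions
        for i in range(vocab_size):
            for j in range(vocab_size):
                # number of distinct-position pairs holding words i and j
                c = counts[i] * counts[j] - (counts[i] if i == j else 0)
                C[i][j] += c / pairs
        used += 1
    if used == 0:
        raise ValueError("need at least one document with two or more tokens")
    # average the per-document pair distributions
    return [[v / used for v in row] for row in C]


# Toy corpus over a 3-word vocabulary (hypothetical data).
docs = [[0, 1, 1], [2, 0]]
C = cooccurrence(docs, vocab_size=3)
total = sum(sum(row) for row in C)
```

The returned matrix has a fixed size determined by the vocabulary, regardless of how many documents were read, and nothing in it records which document a word pair came from. That is the sense in which a model learned from such a matrix cannot, by itself, say anything about individual documents.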

Topics

Machine Learning > Core Methods > Clustering
Machine Learning > Core Methods > Representation Learning
Machine Learning > Optimization & Theory > Bayesian Inference
Machine Learning > Core Methods > Probabilistic Modeling
Machine Learning > Learning Types > Representation Learning
Machine Learning > Optimization & Theory > Representation Learning
Deep Learning > Techniques > Representation Learning
Machine Learning > Learning Paradigms > Representation Learning

Keywords

spectral algorithm, topic model, dual decomposition, latent topic discovery, latent topic, spectral topic model, prior-aware inference, manifold iteration, document-specific inference
