Near-Optimality of Contrastive Divergence Algorithms

Pierre Glaser; Kevin Han Huang; Arthur Gretton

2024 NIPS NeurIPS 2024

Near-Optimality of Contrastive Divergence Algorithms

Abstract

We provide a non-asymptotic analysis of the contrastive divergence (CD) algorithm, a training method for unnormalized models. While prior work has established that (for exponential family distributions) the CD iterates asymptotically converge at an $O(n^{-1 / 3})$ rate to the true parameter of the data distribution, we show that CD can achieve the parametric rate $O(n^{-1 / 2})$. Our analysis provides results for various data batching schemes, including fully online and minibatch. We additionally show that CD is near-optimal, in the sense that its asymptotic variance is close to the Cramér-Rao lower bound.

🧭 Keyword Pioneer — parametric rate

🐝 Cross-Pollinator — Artificial Intelligence, Deep Learning, Machine Learning, Mathematics & Optimization

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Mathematics & Optimization

Authors

Pierre Glaser , Kevin Han Huang , Arthur Gretton

Topics

Machine Learning > Learning Types > Unsupervised Learning Machine Learning > Optimization & Theory > Optimization Machine Learning > Optimization & Theory > Statistical Learning Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling Mathematics & Optimization > Statistics Deep Learning > Learning Types > Generative Models

Keywords

parameter estimation asymptotic analysis exponential family cramér-rao bound asymptotic variance contrastive divergence generative model unnormalized model parametric rate

Download PDF

Related papers

SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers 2024

Training for Stable Explanation for Free 2024

NeuralSolver: Learning Algorithms For Consistent and Efficient Extrapolation Across General Tasks 2024

Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation Mismatch 2024

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence 2024