Convergence of Langevin MCMC in KL-divergence

Xiang Cheng; Peter Bartlett

2018 ALT ALT 2018

Convergence of Langevin MCMC in KL-divergence

Abstract

Langevin diffusion is a commonly used tool for sampling from a given distribution. In this work, we establish that when the target density $\p^*$ is such that $\log \p^*$ is $L$ smooth and $m$ strongly convex, discrete Langevin diffusion produces a distribution $\p$ with $\KL{\p}{\p^*}≤ε$ in $\tilde{O}(\frac{d}{ε})$ steps, where $d$ is the dimension of the sample space. We also study the convergence rate when the strong-convexity assumption is absent. By considering the Langevin diffusion as a gradient flow in the space of probability distributions, we obtain an elegant analysis that applies to the stronger property of convergence in KL-divergence and gives a conceptually simpler proof of the best-known convergence results in weaker metrics.

🐣 Hot Topic Early Bird — kl divergence

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Xiang Cheng , Peter Bartlett

Topics

Machine Learning > Optimization & Theory > Bayesian Inference Machine Learning > Optimization & Theory > Stochastic Processes

Keywords

kl divergence markov chain monte carlo strong convexity gradient flow sampling algorithm langevin diffusion

Download PDF

Related papers

Dimension-free Information Concentration via Exp-Concavity 2018

Multi-task {K}ernel {L}earning Based on {P}robabilistic {L}ipschitzness 2018

An Adaptive Strategy for Active Learning with Smooth Decision Boundary 2018

Corrupt Bandits for Preserving Local Privacy 2018

Online Learning of Combinatorial Objects via Extended Formulation 2018