Minimizing Convex Functionals over Space of Probability Measures via KL Divergence Gradient Flow

Rentian Yao; Linjun Huang; Yun Yang

AISTATS 2024

Abstract

Motivated by the computation of the non-parametric maximum likelihood estimator (NPMLE) and the Bayesian posterior in statistics, this paper explores the problem of convex optimization over the space of all probability distributions. We introduce an implicit scheme, called the implicit KL proximal descent (IKLPD) algorithm, for discretizing a continuous-time gradient flow relative to the Kullback–Leibler (KL) divergence for minimizing a convex target functional. We show that IKLPD converges to a global optimum at a polynomial rate from any initialization; moreover, if the objective functional is strongly convex relative to the KL divergence, for example, when the target functional itself is a KL divergence as in the context of Bayesian posterior computation, IKLPD exhibits globally exponential convergence. Computationally, we propose a numerical method based on normalizing flow to realize IKLPD. Conversely, our numerical method can also be viewed as a new approach that sequentially trains a normalizing flow for minimizing a convex functional with a strong theoretical guarantee.
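As a hedged illustration of what an implicit KL proximal step might look like, the sketch below writes one iteration in a generic form. The symbols $F$ (the convex target functional), $\tau > 0$ (the step size), $\mathcal{P}(\Theta)$ (the space of probability distributions) and $\rho_k$ (the $k$-th iterate) are notation assumed here for illustration, as is the direction of the KL term; the paper's own notation and exact scheme may differ.

```latex
% Generic implicit (proximal) step relative to the KL divergence; notation is assumed.
\rho_{k+1} \in \operatorname*{arg\,min}_{\rho \in \mathcal{P}(\Theta)}
  \Big\{ F(\rho) + \frac{1}{\tau}\, D_{\mathrm{KL}}(\rho \,\|\, \rho_k) \Big\},
  \qquad k = 0, 1, 2, \dots

% Special case F(\rho) = D_{\mathrm{KL}}(\rho \,\|\, \pi) for a target posterior \pi:
% setting the first variation to zero gives a closed-form geometric interpolation,
\rho_{k+1} \;\propto\; \rho_k^{\,1/(1+\tau)} \, \pi^{\,\tau/(1+\tau)} .
```

In this special case, $\log(\rho_k/\pi)$ shrinks by a factor of $1/(1+\tau)$ per step, up to an additive normalizing constant, which is consistent with the exponential convergence the abstract describes for objectives that are strongly convex relative to the KL divergence.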

Topics

Mathematics & Optimization > Mathematics > Probability
Mathematics & Optimization > Mathematics > Statistics
Mathematics & Optimization > Optimization > Continuous Optimization
Machine Learning > Bayesian & Probabilistic > Bayesian Inference
Mathematics & Optimization > Optimization > Convex Optimization
Machine Learning > Bayesian & Probabilistic > Variational Inference

Keywords

variational inference, convex optimization, Bayesian inference, KL divergence, probability measure, normalizing flow, gradient flow, nonparametric maximum likelihood
