Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent

Jason Altschuler; Sinho Chewi; Patrik R Gerber; Austin Stromme

2021 NIPS NeurIPS 2021

Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent

Abstract

We study first-order optimization algorithms for computing the barycenter of Gaussian distributions with respect to the optimal transport metric. Although the objective is geodesically non-convex, Riemannian gradient descent empirically converges rapidly, in fact faster than off-the-shelf methods such as Euclidean gradient descent and SDP solvers. This stands in stark contrast to the best-known theoretical results, which depend exponentially on the dimension. In this work, we prove new geodesic convexity results which provide stronger control of the iterates, yielding a dimension-free convergence rate. Our techniques also enable the analysis of two related notions of averaging, the entropically-regularized barycenter and the geometric median, providing the first convergence guarantees for these problems.

🐣 Hot Topic Early Bird — gaussian distribution

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Jason Altschuler , Sinho Chewi , Patrik R Gerber , Austin Stromme

Topics

Mathematics & Optimization > Mathematics > Probability Mathematics & Optimization > Optimization > Continuous Optimization

Keywords

riemannian optimization optimal transport gradient descent gaussian distribution geodesic convexity

Download PDF

Related papers

Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data 2021

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation 2021

Test-Time Personalization with a Transformer for Human Pose Estimation 2021

NTopo: Mesh-free Topology Optimization using Implicit Neural Representations 2021

Scalable Intervention Target Estimation in Linear Models 2021