Last-Iterate Convergence Rates for Min-Max Optimization: Convergence of Hamiltonian Gradient Descent and Consensus Optimization

Jacob Abernethy; Kevin A. Lai; Andre Wibisono

2021 ALT ALT 2021

Last-Iterate Convergence Rates for Min-Max Optimization: Convergence of Hamiltonian Gradient Descent and Consensus Optimization

Abstract

While classic work in convex-concave min-max optimization relies on average-iterate convergence results, the emergence of nonconvex applications such as training Generative Adversarial Networks has led to renewed interest in last-iterate convergence guarantees. Proving last-iterate convergence is challenging because many natural algorithms, such as Simultaneous Gradient Descent/Ascent, provably diverge or cycle even in simple convex-concave min-max settings, and there are relatively few papers that prove global last-iterate convergence rates beyond the bilinear and convex-strongly concave settings. In this work, we show that the Hamiltonian Gradient Descent (HGD) algorithm achieves linear convergence in a variety of more general settings, including convex-concave problems that satisfy a "sufficiently bilinear" condition. We also prove convergence rates for stochastic HGD and for some parameter settings of the Consensus Optimization algorithm of Mescheder et al. (2017).

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — hamiltonian gradient descent

🐣 Hot Topic Early Bird — min-max optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Machine Learning, Mathematics & Optimization, Reinforcement Learning, Robotics, Speech & Audio

Authors

Jacob Abernethy , Kevin A. Lai , Andre Wibisono

Topics

Machine Learning > Optimization & Theory > Optimization Mathematics & Optimization > Optimization > Game Theory

Keywords

consensus optimization last-iterate convergence min-max optimization convex-concave game hamiltonian gradient descent

Download PDF

Related papers

Statistical guarantees for generative models without domination 2021

Stochastic Dueling Bandits with Adversarial Corruption 2021

Asymptotically Optimal Strategies For Combinatorial Semi-Bandits in Polynomial Time 2021

Efficient sampling from the Bingham distribution 2021

Attribute-Efficient Learning of Halfspaces with Malicious Noise: Near-Optimal Label Complexity and Noise Tolerance 2021