Generalization Bounds for Noisy Iterative Algorithms Using Properties of Additive Noise Channels

Hao Wang; Rui Gao; Flavio P. Calmon

JMLR 2023

Abstract

Machine learning models trained by different optimization algorithms under different data distributions can exhibit distinct generalization behaviors. In this paper, we analyze the generalization of models trained by noisy iterative algorithms. We derive distribution-dependent generalization bounds by connecting noisy iterative algorithms to additive noise channels found in communication and information theory. Our generalization bounds shed light on several applications, including differentially private stochastic gradient descent (DP-SGD), federated learning, and stochastic gradient Langevin dynamics (SGLD). We demonstrate our bounds through numerical experiments, showing that they can help understand recent empirical observations of the generalization phenomena of neural networks. © JMLR 2023.
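To make the term "noisy iterative algorithm" concrete, here is a minimal sketch of one such procedure: mini-batch gradient descent on a least-squares objective with Gaussian noise added to every update. It is an illustration only, not the authors' algorithm or experimental setup. The function name `noisy_sgd`, the loss, the synthetic data, and all hyperparameter values are assumptions chosen for the example.

```python
import numpy as np


def noisy_sgd(X, y, steps=200, lr=0.05, noise_std=0.01, batch_size=8, seed=0):
    """Mini-batch SGD on least squares with Gaussian noise added to each update.

    Hypothetical helper for illustration; not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(steps):
        # Sample a mini-batch without replacement.
        idx = rng.choice(n, size=batch_size, replace=False)
        residual = X[idx] @ w - y[idx]
        grad = X[idx].T @ residual / batch_size
        # Additive-noise view: the next iterate is a deterministic function
        # of (current iterate, mini-batch) plus independent Gaussian noise.
        w = w - lr * grad + noise_std * rng.standard_normal(d)
    return w


# Synthetic, noiseless regression data (assumed for the example).
data_rng = np.random.default_rng(1)
X = data_rng.standard_normal((64, 3))
w_true = np.array([1.0, -2.0, 0.5])
y = X @ w_true

w_hat = noisy_sgd(X, y)
print("estimate:", w_hat)
```

Each update has the form "deterministic map of the previous iterate and the data, plus independent noise", which is the structure the abstract relates to an additive noise channel. Raising `noise_std` makes each iterate reveal less about the sampled mini-batch, at the cost of a noisier final estimate.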

Authors

Hao Wang, Rui Gao, Flavio P. Calmon

Topics

Machine Learning > Optimization & Theory > Learning Theory

Machine Learning > Optimization & Theory > Stochastic Processes

Machine Learning > Optimization & Theory > Theory

Machine Learning > Optimization & Theory > Information Theory

Machine Learning > Learning Types > Deep Learning

Keywords

information theory, federated learning, differential privacy, generalization bound, stochastic gradient Langevin dynamics, noisy iterative algorithm, additive noise channel, differentially private stochastic gradient descent

Related papers

Flexible Model Aggregation for Quantile Regression 2023

Efficient Computation of Rankings from Pairwise Comparisons 2023

Efficient Structure-preserving Support Tensor Train Machine 2023

Attacks against Federated Learning Defense Systems and their Mitigation 2023

How Do You Want Your Greedy: Simultaneous or Repeated? 2023