Wide stochastic networks: Gaussian limit and PAC-Bayesian training

Eugenio Clerico; George Deligiannidis; Arnaud Doucet

2023 ALT ALT 2023

Wide stochastic networks: Gaussian limit and PAC-Bayesian training

Abstract

The limit of infinite width allows for substantial simplifications in the analytical study of over- parameterised neural networks. With a suitable random initialisation, an extremely large network exhibits an approximately Gaussian behaviour. In the present work, we establish a similar result for a simple stochastic architecture whose parameters are random variables, holding both before and during training. The explicit evaluation of the output distribution allows for a PAC-Bayesian training procedure that directly optimises the generalisation bound. For a large but finite-width network, we show empirically on MNIST that this training approach can outperform standard PAC- Bayesian methods.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

🧭 Keyword Pioneer — pac-bayesian training

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Eugenio Clerico , George Deligiannidis , Arnaud Doucet

Topics

Machine Learning > Optimization & Theory > Bayesian Inference Deep Learning > Architectures > Neural Networks

Keywords

gaussian process infinite width neural network pac-bayesian training

Download PDF

Related papers

Perceptronic Complexity and Online Matrix Completion 2023

Tournaments, Johnson Graphs and NC-Teaching 2023

On the complexity of finding stationary points of smooth functions in one dimension 2023

Algorithmic Learning Theory 2023: Preface 2023

Adversarially Robust Learning with Tolerance 2023