2024
NIPS
NeurIPS 2024
Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input
Abstract
In this work, we study the mean-field flow for learning subspace-sparse polynomials using stochastic gradient descent and two-layer neural networks, where the input distribution is standard Gaussian and the output only depends on the projection of the input onto a low-dimensional subspace. We establish a necessary condition for SGD-learnability, involving both the characteristics of the target function and the expressiveness of the activation function. In addition, we prove that the condition is almost sufficient, in the sense that a condition slightly stronger than the necessary condition can guarantee the exponential decay of the loss functional to zero.
🌉
Interdisciplinary Bridge
— Deep Learning and Machine Learning and Mathematics & Optimization
🧭
Keyword Pioneer
— subspace-sparse polynomial
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Optimization & Theory > Learning Theory
Machine Learning > Optimization & Theory > Stochastic Processes
Machine Learning > Optimization & Theory > Theory
Deep Learning > Architectures > Neural Networks
Mathematics & Optimization > Mathematics > Probability
Deep Learning > Optimization & Theory > Theory