← Optimization & Theory

Deep Learning › Optimization & Theory ›

Theory

1072 directly classified papers

Papers per year

Papers

Overparametrized Multi-layer Neural Networks: Uniform Concentration of Neural Tangent Kernel and Convergence of Stochastic Gradient Descent JMLR 2024

The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective JMLR 2024

Sketchy Moment Matching: Toward Fast and Provable Data Selection for Finetuning NIPS 2024

Multiple Descent in the Multiple Random Feature Model JMLR 2024

On the Effect of Initialization: The Scaling Path of 2-Layer Neural Networks JMLR 2024

The Challenges of the Nonlinear Regime for Physics-Informed Neural Networks NIPS 2024

Model Collapse Demystified: The Case of Regression NIPS 2024

On the Sparsity of the Strong Lottery Ticket Hypothesis NIPS 2024

Benign overfitting in leaky ReLU networks with moderate input dimension NIPS 2024

Improved Sample Complexity Bounds for Diffusion Model Training NIPS 2024

Simplicity Bias in Overparameterized Machine Learning AAAI 2024

Universal In-Context Approximation By Prompting Fully Recurrent Models NIPS 2024

Topological obstruction to the training of shallow ReLU neural networks NIPS 2024

The Expressive Capacity of State Space Models: A Formal Language Perspective NIPS 2024

Graph Neural Networks Do Not Always Oversmooth NIPS 2024

Understanding the Expressive Power and Mechanisms of Transformer for Sequence Modeling NIPS 2024

Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature NIPS 2024

The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof NIPS 2024

Evaluating the design space of diffusion-based generative models NIPS 2024

Global Convergence in Training Large-Scale Transformers NIPS 2024

Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data NIPS 2024

Lyapunov-Stable Deep Equilibrium Models AAAI 2024

Globally Convergent Variational Inference NIPS 2024

Separation and Bias of Deep Equilibrium Models on Expressivity and Learning Dynamics NIPS 2024

Gradients of Functions of Large Matrices NIPS 2024