Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Theory
1072 directly classified papers
Papers per year
2007: 1
2010: 4
2011: 1
2012: 3
2013: 4
2014: 5
2015: 2
2016: 11
2017: 31
2018: 47
2019: 67
2020: 97
2021: 128
2022: 225
2023: 155
2024: 209
2025: 81
2026: 1
Papers
Overparametrized Multi-layer Neural Networks: Uniform Concentration of Neural Tangent Kernel and Convergence of Stochastic Gradient Descent
JMLR 2024
The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective
JMLR 2024
Sketchy Moment Matching: Toward Fast and Provable Data Selection for Finetuning
NIPS 2024
Multiple Descent in the Multiple Random Feature Model
JMLR 2024
On the Effect of Initialization: The Scaling Path of 2-Layer Neural Networks
JMLR 2024
The Challenges of the Nonlinear Regime for Physics-Informed Neural Networks
NIPS 2024
Model Collapse Demystified: The Case of Regression
NIPS 2024
On the Sparsity of the Strong Lottery Ticket Hypothesis
NIPS 2024
Benign overfitting in leaky ReLU networks with moderate input dimension
NIPS 2024
Improved Sample Complexity Bounds for Diffusion Model Training
NIPS 2024
Simplicity Bias in Overparameterized Machine Learning
AAAI 2024
Universal In-Context Approximation By Prompting Fully Recurrent Models
NIPS 2024
Topological obstruction to the training of shallow ReLU neural networks
NIPS 2024
The Expressive Capacity of State Space Models: A Formal Language Perspective
NIPS 2024
Graph Neural Networks Do Not Always Oversmooth
NIPS 2024
Understanding the Expressive Power and Mechanisms of Transformer for Sequence Modeling
NIPS 2024
Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature
NIPS 2024
The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof
NIPS 2024
Evaluating the design space of diffusion-based generative models
NIPS 2024
Global Convergence in Training Large-Scale Transformers
NIPS 2024
Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data
NIPS 2024
Lyapunov-Stable Deep Equilibrium Models
AAAI 2024
Globally Convergent Variational Inference
NIPS 2024
Separation and Bias of Deep Equilibrium Models on Expressivity and Learning Dynamics
NIPS 2024
Gradients of Functions of Large Matrices
NIPS 2024
<
1
…
5
6
7
…
43
>