Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Theory
1072 directly classified papers
Papers per year
2007: 1
2010: 4
2011: 1
2012: 3
2013: 4
2014: 5
2015: 2
2016: 11
2017: 31
2018: 47
2019: 67
2020: 97
2021: 128
2022: 225
2023: 155
2024: 209
2025: 81
2026: 1
Papers
The Loss Landscape of Deep Linear Neural Networks: a Second-order Analysis
JMLR 2024
Finsler-Laplace-Beltrami Operators with Application to Shape Analysis
CVPR 2024
The Implicit Bias of Gradient Descent toward Collaboration between Layers: A Dynamic Analysis of Multilayer Perceptions
NIPS 2024
Learnability of high-dimensional targets by two-parameter models and gradient flow
NIPS 2024
Convergence of Message-Passing Graph Neural Networks with Generic Aggregation on Large Random Graphs
JMLR 2024
Scaling Law for Time Series Forecasting
NIPS 2024
Large Stepsize Gradient Descent for Non-Homogeneous Two-Layer Networks: Margin Improvement and Fast Optimization
NIPS 2024
Temporal Graph Neural Tangent Kernel with Graphon-Guaranteed
NIPS 2024
Length independent PAC-Bayes bounds for Simple RNNs
AISTATS 2024
Hardness of Learning Neural Networks under the Manifold Hypothesis
NIPS 2024
Great Minds Think Alike: The Universal Convergence Trend of Input Salience
NIPS 2024
What do Graph Neural Networks learn? Insights from Tropical Geometry
NIPS 2024
Approximation Rate of the Transformer Architecture for Sequence Modeling
NIPS 2024
Globally Convergent Variational Inference
NIPS 2024
SVARM-IQ: Efficient Approximation of Any-order Shapley Interactions through Stratification
AISTATS 2024
The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof
NIPS 2024
Log-concave Sampling from a Convex Body with a Barrier: a Robust and Unified Dikin Walk
NIPS 2024
Separation and Bias of Deep Equilibrium Models on Expressivity and Learning Dynamics
NIPS 2024
Symmetries in Overparametrized Neural Networks: A Mean Field View
NIPS 2024
On the Sparsity of the Strong Lottery Ticket Hypothesis
NIPS 2024
Learning Neural Contracting Dynamics: Extended Linearization and Global Guarantees
NIPS 2024
The Expressive Capacity of State Space Models: A Formal Language Perspective
NIPS 2024
Leveraging PAC-Bayes Theory and Gibbs Distributions for Generalization Bounds with Complexity Measures
AISTATS 2024
Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers
NIPS 2024
Spectrum Extraction and Clipping for Implicitly Linear Layers
AISTATS 2024
<
1
…
8
9
10
…
43
>