Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Theory
1072 directly classified papers
Papers per year
2007: 1
2010: 4
2011: 1
2012: 3
2013: 4
2014: 5
2015: 2
2016: 11
2017: 31
2018: 47
2019: 67
2020: 97
2021: 128
2022: 225
2023: 155
2024: 209
2025: 81
2026: 1
Papers
UMB: Understanding Model Behavior for Open-World Object Detection
NIPS 2024
Provable Robustness against a Union of L_0 Adversarial Attacks
AAAI 2024
Deep linear networks for regression are implicitly regularized towards flat minima
NIPS 2024
Parameter Symmetry and Noise Equilibrium of Stochastic Gradient Descent
NIPS 2024
Learnability of high-dimensional targets by two-parameter models and gradient flow
NIPS 2024
SVARM-IQ: Efficient Approximation of Any-order Shapley Interactions through Stratification
AISTATS 2024
Error Correction Output Codes for Robust Neural Networks against Weight-errors: A Neural Tangent Kernel Point of View
NIPS 2024
Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step Sizes
NIPS 2024
Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
EMNLP 2024
One-Layer Transformer Provably Learns One-Nearest Neighbor In Context
NIPS 2024
Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models
EMNLP 2024
Towards Efficient Verification of Quantized Neural Networks
AAAI 2024
Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models
EMNLP 2024
Globally Convergent Variational Inference
NIPS 2024
A provable control of sensitivity of neural networks through a direct parameterization of the overall bi-Lipschitzness
NIPS 2024
Leveraging PAC-Bayes Theory and Gibbs Distributions for Generalization Bounds with Complexity Measures
AISTATS 2024
Base of RoPE Bounds Context Length
NIPS 2024
Spectrum Extraction and Clipping for Implicitly Linear Layers
AISTATS 2024
Unraveling the Gradient Descent Dynamics of Transformers
NIPS 2024
Towards Large Certified Radius in Randomized Smoothing Using Quasiconcave Optimization
AAAI 2024
Implicit Optimization Bias of Next-token Prediction in Linear Models
NIPS 2024
Effect of Ambient-Intrinsic Dimension Gap on Adversarial Vulnerability
AISTATS 2024
Nonlinear dynamics of localization in neural receptive fields
NIPS 2024
Combining Statistical Depth and Fermat Distance for Uncertainty Quantification
NIPS 2024
An exactly solvable model for emergence and scaling laws in the multitask sparse parity problem
NIPS 2024
<
1
…
9
10
11
…
43
>