Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Theory
1072 directly classified papers
Papers per year
2007: 1
2010: 4
2011: 1
2012: 3
2013: 4
2014: 5
2015: 2
2016: 11
2017: 31
2018: 47
2019: 67
2020: 97
2021: 128
2022: 225
2023: 155
2024: 209
2025: 81
2026: 1
Papers
The First Theoretical Approximation Guarantees for the Non-Dominated Sorting Genetic Algorithm III (NSGA-III)
IJCAI 2025
Unraveling the Smoothness Properties of Diffusion Models: A Gaussian Mixture Perspective
ICCV 2025
Activation Subspaces for Out-of-Distribution Detection
ICCV 2025
Error Analysis Affected by Heavy-Tailed Gradients for Non-Convex Pairwise Stochastic Gradient Descent
AAAI 2025
Statistical inference on black-box generative models in the data kernel perspective space
ACL 2025
Verification of Neural Networks Against Convolutional Perturbations via Parameterised Kernels
AAAI 2025
Interpret and Improve In-Context Learning via the Lens of Input-Label Mappings
ACL 2025
Implicit Regularization in Deep Tucker Factorization: Low-Rankness via Structured Sparsity
AISTATS 2024
Effect of Ambient-Intrinsic Dimension Gap on Adversarial Vulnerability
AISTATS 2024
From Activation to Initialization: Scaling Insights for Optimizing Neural Fields
CVPR 2024
Scaling Laws for Data Filtering-- Data Curation cannot be Compute Agnostic
CVPR 2024
Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields
CVPR 2024
Neural Networks with Sparse Activation Induced by Large Bias: Tighter Analysis with Bias-Generalized NTK
JMLR 2024
On Neural Networks as Infinite Tree-Structured Probabilistic Graphical Models
NIPS 2024
Finsler-Laplace-Beltrami Operators with Application to Shape Analysis
CVPR 2024
The Loss Landscape of Deep Linear Neural Networks: a Second-order Analysis
JMLR 2024
Convergence of Message-Passing Graph Neural Networks with Generic Aggregation on Large Random Graphs
JMLR 2024
Temporal Graph Neural Tangent Kernel with Graphon-Guaranteed
NIPS 2024
Can Transformers Learn n-gram Language Models?
EMNLP 2024
Detection and Measurement of Syntactic Templates in Generated Text
EMNLP 2024
High Probability Convergence Bounds for Non-convex Stochastic Gradient Descent with Sub-Weibull Noise
JMLR 2024
Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of Transformers
NIPS 2024
Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
EMNLP 2024
Law of Large Numbers and Central Limit Theorem for Wide Two-layer Neural Networks: The Mini-Batch and Noisy Case
JMLR 2024
A PDE-based Explanation of Extreme Numerical Sensitivities and Edge of Stability in Training Neural Networks
JMLR 2024
<
1
2
3
4
5
…
43
>