← Optimization & Theory

Deep Learning › Optimization & Theory ›

Theory

1072 directly classified papers

Papers per year

Papers

The Loss Landscape of Deep Linear Neural Networks: a Second-order Analysis JMLR 2024

Finsler-Laplace-Beltrami Operators with Application to Shape Analysis CVPR 2024

The Implicit Bias of Gradient Descent toward Collaboration between Layers: A Dynamic Analysis of Multilayer Perceptions NIPS 2024

Learnability of high-dimensional targets by two-parameter models and gradient flow NIPS 2024

Convergence of Message-Passing Graph Neural Networks with Generic Aggregation on Large Random Graphs JMLR 2024

Scaling Law for Time Series Forecasting NIPS 2024

Large Stepsize Gradient Descent for Non-Homogeneous Two-Layer Networks: Margin Improvement and Fast Optimization NIPS 2024

Temporal Graph Neural Tangent Kernel with Graphon-Guaranteed NIPS 2024

Length independent PAC-Bayes bounds for Simple RNNs AISTATS 2024

Hardness of Learning Neural Networks under the Manifold Hypothesis NIPS 2024

Great Minds Think Alike: The Universal Convergence Trend of Input Salience NIPS 2024

What do Graph Neural Networks learn? Insights from Tropical Geometry NIPS 2024

Approximation Rate of the Transformer Architecture for Sequence Modeling NIPS 2024

Globally Convergent Variational Inference NIPS 2024

SVARM-IQ: Efficient Approximation of Any-order Shapley Interactions through Stratification AISTATS 2024

The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof NIPS 2024

Log-concave Sampling from a Convex Body with a Barrier: a Robust and Unified Dikin Walk NIPS 2024

Separation and Bias of Deep Equilibrium Models on Expressivity and Learning Dynamics NIPS 2024

Symmetries in Overparametrized Neural Networks: A Mean Field View NIPS 2024

On the Sparsity of the Strong Lottery Ticket Hypothesis NIPS 2024

Learning Neural Contracting Dynamics: Extended Linearization and Global Guarantees NIPS 2024

The Expressive Capacity of State Space Models: A Formal Language Perspective NIPS 2024

Leveraging PAC-Bayes Theory and Gibbs Distributions for Generalization Bounds with Complexity Measures AISTATS 2024

Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers NIPS 2024

Spectrum Extraction and Clipping for Implicitly Linear Layers AISTATS 2024