← Optimization & Theory

Deep Learning › Optimization & Theory ›

Theory

1072 directly classified papers

Papers per year

Papers

UMB: Understanding Model Behavior for Open-World Object Detection NIPS 2024

Provable Robustness against a Union of L_0 Adversarial Attacks AAAI 2024

Deep linear networks for regression are implicitly regularized towards flat minima NIPS 2024

Parameter Symmetry and Noise Equilibrium of Stochastic Gradient Descent NIPS 2024

Learnability of high-dimensional targets by two-parameter models and gradient flow NIPS 2024

SVARM-IQ: Efficient Approximation of Any-order Shapley Interactions through Stratification AISTATS 2024

Error Correction Output Codes for Robust Neural Networks against Weight-errors: A Neural Tangent Kernel Point of View NIPS 2024

Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step Sizes NIPS 2024

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space EMNLP 2024

One-Layer Transformer Provably Learns One-Nearest Neighbor In Context NIPS 2024

Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models EMNLP 2024

Towards Efficient Verification of Quantized Neural Networks AAAI 2024

Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models EMNLP 2024

Globally Convergent Variational Inference NIPS 2024

A provable control of sensitivity of neural networks through a direct parameterization of the overall bi-Lipschitzness NIPS 2024

Leveraging PAC-Bayes Theory and Gibbs Distributions for Generalization Bounds with Complexity Measures AISTATS 2024

Base of RoPE Bounds Context Length NIPS 2024

Spectrum Extraction and Clipping for Implicitly Linear Layers AISTATS 2024

Unraveling the Gradient Descent Dynamics of Transformers NIPS 2024

Towards Large Certified Radius in Randomized Smoothing Using Quasiconcave Optimization AAAI 2024

Implicit Optimization Bias of Next-token Prediction in Linear Models NIPS 2024

Effect of Ambient-Intrinsic Dimension Gap on Adversarial Vulnerability AISTATS 2024

Nonlinear dynamics of localization in neural receptive fields NIPS 2024

Combining Statistical Depth and Fermat Distance for Uncertainty Quantification NIPS 2024

An exactly solvable model for emergence and scaling laws in the multitask sparse parity problem NIPS 2024