Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3648 directly classified papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Structured Pruning for Diverse Best-of-N Reasoning Optimization
ACL 2025
Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild
CVPR 2025
DEIM: DETR with Improved Matching for Fast Convergence
CVPR 2025
Parameterized Blur Kernel Prior Learning for Local Motion Deblurring
CVPR 2025
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
CVPR 2025
Subnet-Aware Dynamic Supernet Training for Neural Architecture Search
CVPR 2025
Learning from Streaming Video with Orthogonal Gradients
CVPR 2025
VI^3NR: Variance Informed Initialization for Implicit Neural Representations
CVPR 2025
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
CVPR 2025
Task-driven Layerwise Additive Activation Intervention
NAACL 2025
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers
JMLR 2025
DIRAS: Efficient LLM Annotation of Document Relevance for Retrieval Augmented Generation
NAACL 2025
Do Your Best and Get Enough Rest for Continual Learning
CVPR 2025
FASTer: Focal token Acquiring-and-Scaling Transformer for Long-term 3D Objection Detection
CVPR 2025
Threshold Filtering Packing for Supervised Fine-Tuning: Training Related Samples within Packs
NAACL 2025
Learning Instance-Specific Parameters of Black-Box Models using Differentiable Surrogates
WACV 2025
DMPT: Decoupled Modality-Aware Prompt Tuning for Multi-Modal Object Re-Identification
WACV 2025
Transformers without Normalization
CVPR 2025
Optimizing Neural Network Effectiveness via Non-Monotonicity Refinement
WACV 2025
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
CVPR 2025
Linear Recency Bias During Training Improves Transformers’ Fit to Reading Times
COLING 2025
Flashback: Memory Mechanism for Enhancing Memory Efficiency and Speed in Deep Sequential Models
COLING 2025
Let’s Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model
COLING 2025
Disentangle to Decay: Linear Attention with Trainable Decay Factor
COLING 2025
On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning
COLING 2025
<
1
…
5
6
7
…
146
>