Research Explorer
Neural Network Optimization
3648 directly classified papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Imbalanced Gradients in RL Post-Training of Multi-Task LLMs
EACL 2026
Acceleration of Backpropagation in Linear Layers of Transformer Models Based on Gradient Structure
EACL 2026
Iterative Structured Pruning for Large Language Models with Multi-Domain Calibration
EACL 2026
Suppressing Final Layer Hidden State Jumps in Transformer Pretraining
EACL 2026
Stabilizing Direct Training of Spiking Neural Networks: Membrane Potential Initialization and Threshold-robust Surrogate Gradient
WACV 2026
Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios
EACL 2026
Representation Collapse in Machine Translation Through the Lens of Angular Dispersion
EACL 2026
Improving Chain-of-Thought for Logical Reasoning via Attention-Aware Intervention
EACL 2026
Gated Temporal Fusion Transformers for Robust Multi-Object Tracking
WACV 2026
High-Rate Mixout: Revisiting Mixout for Robust Domain Generalization
WACV 2026
GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection
EACL 2026
Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning
EACL 2026
1LoRA: Summation Compression for Very Low-Rank Adaptation
WACV 2026
Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
EACL 2026
On the Dataless Training of Neural Networks
AAAI 2026
Sleep-Like Replay Reduces Loss-Landscape Sharpness to Improve Generalization (Student Abstract)
AAAI 2026
EA: Managing Green Data Centers Using Deep Reinforcement Learning Without Discounting
AAAI 2026
Super Level Sets and Exponential Decay: A Synergistic Approach to Stable Neural Network Training (Abstract Reprint)
AAAI 2026
Scalable Synthesis of Formally Verified Neural Value Function for Hamilton-Jacobi Reachability Analysis (Abstract Reprint)
AAAI 2026
MoLoRA: Boosting LLM-based End-to-end Speech Translation with Mixture of Low-rank Experts
AAAI 2026
Dynamic Deep Prompt Optimization for Defending Against Jailbreak Attacks on LLMs
AAAI 2026
Scaling and Transferability of Annealing Strategies in Large Language Model Training
AAAI 2026
Efficient Post-Training Refinement of Latent Reasoning in Large Language Models
AAAI 2026
AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin
AAAI 2026
OPERA: A Reinforcement Learning–Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval
AAAI 2026