Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3648 directly classified papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Representation Collapse in Machine Translation Through the Lens of Angular Dispersion
EACL 2026
Distillation Dynamics: Towards Understanding Feature-Based Distillation in Vision Transformers
AAAI 2026
Call, Reward, Repeat: Advancing Dialog State Tracking with GRPO and Function Calling
EACL 2026
Acceleration of Backpropagation in Linear Layers of Transformer Models Based on Gradient Structure
EACL 2026
Activation-wise Propagation: A One-Timestep Strategy for Spiking Neural Networks
AAAI 2026
Correcting Quantization-Induced Gradient Mismatch in Neural Image Compression
AAAI 2026
AdaptViG: Adaptive Vision GNN with Exponential Decay Gating
WACV 2026
Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning
EACL 2026
Can Calibration of Positional Encodings Enhance Long Context Utilization?
EACL 2026
Improving Chain-of-Thought for Logical Reasoning via Attention-Aware Intervention
EACL 2026
DevLake at LoResMT 2026: The Impact of Pre-training and Model Scale on Russian-Bashkir Low-Resource Translation
EACL 2026
WeightFlow: Learning Stochastic Dynamics via Evolving Weight of Neural Network
AAAI 2026
Generalized Threshold Optimization with Harmony Multi-Threshold Neurons for Accurate ANN-to-SNN Conversion
AAAI 2026
I-INR: Iterative Implicit Neural Representations
AAAI 2026
Gated Temporal Fusion Transformers for Robust Multi-Object Tracking
WACV 2026
High-Rate Mixout: Revisiting Mixout for Robust Domain Generalization
WACV 2026
1LoRA: Summation Compression for Very Low-Rank Adaptation
WACV 2026
GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection
EACL 2026
Iterative Structured Pruning for Large Language Models with Multi-Domain Calibration
EACL 2026
Suppressing Final Layer Hidden State Jumps in Transformer Pretraining
EACL 2026
Imbalanced Gradients in RL Post-Training of Multi-Task LLMs
EACL 2026
Position Encoding with Random Float Sampling Enhances Length Generalization of Transformers
EACL 2026
Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios
EACL 2026
Balancing Fluency and Adherence: Hybrid Fallback Term Injection in Low-Resource Terminology Translation
EACL 2026
Start Small, Think Big: Curriculum-based Relative Policy Optimization for Visual Grounding
AAAI 2026
<
1
2
3
4
5
…
146
>