← Optimization & Theory

Machine Learning › Optimization & Theory ›

Neural Network Optimization

3648 directly classified papers

Papers per year

Papers

Imbalanced Gradients in RL Post-Training of Multi-Task LLMs EACL 2026

Acceleration of Backpropagation in Linear Layers of Transformer Models Based on Gradient Structure EACL 2026

Iterative Structured Pruning for Large Language Models with Multi-Domain Calibration EACL 2026

Suppressing Final Layer Hidden State Jumps in Transformer Pretraining EACL 2026

Stabilizing Direct Training of Spiking Neural Networks: Membrane Potential Initialization and Threshold-robust Surrogate Gradient WACV 2026

Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios EACL 2026

Representation Collapse in Machine Translation Through the Lens of Angular Dispersion EACL 2026

Improving Chain-of-Thought for Logical Reasoning via Attention-Aware Intervention EACL 2026

Gated Temporal Fusion Transformers for Robust Multi-Object Tracking WACV 2026

High-Rate Mixout: Revisiting Mixout for Robust Domain Generalization WACV 2026

GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection EACL 2026

Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning EACL 2026

1LoRA: Summation Compression for Very Low-Rank Adaptation WACV 2026

Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning EACL 2026

On the Dataless Training of Neural Networks AAAI 2026

Sleep-Like Replay Reduces Loss-Landscape Sharpness to Improve Generalization (Student Abstract) AAAI 2026

EA: Managing Green Data Centers Using Deep Reinforcement Learning Without Discounting AAAI 2026

Super Level Sets and Exponential Decay: A Synergistic Approach to Stable Neural Network Training (Abstract Reprint) AAAI 2026

Scalable Synthesis of Formally Verified Neural Value Function for Hamilton-Jacobi Reachability Analysis (Abstract Reprint) AAAI 2026

MoLoRA: Boosting LLM-based End-to-end Speech Translation with Mixture of Low-rank Experts AAAI 2026

Dynamic Deep Prompt Optimization for Defending Against Jailbreak Attacks on LLMs AAAI 2026

Scaling and Transferability of Annealing Strategies in Large Language Model Training AAAI 2026

Efficient Post-Training Refinement of Latent Reasoning in Large Language Models AAAI 2026

AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin AAAI 2026

OPERA: A Reinforcement Learning--Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval AAAI 2026