← Optimization & Theory

Machine Learning › Optimization & Theory ›

Optimization

14207 directly classified papers

Papers per year

Papers

Instance Generation for Meta-Black-Box Optimization Through Latent Space Reverse Engineering AAAI 2026

DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs AAAI 2026

Preference Optimization via Contrastive Divergence: Your Policy Is Secretly an NLL Estimator AAAI 2026

ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization EACL 2026

Themis: Automated Constraint-Aware Test Synthesis Framework for Code Reinforcement Learning AAAI 2026

Promoting Efficient Reasoning with Verifiable Stepwise Reward AAAI 2026

Multi-Metric Preference Alignment for Generative Speech Restoration AAAI 2026

ASKD: Reinforcement Learning-Style Knowledge Distillation with Quality-Adaptive Skewness AAAI 2026

When Instinct Guides and Insight Grounds: Staged RL Training for LLM Agents AAAI 2026

Beyond Step Pruning: Information Theory Based Step-level Optimization for Self-Refining Large Language Models AAAI 2026

Don’t Start Over: A Cost-Effective Framework for Migrating Personalized Prompts Between LLMs AAAI 2026

Optimizer Choice and Calibration for QARiB on Arabic-Script Social Media Offensive Language Detection EACL 2026

ResMAS: Resilience Optimization in LLM-based Multi-agent Systems AAAI 2026

Efficient Verification and Falsification of ReLU Neural Barrier Certificates AAAI 2026

LLMdoctor: Token-Level Flow-Guided Preference Optimization for Efficient Test-Time Alignment of Large Language Models AAAI 2026

ProFuser: Progressive Fusion of Large Language Models AAAI 2026

qa-FLoRA: Data-free query-adaptive Fusion of LoRAs for LLMs AAAI 2026

Well Begun, Half Done: Reinforcement Learning with Prefix Optimization for LLM Reasoning AAAI 2026

Improving Value-based Process Verifier via Low-Cost Variance Reduction AAAI 2026

Put the Space of LoRA Initialization to the Extreme to Preserve Pre-trained Knowledge AAAI 2026

MCW-KD: Multi-Cost Wasserstein Knowledge Distillation for Large Language Models AAAI 2026

Re-SpS: A Reinforcement Learning Approach to Speculative Sampling AAAI 2026

OptScale: Probabilistic Optimality for Inference-time Scaling AAAI 2026

ReCode: Updating Code API Knowledge with Reinforcement Learning AAAI 2026

Audio-Thinker: Guiding Large Audio Language Model When and How to Think via Reinforcement Learning AAAI 2026