Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Optimization
14207 directly classified papers
Papers per year
2001: 10
2002: 9
2003: 16
2004: 6
2005: 16
2006: 58
2007: 67
2008: 72
2009: 84
2010: 106
2011: 132
2012: 164
2013: 333
2014: 295
2015: 310
2016: 380
2017: 509
2018: 669
2019: 1072
2020: 1217
2021: 1489
2022: 1470
2023: 1746
2024: 1819
2025: 1567
2026: 591
Papers
Instance Generation for Meta-Black-Box Optimization Through Latent Space Reverse Engineering
AAAI 2026
DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs
AAAI 2026
Preference Optimization via Contrastive Divergence: Your Policy Is Secretly an NLL Estimator
AAAI 2026
ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization
EACL 2026
Themis: Automated Constraint-Aware Test Synthesis Framework for Code Reinforcement Learning
AAAI 2026
Promoting Efficient Reasoning with Verifiable Stepwise Reward
AAAI 2026
Multi-Metric Preference Alignment for Generative Speech Restoration
AAAI 2026
ASKD: Reinforcement Learning-Style Knowledge Distillation with Quality-Adaptive Skewness
AAAI 2026
When Instinct Guides and Insight Grounds: Staged RL Training for LLM Agents
AAAI 2026
Beyond Step Pruning: Information Theory Based Step-level Optimization for Self-Refining Large Language Models
AAAI 2026
Don’t Start Over: A Cost-Effective Framework for Migrating Personalized Prompts Between LLMs
AAAI 2026
Optimizer Choice and Calibration for QARiB on Arabic-Script Social Media Offensive Language Detection
EACL 2026
ResMAS: Resilience Optimization in LLM-based Multi-agent Systems
AAAI 2026
Efficient Verification and Falsification of ReLU Neural Barrier Certificates
AAAI 2026
LLMdoctor: Token-Level Flow-Guided Preference Optimization for Efficient Test-Time Alignment of Large Language Models
AAAI 2026
ProFuser: Progressive Fusion of Large Language Models
AAAI 2026
qa-FLoRA: Data-free query-adaptive Fusion of LoRAs for LLMs
AAAI 2026
Well Begun, Half Done: Reinforcement Learning with Prefix Optimization for LLM Reasoning
AAAI 2026
Improving Value-based Process Verifier via Low-Cost Variance Reduction
AAAI 2026
Put the Space of LoRA Initialization to the Extreme to Preserve Pre-trained Knowledge
AAAI 2026
MCW-KD: Multi-Cost Wasserstein Knowledge Distillation for Large Language Models
AAAI 2026
Re-SpS: A Reinforcement Learning Approach to Speculative Sampling
AAAI 2026
OptScale: Probabilistic Optimality for Inference-time Scaling
AAAI 2026
ReCode: Updating Code API Knowledge with Reinforcement Learning
AAAI 2026
Audio-Thinker: Guiding Large Audio Language Model When and How to Think via Reinforcement Learning
AAAI 2026
<
1
…
5
6
7
…
569
>