Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Hierarchical Reinforcement Learning with Topology-Aware Exploration Framework for Multi-path Commodity Flow Problem
AAAI 2026
First-Order Representation Languages for Goal-Conditioned RL
AAAI 2026
Meta-Black-Box Optimization with Bi-Space Landscape Analysis and Dual-Control Mechanism for SAEA
AAAI 2026
Beyond Single-Step Updates: Reinforcement Learning of Heuristics with Limited-Horizon Search
AAAI 2026
CRAF: A Clinical Reasoning-Adaptive Framework via Reinforcement Learning for Similar Case Retrieval
AAAI 2026
Promoting Efficient Reasoning with Verifiable Stepwise Reward
AAAI 2026
Stability-Aware Reinforcement Learning for Robust Class Integration Test Order Generation
AAAI 2026
ShoppingBench: A Real-World Intent-Grounded Shopping Benchmark for LLM-based Agents
AAAI 2026
A Rolling Stone Gathers No Moss: Adaptive Policy Optimization for Stable Self-Evaluation in Large Multimodal Models
AAAI 2026
V-Pruner: A Fast and Globally-informed Token Pruning Framework for Vision Transformer
AAAI 2026
PQDA:Policy-Aligned Q-Consistency Meets Decoupled Augmentation for Generalizable Visual RL
AAAI 2026
Achieving Equilibrium Under Utility Heterogeneity: An Agent-Attention Framework for Multi-Agent Multi-Objective Reinforcement Learning
AAAI 2026
HCPO: Hierarchical Conductor-Based Policy Optimization in Multi-Agent Reinforcement Learning
AAAI 2026
AcoustoReinforce: Multi-Particle Acoustophoretic Path Planning with Deep Reinforcement Learning
AAAI 2026
Test-Time Reinforcement Learning for GUI Grounding via Region Consistency
AAAI 2026
VideoSeg-R1:Reasoning Video Object Segmentation via Reinforcement Learning
AAAI 2026
PulseMind: A Multi-Modal Medical Model for Real-World Clinical Diagnosis
AAAI 2026
ReasonAct: Progressive Training for Fine-Grained Video Reasoning in Small Models
AAAI 2026
AURORA: Augmented Understanding via Structured Reasoning and Reinforcement Learning for Reference Audio-Visual Segmentation
AAAI 2026
Informative Subgraph Extraction with Deep Reinforcement Learning for Drug-Drug Interaction Prediction
AAAI 2026
VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning
AAAI 2026
RLSLM: A Hybrid Framework Combining Reinforcement Learning and a Rule-based Social Locomotion Model for Socially-aware Navigation
AAAI 2026
Highly Imperceptible Black-Box Graph Injection Attacks with Reinforcement Learning
AAAI 2025
GeoExplorer: Active Geo-localization with Curiosity-Driven Exploration
ICCV 2025
PRED: Performance-oriented Random Early Detection for Consistently Stable Performance in Datacenters
NSDI 2025
<
1
2
3
4
5
…
155
>