Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning
WACV 2026
An Interactive Simulation Framework by Ensemble Imitation Learning Agents for Training Robust Trading Policies
AAAI 2026
Conformal Feedback Alignment: Quantifying Answer-Level Reliability for Robust LLM Alignment
EACL 2026
Reinforcement Learning-based Adaptive Control of Classifier-Free Guidance and Timestep Embeddings in Diffusion Models
WACV 2026
AI-Driven Real-Time Acoustic Modelling for Better Audio Perception in Dynamic Environments
AAAI 2026
Towards Fairness in Transportation Gig Markets: Identifying, Imitating, and Mitigating Algorithm Discrimination via Deep Reinforcement Learning
AAAI 2026
CAPO: A Unified Policy Gradient Approach for Reward and Cost Optimization in Safe Reinforcement Learning (Student Abstract)
AAAI 2026
Multi-Stage Reinforcement Learning for Robust Charging of Quantum Batteries (Student Abstract)
AAAI 2026
EA: Managing Green Data Centers Using Deep Reinforcement Learning Without Discounting
AAAI 2026
Safe Reinforcement Learning for Trustworthy AI: Theory, Algorithms, and Applications
AAAI 2026
Reinforcement Learning Without Explicit Rewards: Theory and Practice
AAAI 2026
Learngene: Inheritable ‘Genes’ in Intelligent Agents (Abstract Reprint)
AAAI 2026
Symbolic Task Inference in Deep Reinforcement Learning (Abstract Reprint)
AAAI 2026
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models (Abstract Reprint)
AAAI 2026
TORA: Train Once, Realign Anytime for Offline Multi-Objective Reinforcement Learning
AAAI 2026
Best-Effort Policies for Robust Markov Decision Processes
AAAI 2026
Hierarchical Reinforcement Learning with Topology-Aware Exploration Framework for Multi-path Commodity Flow Problem
AAAI 2026
First-Order Representation Languages for Goal-Conditioned RL
AAAI 2026
Meta-Black-Box Optimization with Bi-Space Landscape Analysis and Dual-Control Mechanism for SAEA
AAAI 2026
Beyond Single-Step Updates: Reinforcement Learning of Heuristics with Limited-Horizon Search
AAAI 2026
CRAF: A Clinical Reasoning-Adaptive Framework via Reinforcement Learning for Similar Case Retrieval
AAAI 2026
Promoting Efficient Reasoning with Verifiable Stepwise Reward
AAAI 2026
Stability-Aware Reinforcement Learning for Robust Class Integration Test Order Generation
AAAI 2026
ShoppingBench: A Real-World Intent-Grounded Shopping Benchmark for LLM-based Agents
AAAI 2026
A Rolling Stone Gathers No Moss: Adaptive Policy Optimization for Stable Self-Evaluation in Large Multimodal Models
AAAI 2026
<
1
2
3
4
5
…
155
>