Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Self-Consistent Model-based Adaptation for Visual Reinforcement Learning
IJCAI 2025
CADP: Towards Better Centralized Learning for Decentralized Execution in MARL
IJCAI 2025
BFTBrain: Adaptive BFT Consensus with Reinforcement Learning
NSDI 2025
Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs
IJCAI 2025
EFormer: An Effective Edge-based Transformer for Vehicle Routing Problems
IJCAI 2025
Preference-based Deep Reinforcement Learning for Historical Route Estimation
IJCAI 2025
Text2World: Benchmarking Large Language Models for Symbolic World Model Generation
ACL 2025
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
AAAI 2025
Adversarial Preference Learning for Robust LLM Alignment
ACL 2025
Reward Adaptation via Q-Manipulation: Provably Beneficial Reward Function Transfer in Reinforcement Learning
IJCAI 2025
AI-Powered Algorithm-Centric Quantum Processor Topology Design
AAAI 2025
APIRL: Deep Reinforcement Learning for REST API Fuzzing
AAAI 2025
GenAL: Generative Agent for Adaptive Learning
AAAI 2025
SLRL: Semi-Supervised Local Community Detection Based on Reinforcement Learning
AAAI 2025
SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch
AAAI 2025
ERL-MPP: Evolutionary Reinforcement Learning with Multi-head Puzzle Perception for Solving Large-scale Jigsaw Puzzles of Eroded Gaps
AAAI 2025
Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL
ACL 2025
FFCG: Effective and Fast Family Column Generation for Solving Large-Scale Linear Program
AAAI 2025
Removing Prompt-template Bias in Reinforcement Learning from Human Feedback
ACL 2025
Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation
AAAI 2025
DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation
AAAI 2025
Skill Disentanglement in Reproducing Kernel Hilbert Space
AAAI 2025
Embodied Navigation with Auxiliary Task of Action Description Prediction
ICCV 2025
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors
AAAI 2025
AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward
CVPR 2025
<
1
…
8
9
10
…
155
>