Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
EMNLP 2025
Assess and Prompt: A Generative RL Framework for Improving Engagement in Online Mental Health Communities
EMNLP 2025
Dialogue Is Not Enough to Make a Communicative BabyLM (But Neither Is Developmentally Inspired Reinforcement Learning)
EMNLP 2025
Wavelet Policy: Lifting Scheme for Policy Learning in Long-Horizon Tasks
ICCV 2025
SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin
EMNLP 2025
A Graph Interaction Framework on Relevance for Multimodal Named Entity Recognition with Multiple Images
COLING 2025
Exploration-Driven Generative Interactive Environments
CVPR 2025
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
CVPR 2025
SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning
CVPR 2025
All-Optical Nonlinear Diffractive Deep Network for Ultrafast Image Denoising
CVPR 2025
InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions
CVPR 2025
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning
CVPR 2025
Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction
CVPR 2025
On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning
COLING 2025
Towards Adaptive Mechanism Activation in Language Agent
COLING 2025
MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity
COLING 2025
One Encoder to Rule them All: Representation Learning for Model-free Visual Reinforcement Learning using Fourier Neural Operators
ICCV 2025
RIPE: Reinforcement Learning on Unlabeled Image Pairs for Robust Keypoint Extraction
ICCV 2025
Gain Tuning Is Not What You Need: Reward Gain Adaptation for Constrained Locomotion Learning
RSS 2025
Enhancing AMR Parsing with Group Relative Policy Optimization
ACL 2025
LLMSR@XLLM25: A Language Model-Based Pipeline for Structured Reasoning Data Construction
ACL 2025
Learning Getting-Up Policies for Real-World Humanoid Robots
RSS 2025
Resolving Conflicting Constraints in Multi-Agent Reinforcement Learning with Layered Safety
RSS 2025
Sparks of Tabular Reasoning via Text2SQL Reinforcement Learning
ACL 2025
RL + Transformer = A General-Purpose Problem Solver
ACL 2025
<
1
…
5
6
7
…
155
>