Research Explorer
Reinforcement Learning
767 directly classified papers
Papers per year
2006: 1
2007: 6
2008: 3
2009: 2
2010: 4
2011: 3
2012: 8
2013: 3
2014: 4
2016: 4
2017: 21
2018: 48
2019: 75
2020: 73
2021: 86
2022: 107
2023: 116
2024: 127
2025: 76
Papers
DRBO: Mitigating the Bottleneck Effect via Dynamic Reward Balancing in Multi-reward LLM Optimization
EMNLP 2025
Real-Time Recurrent Reinforcement Learning
AAAI 2025
Marginal Benefit Driven RL Teacher for Unsupervised Environment Design
AAAI 2025
Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs
EMNLP 2025
CTD4 – a Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
AAAI 2025
SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning
AAAI 2025
Efficient Multi-Policy Evaluation for Reinforcement Learning
AAAI 2025
Deterministic Policy Gradient Primal-Dual Methods for Continuous-Space Constrained MDPs
AAAI 2025
Epistemic Bellman Operators
AAAI 2025
Efficient Reinforcement Learning Through Adaptively Pretrained Visual Encoder
AAAI 2025
Differentiable Information Enhanced Model-Based Reinforcement Learning
AAAI 2025
SMoSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks
AAAI 2025
Do LLMs Need Inherent Reasoning Before Reinforcement Learning? A Study in Korean Self-Correction
AACL 2025
AI-Powered Algorithm-Centric Quantum Processor Topology Design
AAAI 2025
Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems
AAAI 2025
On Shallow Planning Under Partial Observability
AAAI 2025
Reinforcement Learning Platform for Adversarial Black-box Attacks with Custom Distortion Filters
AAAI 2025
Partial Identifiability in Inverse Reinforcement Learning for Agents with Non-Exponential Discounting
AAAI 2025
An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals
ACL 2025
Dynamic and Generalizable Process Reward Modeling
ACL 2025
APIRL: Deep Reinforcement Learning for REST API Fuzzing
AAAI 2025
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning
ACL 2025
CEAES: Bidirectional Reinforcement Learning Optimization for Consistent and Explainable Essay Assessment
ACL 2025
SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement
ACL 2025
ReAL: How Can LLMs Simulate the Real Teacher? Retrieval-enhanced Agent for Adaptive Learning
EMNLP 2025