Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Reinforcement Learning
767 directly classified papers
Papers per year
2006: 1
2007: 6
2008: 3
2009: 2
2010: 4
2011: 3
2012: 8
2013: 3
2014: 4
2016: 4
2017: 21
2018: 48
2019: 75
2020: 73
2021: 86
2022: 107
2023: 116
2024: 127
2025: 76
Papers
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
NIPS 2024
P2BPO: Permeable Penalty Barrier-Based Policy Optimization for Safe RL
AAAI 2024
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts
EMNLP 2024
TRIP NEGOTIATOR: A Travel Persona-aware Reinforced Dialogue Generation Model for Personalized Integrative Negotiation in Tourism
EMNLP 2024
Reward Modeling Requires Automatic Adjustment Based on Data Quality
EMNLP 2024
π-Light: Programmatic Interpretable Reinforcement Learning for Resource-Limited Traffic Signal Control
AAAI 2024
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
AAAI 2024
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
ACL 2024
Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning
EMNLP 2024
ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback
EMNLP 2024
Exploration via linearly perturbed loss minimisation
AISTATS 2024
Active Reinforcement Learning for Robust Building Control
AAAI 2024
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code
EMNLP 2024
Semi-Supervised Reward Modeling via Iterative Self-Training
EMNLP 2024
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use
EMNLP 2024
On the Statistical Efficiency of Mean-Field Reinforcement Learning with General Function Approximation
AISTATS 2024
Enhancing Reinforcement Learning with Dense Rewards from Language Model Critic
EMNLP 2024
Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning
AAAI 2024
Virtual Action Actor-Critic Framework for Exploration (Student Abstract)
AAAI 2024
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning
CVPR 2024
StablePrompt : Automatic Prompt Tuning using Reinforcement Learning for Large Language Model
EMNLP 2024
E2CL: Exploration-based Error Correction Learning for Embodied Agents
EMNLP 2024
Transformers Learn Transition Dynamics when Trained to Predict Markov Decision Processes
EMNLP 2024
Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning
NIPS 2024
MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-Object Demand-driven Navigation
NIPS 2024
<
1
…
6
7
8
…
31
>