Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Reinforcement Learning
767 directly classified papers
Papers per year
2006: 1
2007: 6
2008: 3
2009: 2
2010: 4
2011: 3
2012: 8
2013: 3
2014: 4
2016: 4
2017: 21
2018: 48
2019: 75
2020: 73
2021: 86
2022: 107
2023: 116
2024: 127
2025: 76
Papers
Deep Implicit Imitation Reinforcement Learning in Heterogeneous Action Settings
AAAI 2025
Fourier Guided Adaptive Adversarial Augmentation for Generalization in Visual Reinforcement Learning
AAAI 2025
Multi-fingered Hand Grasps with Visuo-Tactile Fusion via Multi-Agent Deep Reinforcement Learning
AAAI 2025
Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning
AAAI 2025
DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback
AAAI 2025
Truncated Gaussian Policy for Debiased Continuous Control
AAAI 2025
AI-Powered Algorithm-Centric Quantum Processor Topology Design
AAAI 2025
Efficient Language-instructed Skill Acquisition via Reward-Policy Co-Evolution
AAAI 2025
Evolutionary Reinforcement Learning with Parameterized Action Primitives for Diverse Manipulation Tasks
AAAI 2025
ASP-Driven Emergency Planning for Norm Violations in Reinforcement Learning
AAAI 2025
Skill Disentanglement in Reproducing Kernel Hilbert Space
AAAI 2025
Active Reinforcement Learning Strategies for Offline Policy Improvement
AAAI 2025
FedAA: A Reinforcement Learning Perspective on Adaptive Aggregation for Fair and Robust Federated Learning
AAAI 2025
GLAM: Global-Local Variation Awareness in Mamba-based World Model
AAAI 2025
Do LLMs Need Inherent Reasoning Before Reinforcement Learning? A Study in Korean Self-Correction
AACL 2025
APIRL: Deep Reinforcement Learning for REST API Fuzzing
AAAI 2025
Deep Reinforcement Learning with Time-Scale Invariant Memory
AAAI 2025
SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch
AAAI 2025
When Should We Prefer State-to-Visual DAgger over Visual Reinforcement Learning?
AAAI 2025
RMultiplex200K: Toward Reliable Multimodal Process Supervision for Visual Language Models on Telecommunications
ICCV 2025
DRBO: Mitigating the Bottleneck Effect via Dynamic Reward Balancing in Multi-reward LLM Optimization
EMNLP 2025
NaviFormer: A Spatio-Temporal Context-Aware Transformer for Object Navigation
AAAI 2025
Probabilistic Shielding for Safe Reinforcement Learning
AAAI 2025
On-Policy Self-Alignment with Fine-grained Knowledge Feedback for Hallucination Mitigation
ACL 2025
Real-Time Recurrent Reinforcement Learning
AAAI 2025
<
1
2
3
4
5
…
31
>