Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Deep Learning
›
Learning Types
›
Reinforcement Learning
1263 directly classified papers
Papers per year
2006: 1
2007: 2
2008: 3
2009: 2
2010: 1
2011: 2
2012: 3
2013: 2
2014: 3
2015: 2
2016: 8
2017: 44
2018: 95
2019: 134
2020: 123
2021: 131
2022: 143
2023: 127
2024: 194
2025: 240
2026: 3
Papers
ReinDiffuse: Crafting Physically Plausible Motions with Reinforced Diffusion Model
WACV 2025
Dense Policy: Bidirectional Autoregressive Learning of Actions
ICCV 2025
RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation
ICCV 2025
RMultiplex200K: Toward Reliable Multimodal Process Supervision for Visual Language Models on Telecommunications
ICCV 2025
Token-Level Accept or Reject: A Micro Alignment Approach for Large Language Models
IJCAI 2025
DreamAlign: Dynamic Text-to-3D Optimization with Human Preference Alignment
AAAI 2025
Finite Expression Method for Solving High-Dimensional Partial Differential Equations
JMLR 2025
World Models with Hints of Large Language Models for Goal Achieving
NAACL 2025
When2Call: When (not) to Call Tools
NAACL 2025
Understanding Reference Policies in Direct Preference Optimization
NAACL 2025
Do LLMs Need Inherent Reasoning Before Reinforcement Learning? A Study in Korean Self-Correction
AACL 2025
VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers
ICCV 2025
Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences
ICCV 2025
VCA: Video Curious Agent for Long Video Understanding
ICCV 2025
A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation
ICCV 2025
Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation
AAAI 2025
POI Recommendation via Multi-Objective Adversarial Imitation Learning
AAAI 2025
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors
AAAI 2025
Teaching Models to Improve on Tape
AAAI 2025
Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning
AAAI 2025
The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewards
AAAI 2025
Enhancing Predictive Healthcare Using AI-Driven Early Warning Systems
AAAI 2025
MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization
ICCV 2025
Learned Perceptive Forward Dynamics Model for Safe and Platform-aware Robotic Navigation
RSS 2025
Removing Prompt-template Bias in Reinforcement Learning from Human Feedback
ACL 2025
<
1
2
3
4
5
…
51
>