Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
RemoteReasoner: Towards Unifying Geospatial Reasoning Workflow
AAAI 2026
Prototype Entropy Alignment: Reinforcing Structured Uncertainty in LLM Reasoning
AAAI 2026
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
AAAI 2026
PA-FAS: Towards Interpretable and Generalizable Multimodal Face Anti-Spoofing via Path-Augmented Reinforcement Learning
AAAI 2026
Think Wise, Collaborate Effectively: A Rationale-Aware LLM-Based Recommender with Reinforcement Learning from Collaborative Signals
AAAI 2026
SHADOW: Dynamic-Aware Credit Assignment Against Long-Horizon Tasks
AAAI 2026
Pseudo-Likelihood Training for Reasoning Diffusion Language Models
EACL 2026
Structure-based RNA Design by Step-wise Optimization of Latent Diffusion Model
AAAI 2026
Vision-Language Reasoning for Geolocalization: A Reinforcement Learning Approach
AAAI 2026
Social Influence-Based Mutual Acknowledgement Token Exchange (Student Abstract)
AAAI 2026
Think Then Rewrite: Reasoning Enhanced Query Rewriting for Domain Specific Retrieval
AAAI 2026
Knowledge-Enhanced Image Captioning with Adaptive Graph-based Multimodal Alignment and LLM
AAAI 2026
Aligning Cross-View Visual Geometries in LVLMs Through Human-Like Reasoning Learning
AAAI 2026
VCGD: Visual Clue Guided Decoding with Caption Model for Mitigating Hallucination in Multimodal Large Language Models
AAAI 2026
FAST-EQA: Efficient Embodied Question Answering with Global and Local Region Relevancy
WACV 2026
ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos
WACV 2026
Reflect, Rewrite, Repeat: How Simple Arithmetic Enables Advanced Reasoning in Small Language Models
EACL 2026
Think Just Enough: Leveraging Self-Assessed Confidence for Adaptive Reasoning in Language Models
EACL 2026
ReaSon: Reinforced Causal Search with Information Bottleneck for Video Understanding
AAAI 2026
Vision-G1: Towards General Reasoning Vision-Language Models via Reinforcement Learning
AAAI 2026
USPR: Learning a Unified Solver for Profiled Routing
AAAI 2026
KOALA: Knowledge of Optimization and Learning Algorithms for Healthcare
AAAI 2026
When Eyes and Ears Disagree: Can MLLMs Discern Audio-Visual Confusion?
AAAI 2026
LENS: Learning to Segment Anything with Unified Reinforced Reasoning
AAAI 2026
RESTL: Reinforcement Learning Guided by Multi-Aspect Rewards for Signal Temporal Logic Transformation
AAAI 2026
<
1
2
3
4
5
…
118
>