Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Reinforcement Learning
›
Methods
›
Offline RL
725 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 1
2012: 2
2014: 3
2015: 2
2016: 6
2017: 4
2018: 8
2019: 29
2020: 60
2021: 105
2022: 129
2023: 187
2024: 126
2025: 37
2026: 22
Papers
Efficient Multi-Policy Evaluation for Reinforcement Learning
AAAI 2025
Offline Reinforcement Learning for LLM Multi-step Reasoning
ACL 2025
A Finite-State Controller Based Offline Solver for Deterministic POMDPs
IJCAI 2025
Dynamic Rank Adjustment in Diffusion Policies for Efficient and Flexible Training
RSS 2025
State Revisit and Re-explore: Bridging Sim-to-Real Gaps in Offline-and-Online Reinforcement Learning with An Imperfect Simulator
IJCAI 2025
Evaluation of Active Feature Acquisition Methods for Time-varying Feature Settings
JMLR 2025
Cooperative Policy Agreement: Learning Diverse Policy for Offline MARL
AAAI 2025
Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values
EMNLP 2025
LongReward: Improving Long-context Large Language Models with AI Feedback
ACL 2025
Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning
AAAI 2025
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
AAAI 2025
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning
AAAI 2025
Selective Uncertainty Propagation in Offline RL
AAAI 2025
Are Expressive Models Truly Necessary for Offline RL?
AAAI 2025
Dynamic Uncertainty Estimation for Offline Reinforcement Learning
AAAI 2025
Distribution-Free Uncertainty Quantification in Mechanical Ventilation Treatment: A Conformal Deep Q-Learning Framework
AAAI 2025
Offline Safe Reinforcement Learning Using Trajectory Classification
AAAI 2025
Stabilizing and Accelerating Autofocus with Expert Trajectory Regularized Deep Reinforcement Learning
CVPR 2025
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
CVPR 2025
Imagination-Limited Q-Learning for Offline Reinforcement Learning
IJCAI 2025
Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
IJCAI 2025
MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned Weighted Supervised Learning
AAAI 2025
In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning
AAAI 2025
Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues
AAAI 2025
Relational Neurosymbolic Markov Models
AAAI 2025
<
1
2
3
4
5
…
29
>