Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Reinforcement Learning
›
Methods
›
Offline RL
725 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 1
2012: 2
2014: 3
2015: 2
2016: 6
2017: 4
2018: 8
2019: 29
2020: 60
2021: 105
2022: 129
2023: 187
2024: 126
2025: 37
2026: 22
Papers
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning
EMNLP 2023
An effective negotiating agent framework based on deep offline reinforcement learning
UAI 2023
Diffused Task-Agnostic Milestone Planner
NIPS 2023
Constrained Policy Optimization with Explicit Behavior Density For Offline Reinforcement Learning
NIPS 2023
Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling
ICML 2023
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
ICML 2023
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching
CORL 2023
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL
ICML 2023
Supported Trust Region Optimization for Offline Reinforcement Learning
ICML 2023
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
CORL 2023
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
ICML 2023
Constrained Decision Transformer for Offline Safe Reinforcement Learning
ICML 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
ICML 2023
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
ICML 2023
Off-Policy Actor-Critic with Emphatic Weightings
JMLR 2023
Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning
AAAI 2023
Hierarchical Diffusion for Offline Decision Making
ICML 2023
Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
ICML 2023
A Complete Characterization of Linear Estimators for Offline Policy Evaluation
JMLR 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
ICML 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
ICML 2023
Contrastive Value Learning: Implicit Models for Simple Offline RL
CORL 2023
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer
ICML 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
ICML 2023
Beyond Reward: Offline Preference-guided Policy Optimization
ICML 2023
<
1
…
9
10
11
…
29
>