Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Reinforcement Learning
›
Methods
›
Offline RL
725 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 1
2012: 2
2014: 3
2015: 2
2016: 6
2017: 4
2018: 8
2019: 29
2020: 60
2021: 105
2022: 129
2023: 187
2024: 126
2025: 37
2026: 22
Papers
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
ICML 2023
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
ICML 2023
Offline Model-Based Reinforcement Learning for Tokamak Control
L4DC 2023
Hierarchical Diffusion for Offline Decision Making
ICML 2023
Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
ICML 2023
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback
EMNLP 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
ICML 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
ICML 2023
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer
ICML 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
ICML 2023
Beyond Reward: Offline Preference-guided Policy Optimization
ICML 2023
Principled Offline RL in the Presence of Rich Exogenous Information
ICML 2023
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
NIPS 2023
Multi-Task Off-Policy Learning from Bandit Feedback
ICML 2023
Distance Weighted Supervised Learning for Offline Interaction Data
ICML 2023
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning
EMNLP 2023
Learning Temporally AbstractWorld Models without Online Experimentation
ICML 2023
A Connection between One-Step RL and Critic Regularization in Reinforcement Learning
ICML 2023
Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation
NIPS 2023
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models
ICML 2023
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
ICML 2023
Semi-Offline Reinforcement Learning for Optimized Text Generation
ICML 2023
Scalable Safe Policy Improvement via Monte Carlo Tree Search
ICML 2023
Semi-Supervised Off-Policy Reinforcement Learning and Value Estimation for Dynamic Treatment Regimes
JMLR 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
NIPS 2023
<
1
…
10
11
12
…
29
>