Research Explorer
Offline RL
725 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 1
2012: 2
2014: 3
2015: 2
2016: 6
2017: 4
2018: 8
2019: 29
2020: 60
2021: 105
2022: 129
2023: 187
2024: 126
2025: 37
2026: 22
Papers
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics
ICML 2022
Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks
IJCNLP 2021
Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting
AISTATS 2021
Boosting Offline Reinforcement Learning with Residual Generative Modeling
IJCAI 2021
Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare
IJCAI 2021
Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment
ICML 2021
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain
ICML 2021
Learning Routines for Effective Off-Policy Reinforcement Learning
ICML 2021
Solving Challenging Dexterous Manipulation Tasks With Trajectory Optimisation and Reinforcement Learning
ICML 2021
Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills
ICML 2021
Offline Reinforcement Learning with Pseudometric Learning
ICML 2021
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
ICML 2021
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
ICML 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
ICML 2021
Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient
ICML 2021
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
ICML 2021
A Lower Bound for the Sample Complexity of Inverse Reinforcement Learning
ICML 2021
Offline Reinforcement Learning with Fisher Divergence Critic Regularization
ICML 2021
Improved Regret Bound and Experience Replay in Regularized Policy Iteration
ICML 2021
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
ICML 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
ICML 2021
Is Pessimism Provably Efficient for Offline RL?
ICML 2021
Offline Meta-Reinforcement Learning with Advantage Weighting
ICML 2021
On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game
ICML 2021
State Relevance for Off-Policy Evaluation
ICML 2021