Offline RL

725 directly classified papers

Papers

Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics ICML 2022

Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks IJCNLP 2021

Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting AISTATS 2021

Boosting Offline Reinforcement Learning with Residual Generative Modeling IJCAI 2021

Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare IJCAI 2021

Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment ICML 2021

Model-Free and Model-Based Policy Evaluation when Causality is Uncertain ICML 2021

Learning Routines for Effective Off-Policy Reinforcement Learning ICML 2021

Solving Challenging Dexterous Manipulation Tasks With Trajectory Optimisation and Reinforcement Learning ICML 2021

Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills ICML 2021

Offline Reinforcement Learning with Pseudometric Learning ICML 2021

Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning ICML 2021

A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation ICML 2021

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL ICML 2021

Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient ICML 2021

Bootstrapping Fitted Q-Evaluation for Off-Policy Inference ICML 2021

A Lower Bound for the Sample Complexity of Inverse Reinforcement Learning ICML 2021

Offline Reinforcement Learning with Fisher Divergence Critic Regularization ICML 2021

Improved Regret Bound and Experience Replay in Regularized Policy Iteration ICML 2021

OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation ICML 2021

PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training ICML 2021

Is Pessimism Provably Efficient for Offline RL? ICML 2021

Offline Meta-Reinforcement Learning with Advantage Weighting ICML 2021

On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game ICML 2021

State Relevance for Off-Policy Evaluation ICML 2021