Research Explorer
Offline RL
725 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 1
2012: 2
2014: 3
2015: 2
2016: 6
2017: 4
2018: 8
2019: 29
2020: 60
2021: 105
2022: 129
2023: 187
2024: 126
2025: 37
2026: 22
Papers
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
ICML 2020
Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation
ICML 2020
Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions
ICML 2020
Representations for Stable Off-Policy Reinforcement Learning
ICML 2020
Accountable Off-Policy Evaluation With Kernel Bellman Statistics
ICML 2020
Revisiting Fundamentals of Experience Replay
ICML 2020
Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation
ICML 2020
An Optimistic Perspective on Offline Reinforcement Learning
ICML 2020
Learning Dexterous Manipulation from Suboptimal Experts
CoRL 2020
Dueling Posterior Sampling for Preference-Based Reinforcement Learning
UAI 2020
Off-Policy Evaluation via the Regularized Lagrangian
NIPS 2020
Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis
NIPS 2020
Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning
NIPS 2020
Minimax Value Interval for Off-Policy Evaluation and Policy Optimization
NIPS 2020
Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration
NIPS 2020
Conservative Q-Learning for Offline Reinforcement Learning
NIPS 2020
Remember and Forget for Experience Replay
ICML 2019
Divergence-Augmented Policy Optimization
NIPS 2019
Off-Policy Evaluation via Off-Policy Classification
NIPS 2019
Importance Resampling for Off-policy Prediction
NIPS 2019
Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples
NIPS 2019
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
NIPS 2019
Generalized Off-Policy Actor-Critic
NIPS 2019
Semi-Parametric Efficient Policy Learning with Continuous Actions
NIPS 2019
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
NIPS 2019