Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Reinforcement Learning
›
Methods
›
Offline RL
725 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 1
2012: 2
2014: 3
2015: 2
2016: 6
2017: 4
2018: 8
2019: 29
2020: 60
2021: 105
2022: 129
2023: 187
2024: 126
2025: 37
2026: 22
Papers
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
NIPS 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
NIPS 2021
Nearly Horizon-Free Offline Reinforcement Learning
NIPS 2021
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning
NIPS 2021
Weighted model estimation for offline model-based reinforcement learning
NIPS 2021
Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL
NIPS 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
NIPS 2021
SOPE: Spectrum of Off-Policy Estimators
NIPS 2021
Conservative Offline Distributional Reinforcement Learning
NIPS 2021
A Minimalist Approach to Offline Reinforcement Learning
NIPS 2021
BCORLE($\lambda$): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market
NIPS 2021
Provably Efficient Causal Reinforcement Learning with Confounded Observational Data
NIPS 2021
The Difficulty of Passive Learning in Deep Reinforcement Learning
NIPS 2021
Active Offline Policy Selection
NIPS 2021
Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration
NIPS 2021
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
NIPS 2021
Online and Offline Reinforcement Learning by Planning with a Learned Model
NIPS 2021
Asymptotically Exact Error Characterization of Offline Policy Evaluation with Misspecified Linear Models
NIPS 2021
COMBO: Conservative Offline Model-Based Policy Optimization
NIPS 2021
Offline Reinforcement Learning with Reverse Model-based Imagination
NIPS 2021
Provable Representation Learning for Imitation with Contrastive Fourier Features
NIPS 2021
Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage
NIPS 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
NIPS 2021
Reinforcement Learning with State Observation Costs in Action-Contingent Noiselessly Observable Markov Decision Processes
NIPS 2021
Off-Policy Risk Assessment in Contextual Bandits
NIPS 2021
<
1
…
23
24
25
…
29
>