Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Reinforcement Learning
›
Methods
›
Offline RL
725 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 1
2012: 2
2014: 3
2015: 2
2016: 6
2017: 4
2018: 8
2019: 29
2020: 60
2021: 105
2022: 129
2023: 187
2024: 126
2025: 37
2026: 22
Papers
Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition
AISTATS 2024
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
AAAI 2024
Reward-Relevance-Filtered Linear Offline Reinforcement Learning
AISTATS 2024
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
AAAI 2024
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
NIPS 2024
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation
AISTATS 2024
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
NIPS 2024
Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning (Abstract Reprint)
AAAI 2024
Offline Policy Learning via Skill-step Abstraction for Long-horizon Goal-Conditioned Tasks
IJCAI 2024
Offline Model-Based Optimization via Policy-Guided Gradient Search
AAAI 2024
Semi-Supervised Off-Policy Reinforcement Learning and Value Estimation for Dynamic Treatment Regimes
JMLR 2023
Guide to Control: Offline Hierarchical Reinforcement Learning Using Subgoal Generation for Long-Horizon and Sparse-Reward Tasks
IJCAI 2023
Off-Policy Actor-Critic with Emphatic Weightings
JMLR 2023
A Complete Characterization of Linear Estimators for Offline Policy Evaluation
JMLR 2023
Reinforcement Learning in Low-rank MDPs with Density Features
ICML 2023
More for Less: Safe Policy Improvement with Stronger Performance Guarantees
IJCAI 2023
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
AISTATS 2023
Distributionally Robust Policy Gradient for Offline Contextual Bandits
AISTATS 2023
Neural Laplace Control for Continuous-time Delayed Systems
AISTATS 2023
Continuous-Time Decision Transformer for Healthcare Applications
AISTATS 2023
Revisiting the Linear-Programming Framework for Offline RL with General Function Approximation
ICML 2023
Learning from Visual Observation via Offline Pretrained State-to-Go Transformer
NIPS 2023
Exponential Smoothing for Off-Policy Learning
ICML 2023
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
AISTATS 2023
Distributed Offline Policy Optimization Over Batch Data
AISTATS 2023
<
1
…
7
8
9
…
29
>