Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Machine Learning
›
Learning Types
›
Offline RL
28 directly classified papers
Papers per year
2010: 1
2018: 1
2020: 2
2021: 7
2022: 5
2023: 4
2024: 7
2026: 1
Papers
Trajectory Tactics: When Transformers Learn Exploration to Generate Online Signature
WACV 2026
Probabilistic Offline Policy Ranking with Approximate Bayesian Computation
AAAI 2024
Offline Model-Based Optimization via Policy-Guided Gradient Search
AAAI 2024
MADiff: Offline Multi-agent Learning with Diffusion Models
NIPS 2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
NIPS 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
NIPS 2024
Scaling Offline Evaluation of Reinforcement Learning Agents through Abstraction
AAAI 2024
Multi-Agent Domain Calibration with a Handful of Offline Data
NIPS 2024
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
ICML 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
ICML 2023
Counterfactual Learning with General Data-Generating Policies
AAAI 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
ICML 2023
Conformal Off-Policy Prediction in Contextual Bandits
NIPS 2022
The Curse of Passive Data Collection in Batch Reinforcement Learning
AISTATS 2022
Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning
ICML 2022
[CASPI] Causal-aware Safe Policy Improvement for Task-oriented Dialogue
ACL 2022
Off-Policy Evaluation with Deficient Support Using Side Information
NIPS 2022
Boosting Offline Reinforcement Learning with Residual Generative Modeling
IJCAI 2021
Deeply-Debiased Off-Policy Interval Estimation
ICML 2021
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
ICML 2021
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain
ICML 2021
Offline Contextual Bandits with Overparameterized Models
ICML 2021
Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills
ICML 2021
Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning
AISTATS 2021
Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits
ICML 2020
<
1
2
>