Value Iteration

306 directly classified papers

Papers

Learning Generalizable and Composable Abstractions for Transfer in Reinforcement Learning AAAI 2024

Transition Constrained Bayesian Optimization via Markov Decision Processes NIPS 2024

Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning NIPS 2024

Efficient Constraint Generation for Stochastic Shortest Path Problems AAAI 2024

SCAFFLSA: Taming Heterogeneity in Federated Linear Stochastic Approximation and TD Learning NIPS 2024

Increasing information for model predictive control with semi-Markov decision processes L4DC 2024

Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs NIPS 2024

Learning to stabilize high-dimensional unknown systems using Lyapunov-guided exploration L4DC 2024

Deep Reinforcement Learning for Early Diagnosis of Lung Cancer AAAI 2024

Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning NIPS 2024

Adaptive Exploration for Data-Efficient General Value Function Evaluations NIPS 2024

Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning ACL 2024

Reinforcement Learning Under Latent Dynamics: Toward Statistical and Algorithmic Modularity NIPS 2024

The Benefits of Model-Based Generalization in Reinforcement Learning ICML 2023

Q-functionals for Value-Based Continuous Control AAAI 2023

The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation ICML 2023

Provably Efficient Model-Free Algorithms for Non-stationary CMDPs AISTATS 2023

Towards a better understanding of representation dynamics under TD-learning ICML 2023

ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs ICML 2023

Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation ICML 2023

Optimistic Planning by Regularized Dynamic Programming ICML 2023

Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents ICML 2023

Planning and Learning with Adaptive Lookahead AAAI 2023

Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning AISTATS 2023

SelfTune: Tuning Cluster Managers NSDI 2023