Research Explorer
Value Iteration
306 directly classified papers
Papers per year
2002: 3
2005: 3
2007: 1
2008: 1
2009: 2
2010: 1
2011: 1
2012: 5
2013: 4
2014: 3
2015: 7
2016: 10
2017: 9
2018: 20
2019: 33
2020: 47
2021: 39
2022: 37
2023: 42
2024: 23
2025: 13
2026: 2
Papers
Learning Generalizable and Composable Abstractions for Transfer in Reinforcement Learning
AAAI 2024
Transition Constrained Bayesian Optimization via Markov Decision Processes
NIPS 2024
Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning
NIPS 2024
Efficient Constraint Generation for Stochastic Shortest Path Problems
AAAI 2024
SCAFFLSA: Taming Heterogeneity in Federated Linear Stochastic Approximation and TD Learning
NIPS 2024
Increasing information for model predictive control with semi-Markov decision processes
L4DC 2024
Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs
NIPS 2024
Learning to stabilize high-dimensional unknown systems using Lyapunov-guided exploration
L4DC 2024
Deep Reinforcement Learning for Early Diagnosis of Lung Cancer
AAAI 2024
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
NIPS 2024
Adaptive Exploration for Data-Efficient General Value Function Evaluations
NIPS 2024
Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning
ACL 2024
Reinforcement Learning Under Latent Dynamics: Toward Statistical and Algorithmic Modularity
NIPS 2024
The Benefits of Model-Based Generalization in Reinforcement Learning
ICML 2023
Q-functionals for Value-Based Continuous Control
AAAI 2023
The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation
ICML 2023
Provably Efficient Model-Free Algorithms for Non-stationary CMDPs
AISTATS 2023
Towards a better understanding of representation dynamics under TD-learning
ICML 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
ICML 2023
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation
ICML 2023
Optimistic Planning by Regularized Dynamic Programming
ICML 2023
Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents
ICML 2023
Planning and Learning with Adaptive Lookahead
AAAI 2023
Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
AISTATS 2023
SelfTune: Tuning Cluster Managers
NSDI 2023