Value Iteration
306 directly classified papers
Papers per year
2002: 3
2005: 3
2007: 1
2008: 1
2009: 2
2010: 1
2011: 1
2012: 5
2013: 4
2014: 3
2015: 7
2016: 10
2017: 9
2018: 20
2019: 33
2020: 47
2021: 39
2022: 37
2023: 42
2024: 23
2025: 13
2026: 2
Papers
Retrosynthetic Planning with Dual Value Networks
ICML 2023
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation
ICML 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
ICML 2023
Online POMDP Planning with Anytime Deterministic Guarantees
NIPS 2023
StockFormer: Learning Hybrid Trading Machines with Predictive Coding
IJCAI 2023
Reinforcement Learning Augmented Asymptotically Optimal Index Policy for Finite-Horizon Restless Bandits
AAAI 2022
Chaining Value Functions for Off-Policy Learning
AAAI 2022
Rethinking Value Function Learning for Generalization in Reinforcement Learning
NIPS 2022
How Private Is Your RL Policy? An Inverse RL Based Analysis Framework
AAAI 2022
Operator Splitting Value Iteration
NIPS 2022
Globally Optimal Hierarchical Reinforcement Learning for Linearly-Solvable Markov Decision Processes
AAAI 2022
Fast and Data Efficient Reinforcement Learning from Pixels via Non-parametric Value Approximation
AAAI 2022
Using Reinforcement Learning for Operating Educational Campuses Safely during a Pandemic (Student Abstract)
AAAI 2022
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
AAAI 2022
Approximate Value Equivalence
NIPS 2022
Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions
NIPS 2022
Evaluating the perceived safety of urban city via maximum entropy deep inverse reinforcement learning
ACML 2022
Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets
NIPS 2022
Empirical Gateaux Derivatives for Causal Inference
NIPS 2022
Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis
NIPS 2022
End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps
CVPR 2022
Towards Real-World Navigation With Deep Differentiable Planners
CVPR 2022
Reinforcement Learning Based Dynamic Model Combination for Time Series Forecasting
AAAI 2022
Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
NIPS 2022
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
NIPS 2022