Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Methods
Reinforcement Learning
›
Methods
›
Policy Learning
2068 directly classified papers
Papers per year
2002: 6
2003: 1
2004: 1
2006: 11
2007: 10
2008: 14
2009: 9
2010: 23
2011: 15
2012: 25
2013: 25
2014: 24
2015: 23
2016: 27
2017: 61
2018: 107
2019: 187
2020: 216
2021: 274
2022: 259
2023: 321
2024: 247
2025: 153
2026: 29
Papers
Learning of Non-Parametric Control Policies with High-Dimensional State Features
AISTATS 2015
Abstraction Selection in Model-based Reinforcement Learning
ICML 2015
Shared Autonomy via Hindsight Optimization
RSS 2015
Robust Trajectory Optimization: A Cooperative Stochastic Game Theoretic Approach
RSS 2015
Cover Tree Bayesian Reinforcement Learning
JMLR 2014
Probabilistic Differential Dynamic Programming
NIPS 2014
Universal Option Models
NIPS 2014
Probably Approximately Correct MDP Learning and Control With Temporal Logic Constraints
RSS 2014
Pre- and Post-Contact Policy Decomposition for Planar Contact Manipulation Under Uncertainty
RSS 2014
Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics
NIPS 2014
Optimizing Energy Production Using Policy Search and Predictive State Representations
NIPS 2014
Bayes-Adaptive Simulation-based Search with Value Function Approximation
NIPS 2014
Near-optimal Reinforcement Learning in Factored MDPs
NIPS 2014
Dynamic Programming Boosting for Discriminative Macro-Action Discovery
ICML 2014
Approximate Policy Iteration Schemes: A Comparison
ICML 2014
Online Multi-Task Learning for Policy Gradient Methods
ICML 2014
A new Q(lambda) with interim forward view and Monte Carlo equivalence
ICML 2014
PAC-inspired Option Discovery in Lifelong Reinforcement Learning
ICML 2014
Using Trajectory Data to Improve Bayesian Optimization for Reinforcement Learning
JMLR 2014
Policy Evaluation with Temporal Differences: A Survey and Comparison
JMLR 2014
Active Contextual Policy Search
JMLR 2014
Multi-Objective Reinforcement Learning using Sets of Pareto Dominating Policies
JMLR 2014
Tracking Adversarial Targets
ICML 2014
Learning Complex Neural Network Policies with Trajectory Optimization
ICML 2014
Sparse Reinforcement Learning via Convex Optimization
ICML 2014
<
1
…
76
77
78
…
83
>