Reinforcement Learning
2932 directly classified papers
Papers per year
Papers
Monotone multi-armed bandit allocations
COLT 2011
Transfer from Multiple MDPs
NIPS 2011
The Fixed Points of Off-Policy TD
NIPS 2011
Contextual Bandits with Linear Payoff Functions
AISTATS 2011
LSTD with Random Projections
NIPS 2010
Model-Free Monte Carlo-like Policy Evaluation
AISTATS 2010