Papers
Double Q-learning
NIPS 2010
Active Sequential Learning with Tactile Feedback
AISTATS 2010
Reward Design via Online Gradient Ascent
NIPS 2010
Bootstrapping Apprenticeship Learning
NIPS 2010