Papers
Imitation Learning by Coaching
NIPS 2012
Formalizing Assistive Teleoperation
RSS 2012
Committing Bandits
NIPS 2011
Transfer from Multiple MDPs
NIPS 2011
Speedy Q-Learning
NIPS 2011
The Fixed Points of Off-Policy TD
NIPS 2011