Papers
Deep Exploration via Bootstrapped DQN
NIPS 2016
Trust Region Policy Optimization
ICML 2015
Universal Option Models
NIPS 2014
Reinforcement learning with value advice
ACML 2014
Predicting Dynamic Difficulty
NIPS 2011