Papers
Value Iteration Networks
NIPS 2016
Deep Exploration via Bootstrapped DQN
NIPS 2016
Universal Value Function Approximators
ICML 2015