Papers
Combinatorial Bandits Revisited
NIPS 2015
Bias in Natural Actor-Critic Algorithms
ICML 2014
Deterministic Policy Gradient Algorithms
ICML 2014