Papers
MDPs with Non-Deterministic Policies
NIPS 2008
Regularized Policy Iteration
NIPS 2008
Learning Operational Space Control
RSS 2006