ICML 2024

Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States