2025 L4DC L4DC 2025

Approximate Thompson Sampling for Learning Linear Quadratic Regulators with $O(\sqrtT)$ Regret