ICML 2024

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning