ICML 2024

Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization