2025 ICML ICML 2025

Towards Theoretical Understanding of Sequential Decision Making with Preference Feedback