2025 COLT COLT 2025

Sample and Oracle Efficient Reinforcement Learning for MDPs with Linearly-Realizable Value Functions