2025 ICML ICML 2025

Penalizing Infeasible Actions and Reward Scaling in Reinforcement Learning with Offline Data