2025 AISTATS AISTATS 2025

Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control