L4DC 2025

WAVE: Wasserstein Adaptive Value Estimation for Actor-Critic Reinforcement Learning