ICML 2025

R*: Efficient Reward Design via Reward Structure Evolution and Parameter Alignment Optimization with Large Language Models