2025 ICML ICML 2025

Active Reward Modeling: Adaptive Preference Labeling for Large Language Model Alignment