2025 ICML ICML 2025

Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation