ICML 2025

Position: Medical Large Language Model Benchmarks Should Prioritize Construct Validity