Research Explorer
Papers Conferences Authors Topics Keywords Trends Achievements Explore
← Back to papers
2025 ICML ICML 2025

Position: In-House Evaluation Is Not Enough. Towards Robust Third-Party Evaluation and Flaw Disclosure for General-Purpose AI

👥 Mega-Team — 34 authors

Authors

Shayne Longpre , Kevin Klyman , Ruth Elisabeth Appel , Sayash Kapoor , Rishi Bommasani , Michelle Sahar , Sean McGregor , Avijit Ghosh , Borhane Blili-Hamelin , Nathan Butters , Alondra Nelson , Dr. Amit Elazari , Andrew Sellars , Casey John Ellis , Dane Sherrets , Dawn Song , Harley Geiger , Ilona Cohen , Lauren Mcilvenny , Madhulika Srikumar , Mark M. Jaycox , Markus Anderljung , Nadine Farid Johnson , Nicholas Carlini , Nicolas Miailhe , Nik Marda , Peter Henderson , Rebecca S. Portnoff , Rebecca Weiss , Victoria Westerhoff , Yacine Jernite , Rumman Chowdhury , Percy Liang , Arvind Narayanan
Download PDF

Related papers

Scaling Sparse Feature Circuits For Studying In-Context Learning 2025
Incremental Gradient Descent with Small Epoch Counts is Surprisingly Slow on Ill-Conditioned Problems 2025
SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics 2025
Batch List-Decodable Linear Regression via Higher Moments 2025
GS-Bias: Global-Spatial Bias Learner for Single-Image Test-Time Adaptation of Vision-Language Models 2025