2025
ICML
ICML 2025
RE-Bench: Evaluating Frontier AI R&D Capabilities of Language Model Agents against Human Experts
👥
Mega-Team
— 22 authors
Authors
Hjalmar Wijk
,
Tao Roa Lin
,
Joel Becker
,
Sami Jawhar
,
Neev Parikh
,
Thomas Broadley
,
Lawrence Chan
,
Michael Chen
,
Joshua M Clymer
,
Jai Dhyani
,
Elena Ericheva
,
Katharyn Garcia
,
Brian Goodrich
,
Nikola Jurkovic
,
Megan Kinniment
,
Aron Lajko
,
Seraphina Nix
,
Lucas Jun Koba Sato
,
William Saunders
,
Maksym Taran
,
Ben West
,
Elizabeth Barnes