2025
ICML
ICML 2025
MIB: A Mechanistic Interpretability Benchmark
👥
Mega-Team
— 23 authors
Authors
Aaron Mueller
,
Atticus Geiger
,
Sarah Wiegreffe
,
Dana Arad
,
Iván Arcuschin
,
Adam Belfki
,
Yik Siu Chan
,
Jaden Fried Fiotto-Kaufman
,
Tal Haklay
,
Michael Hanna
,
Jing Huang
,
Rohan Gupta
,
Yaniv Nikankin
,
Hadas Orgad
,
Nikhil Prakash
,
Anja Reusch
,
Aruna Sankaranarayanan
,
Shun Shao
,
Alessandro Stolfo
,
Martin Tutek
,
Amir Zur
,
David Bau
,
Yonatan Belinkov