ICML 2025

MIB: A Mechanistic Interpretability Benchmark

Authors (23)

Aaron Mueller, Atticus Geiger, Sarah Wiegreffe, Dana Arad, Iván Arcuschin, Adam Belfki, Yik Siu Chan, Jaden Fried Fiotto-Kaufman, Tal Haklay, Michael Hanna, Jing Huang, Rohan Gupta, Yaniv Nikankin, Hadas Orgad, Nikhil Prakash, Anja Reusch, Aruna Sankaranarayanan, Shun Shao, Alessandro Stolfo, Martin Tutek, Amir Zur, David Bau, Yonatan Belinkov
