2024 ICML ICML 2024

Position: An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive Neuroscience