Neel Nanda
22 papers
· 2022–2025
· 5 conferences
· across top CS/AI conferences
Achievements
π£
Hot Topic Early Bird
π§
Keyword Pioneer
π
Conference Polyglot
(5)
π
Cross-Pollinator
(5)
π
Renaissance Researcher
(5)
πΊοΈ
Taxonomy Completionist
(19)
π
Interdisciplinary Bridge
π
Triple Crown
π
Keyword Champion
(2)
β‘
Prolific Year
(8)
β
The Questioner
(3)
π
Century Club
(22)
Conferences
ICML (7)
ICLR (6)
EMNLP (4)
NIPS (4)
JMLR (1)
Top co-authors
Keywords
mechanistic interpretability
(3)
sparse autoencoder
(3)
linear representation
(2)
imitation learning
(1)
sentiment analysis
(1)
model calibration
(1)
bayesian inference
(1)
policy learning
(1)
neural network interpretability
(1)
model analysis
(1)
group theory
(1)
model interpretability
(1)
latent representation
(1)
language model
(1)
world model
(1)
circuit analysis
(1)
feature decomposition
(1)
self-supervised learning
(1)
feature extraction
(1)
online learning
(1)
Papers
Language Models Linearly Represent Sentiment
EMNLP 2024
Fully General Online Imitation Learning
JMLR 2022