Evan Hubinger
2 papers · 2023–2024 · 1 conference · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Keyword Pioneer
Cross-Pollinator (6)
Taxonomy Completionist (12)
Mega-Team (63)
Conferences
ACL (2)
Top co-authors
Keywords
contrastive learning (1)
knowledge editing (1)
factual knowledge (1)
language model evaluation (1)
model behavior (1)
reinforcement learning from human feedback (1)
language model (1)
steering vector (1)
residual stream (1)
inverse scaling (1)
model-written evaluation (1)
behavior discovery (1)
activation steering (1)
large language model (1)