Adrià Garriga-Alonso
10 papers
· 2019–2025
· 3 conferences
· across top CS/AI conferences
Achievements
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🌈
Renaissance Researcher
(5)
🗺️
Taxonomy Completionist
(21)
🐣
Hot Topic Early Bird
🌍
Conference Polyglot
(3)
🏃
Academic Marathon
(6)
🐝
Cross-Pollinator
(11)
🏆
Keyword Champion
(2)
💎
Century Club
(10)
🔥
Unstoppable
(5)
Conferences
NIPS (5)
ICLR (3)
UAI (2)
Top co-authors
Keywords
mechanistic interpretability
(3)
circuit discovery
(2)
neural network analysis
(2)
language model
(2)
policy optimization
(1)
data augmentation
(1)
kl divergence
(1)
model behavior
(1)
model interpretability
(1)
reinforcement learning from human feedback
(1)
hypothesis testing
(1)
convolutional neural network
(1)
heavy-tailed distribution
(1)
reward misspecification
(1)
reward hacking
(1)
circuit analysis
(1)
neural network verification
(1)
bayesian neural network
(1)
steering vector
(1)
causal model
(1)
Papers
Bayesian Neural Network Priors Revisited
ICLR 2022