Jacob Hilton
7 papers
· 2020–2025
· 4 conferences
· across top CS/AI conferences
Achievements
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(18)
π£
Hot Topic Early Bird
π§
Keyword Pioneer
π
Conference Polyglot
(4)
π
Academic Marathon
(5)
π
Cross-Pollinator
(7)
π
Renaissance Researcher
(5)
π₯
Mega-Team
(20)
π
Trend Setter
Conferences
ICML (3)
NIPS (2)
ACL (1)
ICLR (1)
Top co-authors
Keywords
reinforcement learning
(3)
reinforcement learning from human feedback
(2)
reward model
(2)
sample efficiency
(2)
model evaluation
(1)
falsehood detection
(1)
instruction following
(1)
language model alignment
(1)
model alignment
(1)
value function
(1)
off-policy learning
(1)
human feedback
(1)
language model
(1)
synthetic datum
(1)
scaling law
(1)
supervised fine-tuning
(1)
proximal policy optimization
(1)
on-policy learning
(1)
actor-critic method
(1)
feature distillation
(1)
Papers
Phasic Policy Gradient
ICML 2021