Paul F Christiano
3 papers · 2017–2022 · 1 conference · across top CS/AI conferences
Achievements
Keyword Pioneer
Academic Marathon (5)
Cross-Pollinator (15)
Renaissance Researcher (5)
Interdisciplinary Bridge
Hot Topic Early Bird
Taxonomy Completionist (10)
The Namer
Mega-Team (20)
Trend Setter
Conferences
NIPS (3)
Keywords
preference learning (2)
reinforcement learning (2)
instruction following (1)
language model alignment (1)
reinforcement learning from human feedback (1)
model alignment (1)
reward function (1)
human feedback (1)
language model (1)
reward model (1)
supervised fine-tuning (1)
human preference (1)
deep reinforcement learning (1)
trajectory segment (1)
reward modeling (1)