Hiroki Furuta
12 papers · 2021–2025 · 3 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (15)
Keyword Pioneer
Conference Polyglot (3)
Cross-Pollinator (13)
Renaissance Researcher (5)
Interdisciplinary Bridge
Triple Crown
Dynamic Duo (10)
Unstoppable (5)
Century Club (12)
Conferences
ICLR (6)
ICML (4)
NIPS (2)
Top co-authors
Research topics
Keywords
deep reinforcement learning (2)
reinforcement learning (1)
preference learning (1)
direct preference optimization (1)
language model alignment (1)
reinforcement learning from human feedback (1)
model alignment (1)
mutual information (1)
loss function (1)
reward model (1)
reward shaping (1)
soft label (1)
geometric average (1)
information-theoretic measure (1)
preference distribution (1)
large language model alignment (1)
task difficulty (1)
task complexity (1)
policy information capacity (1)
information-theoretic metric (1)