Yuanzhao Zhai
5 papers
· 2024–2025
· 3 conferences
· across top CS/AI conferences
Achievements
π
Conference Polyglot
(3)
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(15)
π§
Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
AAAI (3)
ACL (1)
ICML (1)
Top co-authors
Keywords
large language model
(3)
reinforcement learning
(2)
offline reinforcement learning
(1)
policy optimization
(1)
preference learning
(1)
preference optimization
(1)
out-of-distribution generalization
(1)
markov decision process
(1)
model alignment
(1)
model-based reinforcement learning
(1)
monte carlo tree search
(1)
human feedback
(1)
influence function
(1)
model collapse
(1)
policy regularization
(1)
ai alignment
(1)
multi-agent system
(1)
pessimistic markov decision process
(1)
optimistic rollout
(1)
direct policy optimization
(1)