Hanbo Zhang
7 papers · 2021–2025 · 6 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Conference Polyglot (6)
Renaissance Researcher (5)
Cross-Pollinator (15)
Taxonomy Completionist (12)
Hot Topic Early Bird
The Questioner
Conferences
CoRL (2)
EMNLP (1)
ICLR (1)
IJCAI (1)
NAACL (1)
RSS (1)
Top co-authors
Keywords
reinforcement learning (1)
policy gradient (1)
object detection (1)
in-context learning (1)
multimodal learning (1)
visual grounding (1)
instruction following (1)
POMDP planning (1)
robot manipulation (1)
model training (1)
vision language model (1)
multimodal large language model (1)
sparse reward (1)
hindsight experience replay (1)
task planning (1)
image understanding (1)
spatial reasoning (1)
partial observation (1)
trust region policy optimization (1)
embodied navigation (1)
Papers
Hindsight Trust Region Policy Optimization
IJCAI 2021