Rupam Mahmood
3 papers
· 2022–2022
· 2 conferences
· across top CS/AI conferences
Achievements
🌍
Conference Polyglot
(2)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🐝
Cross-Pollinator
(15)
Conferences
AISTATS (2)
ICML (1)
Top co-authors
Keywords
policy gradient
(3)
sample efficiency
(2)
reinforcement learning
(2)
off-policy learning
(1)
proximal policy optimization
(1)
model-free learning
(1)
gradient critic
(1)
reward gradient
(1)
softmax policy
(1)
policy saturation
(1)
softmax policies
(1)
bellman equation
(1)
temporal-difference learning
(1)