Mohammad Ghavamzadeh
88 papers
· 2006–2026
· 11 conferences
· across top CS/AI conferences
Achievements
π
Renaissance Researcher
(5)
π§
Keyword Pioneer
π
Conference Polyglot
(11)
π£
Hot Topic Early Bird
π
Cross-Pollinator
(12)
πΊοΈ
Taxonomy Completionist
(32)
π
Interdisciplinary Bridge
π
Academic Marathon
(19)
π
Conference Loyalist
(26)
π
Keyword Trendsetter Combo
(4)
π±
Topic Pioneer
π
Triple Crown
π¬
Deep Specialist
(10)
π§¬
Topic Evolution
π
Keyword Champion
π
Grand Slam
π€
Dynamic Duo
(18)
ποΈ
Keyword Collector
(115)
π
Trend Setter
π₯
Unstoppable
(16)
π
Conference Pioneer
π
Century Club
(87)
β‘
Prolific Year
(10)
Conferences
NIPS (26)
ICML (18)
AISTATS (13)
JMLR (8)
ICLR (7)
IJCAI (6)
AAAI (4)
L4DC (2)
UAI (2)
ACML (1)
CORL (1)
Top co-authors
Research topics
Keywords
reinforcement learning
(16)
regret bound
(15)
multi-armed bandit
(14)
markov decision process
(13)
policy gradient
(12)
dynamic programming
(7)
policy iteration
(7)
policy learning
(6)
sample complexity
(5)
contextual bandit
(5)
regret minimization
(5)
model-based reinforcement learning
(5)
online algorithm
(5)
temporal difference learning
(4)
value iteration
(4)
sequential decision making
(4)
sample efficiency
(4)
stochastic optimization
(4)
thompson sampling
(4)
online learning
(4)
Papers
Multiple-policy High-confidence Policy Evaluation
AISTATS 2023
Entropic Risk Optimization in Discounted MDPs
AISTATS 2023
Thompson Sampling with a Mixture Prior
AISTATS 2022
Operator Splitting Value Iteration
NIPS 2022
Deep Hierarchy in Bandits
ICML 2022
Hierarchical Bayesian Bandits
AISTATS 2022
Mirror Descent Policy Optimization
ICLR 2022
Stochastic Bandits with Linear Constraints
AISTATS 2021
Variational Model-based Policy Optimization
IJCAI 2021
Neural Lyapunov Redesign
L4DC 2021
Conservative Exploration in Reinforcement Learning
AISTATS 2020
Randomized Exploration in Generalized Linear Bandits
AISTATS 2020
Online Planning with Lookahead Policies
NIPS 2020
Optimizing over a Restricted Policy Class in MDPs
AISTATS 2019
Robust Locally-Linear Controllable Embedding
AISTATS 2018
Conservative Contextual Linear Bandits
NIPS 2017
High Confidence Policy Improvement
ICML 2015
Algorithms for CVaR Optimization in MDPs
NIPS 2014
Multi-Bandit Best Arm Identification
NIPS 2011
Speedy Q-Learning
NIPS 2011
LSTD with Random Projections
NIPS 2010
Regularized Policy Iteration
NIPS 2008
Bayesian Policy Gradient Algorithms
NIPS 2006