Shie Mannor
143 papers · 2003–2025 · 15 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (40)
Academic Marathon (22)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (15)
Hot Topic Early Bird
Renaissance Researcher (7)
Cross-Pollinator (13)
Conference Loyalist (41)
Keyword Trendsetter Combo (5)
Keyword Champion (3)
Triple Crown
Topic Pioneer
Deep Specialist (18)
Dynamic Duo (19)
Grand Slam
Keyword Collector (210)
The Questioner (2)
Trend Setter
Conference Pioneer
Unstoppable (18)
Prolific Year (10)
Century Club (143)
Conferences
ICML (47)
NIPS (41)
COLT (13)
AAAI (11)
JMLR (10)
ICLR (7)
UAI (4)
AISTATS (2)
CVPR (2)
ACML (1)
ALT (1)
CORL (1)
IJCAI (1)
RSS (1)
WACV (1)
Top co-authors
Research topics
Keywords
reinforcement learning (32)
online learning (25)
regret bound (21)
multi-armed bandit (13)
markov decision process (12)
policy gradient (11)
robust optimization (9)
regret minimization (8)
stochastic optimization (8)
sample complexity (6)
contextual bandit (6)
policy optimization (6)
value function (6)
model-based reinforcement learning (6)
policy iteration (5)
game theory (5)
deep reinforcement learning (5)
temporal difference learning (5)
thompson sampling (5)
robust markov decision process (5)
Papers
Policy Gradient with Tree Expansion
ICML 2025
Online Apprenticeship Learning
AAAI 2022
The Geometry of Robust Value Functions
ICML 2022
Reinforcement Learning with a Terminator
NIPS 2022
Sim and Real: Better Together
NIPS 2021
Lenient Regret for Multi-Armed Bandits
AAAI 2021
Online Planning with Lookahead Policies
NIPS 2020
Reward Constrained Policy Optimization
ICLR 2019
The Natural Language of Actions
ICML 2019
Rotting Bandits
NIPS 2017
Consistent On-Line Off-Policy Evaluation
ICML 2017
Sensor Selection for Crowdsensing Dynamical Systems
AISTATS 2015
Latent Bandits
ICML 2014
Approachability, fast and slow
COLT 2013
Online PCA for Contaminated Data
NIPS 2013
The Perturbed Variation
NIPS 2012
Statistical Optimization in High Dimensions
AISTATS 2012
Committing Bandits
NIPS 2011
Regularized Policy Iteration
NIPS 2008
Robust Regression and Lasso
NIPS 2008