Multi-Armed Bandits
1044 directly classified papers
Papers
Further Optimal Regret Bounds for Thompson Sampling
AISTATS 2013
Contextual Bandit Learning with Predictable Rewards
AISTATS 2012
Stochastic Bandit Based on Empirical Moments
AISTATS 2012
Multi-armed Bandit Problems with History
AISTATS 2012
Risk-Aversion in Multi-armed Bandits
NIPS 2012