Research Explorer
Multi-Armed Bandits
1044 directly classified papers
Papers per year
2002: 1
2006: 2
2007: 3
2008: 5
2009: 3
2010: 5
2011: 23
2012: 16
2013: 32
2014: 42
2015: 27
2016: 33
2017: 46
2018: 55
2019: 80
2020: 87
2021: 124
2022: 160
2023: 136
2024: 126
2025: 38
Papers
Learning the Learning Rate for Prediction with Expert Advice
NIPS 2014
Efficient Partial Monitoring with Prior Information
NIPS 2014
The Blinded Bandit: Learning with Adaptive Feedback
NIPS 2014
Extreme bandits
NIPS 2014
Almost Optimal Exploration in Multi-Armed Bandits
ICML 2013
How to Hedge an Option Against an Adversary: Black-Scholes Pricing is Minimax Optimal
NIPS 2013
Information Complexity in Bandit Subset Selection
COLT 2013
Beating Bandits in Gradually Evolving Worlds
COLT 2013
Bounded regret in stochastic multi-armed bandits
COLT 2013
On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization
COLT 2013
Opportunistic Strategies for Generalized No-Regret Problems
COLT 2013
Multiple Identifications in Multi-Armed Bandits
ICML 2013
Regret Minimization for Branching Experts
COLT 2013
A near-optimal algorithm for finite partial-monitoring games against adversarial opponents
COLT 2013
Adaptive Submodular Maximization in Bandit Setting
NIPS 2013
(Nearly) Optimal Algorithms for Private Online Learning in Full-information and Bandit Settings
NIPS 2013
Thompson Sampling for 1-Dimensional Exponential Family Bandits
NIPS 2013
From Bandits to Experts: A Tale of Domination and Independence
NIPS 2013
Online Learning with Switching Costs and Other Adaptive Adversaries
NIPS 2013
High-Dimensional Gaussian Process Bandits
NIPS 2013
Estimation Bias in Multi-Armed Bandit Algorithms for Search Advertising
NIPS 2013
Forgetful Bayes and myopic planning: Human learning and decision-making in a bandit setting
NIPS 2013
Eluder Dimension and the Sample Complexity of Optimistic Exploration
NIPS 2013
Prior-free and prior-dependent regret bounds for Thompson Sampling
NIPS 2013
A Gang of Bandits
NIPS 2013