Research Explorer
Multi-Armed Bandits
1044 directly classified papers
Papers per year
2002: 1
2006: 2
2007: 3
2008: 5
2009: 3
2010: 5
2011: 23
2012: 16
2013: 32
2014: 42
2015: 27
2016: 33
2017: 46
2018: 55
2019: 80
2020: 87
2021: 124
2022: 160
2023: 136
2024: 126
2025: 38
Papers
No-Regret Bandit Exploration based on Soft Tree Ensemble Model
NIPS 2024
Online Learning with Sublinear Best-Action Queries
NIPS 2024
Identifying Copeland Winners in Dueling Bandits with Indifferences
AISTATS 2024
Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits
NIPS 2024
Global Rewards in Restless Multi-Armed Bandits
NIPS 2024
Fast Proxy Experiment Design for Causal Effect Identification
NIPS 2024
Piecewise-Stationary Bandits with Knapsacks
NIPS 2024
No-Regret M$^{\natural}$-Concave Function Maximization: Stochastic Bandit Algorithms and NP-Hardness of Adversarial Full-Information Setting
NIPS 2024
On the Minimax Regret for Contextual Linear Bandits and Multi-Armed Bandits with Expert Advice
NIPS 2024
Stochastic contextual bandits with graph feedback: from independence number to MAS number
NIPS 2024
Reinforcement Learning with Lookahead Information
NIPS 2024
Queueing Matching Bandits with Preference Feedback
NIPS 2024
Optimal Top-Two Method for Best Arm Identification and Fluid Analysis
NIPS 2024
Putting Gale & Shapley to Work: Guaranteeing Stability Through Learning
NIPS 2024
Almost Free: Self-concordance in Natural Exponential Families and an Application to Bandits
NIPS 2024
Improved Bayes Regret Bounds for Multi-Task Hierarchical Bayesian Bandit Algorithms
NIPS 2024
Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates
NIPS 2024
Sequential learning of the Pareto front for multi-objective bandits
AISTATS 2024
Near Optimal Adversarial Attacks on Stochastic Bandits and Defenses with Smoothed Responses
AISTATS 2024
Bandits with Concave Aggregated Reward
IJCAI 2024
Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation
NIPS 2024
Strategic Aspects of Stable Matching Markets: A Survey
IJCAI 2024
Bandits with Ranking Feedback
NIPS 2024
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization
AAAI 2024
Randomized Strategic Facility Location with Predictions
NIPS 2024