← Learning Types

Machine Learning › Learning Types ›

Multi-Armed Bandits

1044 directly classified papers

Papers per year

Papers

No-Regret Bandit Exploration based on Soft Tree Ensemble Model NIPS 2024

Online Learning with Sublinear Best-Action Queries NIPS 2024

Identifying Copeland Winners in Dueling Bandits with Indifferences AISTATS 2024

Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits NIPS 2024

Global Rewards in Restless Multi-Armed Bandits NIPS 2024

Fast Proxy Experiment Design for Causal Effect Identification NIPS 2024

Piecewise-Stationary Bandits with Knapsacks NIPS 2024

No-Regret M${}^{\natural}$-Concave Function Maximization: Stochastic Bandit Algorithms and NP-Hardness of Adversarial Full-Information Setting NIPS 2024

On the Minimax Regret for Contextual Linear Bandits and Multi-Armed Bandits with Expert Advice NIPS 2024

Stochastic contextual bandits with graph feedback: from independence number to MAS number NIPS 2024

Reinforcement Learning with Lookahead Information NIPS 2024

Queueing Matching Bandits with Preference Feedback NIPS 2024

Optimal Top-Two Method for Best Arm Identification and Fluid Analysis NIPS 2024

Putting Gale & Shapley to Work: Guaranteeing Stability Through Learning NIPS 2024

Almost Free: Self-concordance in Natural Exponential Families and an Application to Bandits NIPS 2024

Improved Bayes Regret Bounds for Multi-Task Hierarchical Bayesian Bandit Algorithms NIPS 2024

Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates NIPS 2024

Sequential learning of the Pareto front for multi-objective bandits AISTATS 2024

Near Optimal Adversarial Attacks on Stochastic Bandits and Defenses with Smoothed Responses AISTATS 2024

Bandits with Concave Aggregated Reward IJCAI 2024

Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation NIPS 2024

Strategic Aspects of Stable Matching Markets: A Survey IJCAI 2024

Bandits with Ranking Feedback NIPS 2024

Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization AAAI 2024

Randomized Strategic Facility Location with Predictions NIPS 2024