Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Machine Learning
›
Learning Types
›
Multi-Armed Bandits
1044 directly classified papers
Papers per year
2002: 1
2006: 2
2007: 3
2008: 5
2009: 3
2010: 5
2011: 23
2012: 16
2013: 32
2014: 42
2015: 27
2016: 33
2017: 46
2018: 55
2019: 80
2020: 87
2021: 124
2022: 160
2023: 136
2024: 126
2025: 38
Papers
Nearly Tight Bounds for Exploration in Streaming Multi-Armed Bandits with Known Optimality Gap
AAAI 2025
The Adaptive Q-Network for Recommendation Tasks with Dynamic Item Space
AAAI 2025
Prediction-Based Adaptive Variable Ordering Heuristics for Constraint Satisfaction Problems
AAAI 2025
Efficient Graph Bandit Learning with Side-Observations and Switching Constraints
AAAI 2025
PRIORITY2REWARD: Incorporating Healthworker Preferences for Resource Allocation Planning
AAAI 2025
Explicit and Implicit Examinee-Question Relation Exploiting for Efficient Computerized Adaptive Testing
AAAI 2025
Balans: Multi-Armed Bandits-based Adaptive Large Neighborhood Search for Mixed-Integer Programming Problems
IJCAI 2025
Randomised Optimism via Competitive Co-Evolution for Matrix Games with Bandit Feedback
IJCAI 2025
Constant-Factor Distortion Mechanisms for k-Committee Election
AAAI 2025
Batch Ensemble for Variance Dependent Regret in Stochastic Bandits
AAAI 2025
p-Mean Regret for Stochastic Bandits
AAAI 2025
Optimizing Vital Sign Monitoring in Resource-Constrained Maternal Care: An RL-Based Restless Bandit Approach
AAAI 2025
MARK: Multi-agent Collaboration with Ranking Guidance for Text-attributed Graph Clustering
ACL 2025
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs
AAAI 2025
A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs
JMLR 2025
Contextual Bandits with Stage-wise Constraints
JMLR 2025
Multi-Objective Neural Bandits with Random Scalarization
IJCAI 2025
Problem-dependent Regret for Lexicographic Multi-Armed Bandits with Adversarial Corruptions
IJCAI 2025
Public Opinion Field Effect and Hawkes Process Join Hands for Information Popularity Prediction
AAAI 2025
Robust Performance Incentivizing Algorithms for Multi-Armed Bandits with Strategic Agents
AAAI 2025
In-Domain African Languages Translation Using LLMs and Multi-armed Bandits
ACL 2025
Neural Combinatorial Clustered Bandits for Recommendation Systems
AAAI 2025
On the Asymptotic Optimality of Confidence Interval Based Algorithms for Fixed Confidence MABs
AAAI 2025
FCOM: A Federated Collaborative Online Monitoring Framework via Representation Learning
AAAI 2025
Every Bit Helps: Achieving the Optimal Distortion with a Few Queries
AAAI 2025
<
1
2
3
4
5
…
42
>