Submodular Bandit Problem Under Multiple Constraints

Sho Takemori; Masahiro Sato; Takashi Sonoda; Janmajay Singh; Tomoko Ohkuma

2020 UAI UAI 2020

Submodular Bandit Problem Under Multiple Constraints

Abstract

The linear submodular bandit problemwas proposedto simultaneously address diversified retrieval and online learning in a recommender system.If there is no uncertainty, this problem is equivalent toa submodular maximization problem under a cardinality constraint.However, in some situations, recommendation lists should satisfyadditional constraints such as budget constraints,other than a cardinality constraint.Thus, motivated by diversified retrieval considering budget constraints,we introduce a submodular bandit problem under the intersection of$l$ knapsacks and a $k$-system constraint.Here $k$-system constraints forma very general class of constraints includingcardinality constraints and the intersection of $k$ matroid constraints.To solve this problem, we propose a non-greedy algorithm that adaptively focuses ona standard or modified upper-confidence bound.We provide a high-probability upper bound of an approximation regret,where the approximation ratio matches that of a fast offline algorithm.Moreover, we perform experimentsunder various combinations of constraints using asynthetic and two real-world datasetsand demonstrate that our proposed method outperforms the existing baselines.

🌉 Interdisciplinary Bridge — Data Science & Analytics and Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — submodular bandit

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Sho Takemori , Masahiro Sato , Takashi Sonoda , Janmajay Singh , Tomoko Ohkuma

Topics

Data Science & Analytics > Applications > Recommender Systems Mathematics & Optimization > Optimization > Combinatorial Optimization Machine Learning > Learning Types > Online Learning

Keywords

online learning regret bound recommender system knapsack constraint submodular bandit

Download PDF

Related papers

Walking on Two Legs: Learning Image Segmentation with Noisy Labels 2020

Finite-Memory Near-Optimal Learning for Markov Decision Processes with Long-Run Average Reward 2020

Automated Dependence Plots 2020

Collapsible IDA: Collapsing Parental Sets for Locally Estimating Possible Causal Effects 2020

Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect 2020