On Context-Dependent Clustering of Bandits

Claudio Gentile; Shuai Li; Purushottam Kar; Alexandros Karatzoglou; Giovanni Zappella; Evans Etrue

ICML 2017

Abstract

We investigate a novel cluster-of-bandit algorithm, CAB, for collaborative recommendation tasks. CAB implements the underlying feedback-sharing mechanism by estimating user neighborhoods in a context-dependent manner. It departs sharply from the state of the art by incorporating collaborative effects into both inference and learning, in a manner that seamlessly interleaves explore-exploit tradeoffs and collaborative steps. We prove regret bounds for CAB under various assumptions on the data; these bounds exhibit a crisp dependence on the expected number of clusters over the users, a natural measure of the statistical difficulty of the learning task. Experiments on production and real-world datasets show that CAB offers significantly increased prediction performance against a representative pool of state-of-the-art methods.
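To make the phrase "context-dependent neighborhoods" concrete, here is a minimal Python sketch of one plausible reading. It is an illustration under stated assumptions, not the CAB algorithm itself: each user is assumed to have a linear payoff model, two users count as neighbors for a given context when their predicted payoffs on that context differ by no more than the sum of their confidence widths, and feedback sharing is approximated by averaging predictions over the neighborhood. The names `neighborhood`, `pooled_estimate`, `weights`, and `widths` are hypothetical, and the widths are fixed scalars here, whereas in a real bandit they would depend on the context and shrink as observations accumulate.

```python
from typing import Dict, List, Sequence


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def neighborhood(
    user: str,
    context: Sequence[float],
    weights: Dict[str, Sequence[float]],  # assumed per-user linear models
    widths: Dict[str, float],             # assumed per-user confidence widths
) -> List[str]:
    """Users whose predicted payoff on `context` is close to `user`'s.

    "Close" means the gap is at most the sum of the two confidence widths,
    so the neighborhood can change from one context to the next.
    """
    target = dot(weights[user], context)
    members = []
    for other, w in weights.items():
        gap = abs(target - dot(w, context))
        if gap <= widths[user] + widths[other]:
            members.append(other)
    return sorted(members)


def pooled_estimate(
    user: str,
    context: Sequence[float],
    weights: Dict[str, Sequence[float]],
    widths: Dict[str, float],
) -> float:
    """Average predicted payoff over the user's neighborhood for `context`."""
    members = neighborhood(user, context, weights, widths)
    return sum(dot(weights[m], context) for m in members) / len(members)


# Toy data: "a" agrees with "b" along the first axis and with "c" along the second.
weights = {"a": [1.0, 0.0], "b": [1.0, 1.0], "c": [-1.0, 0.0]}
widths = {"a": 0.1, "b": 0.1, "c": 0.1}

print(neighborhood("a", [1.0, 0.0], weights, widths))
print(neighborhood("a", [0.0, 1.0], weights, widths))
```

In this toy data the neighborhood of user "a" contains "b" for the first context and "c" for the second, which is the sense in which the clustering depends on the context rather than being fixed per user.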

Topics

Machine Learning > Core Methods > Clustering
Machine Learning > Learning Types > Multi-Agent Systems
Machine Learning > Learning Types > Multi-Armed Bandits

Keywords

collaborative filtering; multi-armed bandit; regret bound; contextual bandit; explore-exploit tradeoff; cluster-of-bandit algorithm; collaborative recommendation; context-dependent clustering
