Low-Rank Bandit Methods for High-Dimensional Dynamic Pricing

Jonas W Mueller; Vasilis Syrgkanis; Matt Taddy

2019 NIPS NeurIPS 2019

Low-Rank Bandit Methods for High-Dimensional Dynamic Pricing

Abstract

We consider dynamic pricing with many products under an evolving but low-dimensional demand model. Assuming the temporal variation in cross-elasticities exhibits low-rank structure based on fixed (latent) features of the products, we show that the revenue maximization problem reduces to an online bandit convex optimization with side information given by the observed demands. We design dynamic pricing algorithms whose revenue approaches that of the best fixed price vector in hindsight, at a rate that only depends on the intrinsic rank of the demand model and not the number of products. Our approach applies a bandit convex optimization algorithm in a projected low-dimensional space spanned by the latent product features, while simultaneously learning this span via online singular value decomposition of a carefully-crafted matrix containing the observed demands.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — demand model

🐣 Hot Topic Early Bird — singular value decomposition

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Jonas W Mueller , Vasilis Syrgkanis , Matt Taddy

Topics

Machine Learning > Learning Types > Unsupervised Learning Mathematics & Optimization > Optimization > Online Algorithms

Keywords

online optimization singular value decomposition bandit algorithm low-rank model dynamic pricing demand model

Download PDF

Related papers

Two Generator Game: Learning to Sample via Linear Goodness-of-Fit Test 2019

Metalearned Neural Memory 2019

Model Similarity Mitigates Test Set Overuse 2019

Continual Unsupervised Representation Learning 2019

Reinforcement Learning with Convex Constraints 2019