Maximizing the Success Probability of Policy Allocations in Online Systems

Artem Betlei; Mariia Vladimirova; Mehdi Sebbar; Nicolas Urien; Thibaud Rahier; Benjamin Heymann

2024 AAAI AAAI 2024

Maximizing the Success Probability of Policy Allocations in Online Systems

Abstract

Abstract The effectiveness of advertising in e-commerce largely depends on the ability of merchants to bid on and win impressions for their targeted users. The bidding procedure is highly complex due to various factors such as market competition, user behavior, and the diverse objectives of advertisers. In this paper we consider the problem at the level of user timelines instead of individual bid requests, manipulating full policies (i.e. pre-defined bidding strategies) and not bid values. In order to optimally allocate policies to users, typical multiple treatments allocation methods solve knapsack-like problems which aim at maximizing an expected value under constraints. In the specific context of online advertising, we argue that optimizing for the probability of success is a more suited objective than expected value maximization, and we introduce the SuccessProbaMax algorithm that aims at finding the policy allocation which is the most likely to outperform a fixed reference policy. Finally, we conduct comprehensive experiments both on synthetic and real-world data to evaluate its performance. The results demonstrate that our proposed algorithm outperforms conventional expected-value maximization algorithms in terms of success rate.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — policy allocation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy

Authors

Artem Betlei , Mariia Vladimirova , Mehdi Sebbar , Nicolas Urien , Thibaud Rahier , Benjamin Heymann

Topics

Machine Learning > Optimization & Theory > Optimization Machine Learning > Application Areas > Risk Management Machine Learning > Learning Types > Online Learning Mathematics & Optimization > Optimization > Optimization Machine Learning > Learning Types > Multi-Armed Bandits

Keywords

online advertising optimization algorithm knapsack problem policy allocation success probability knapsack optimization bidding strategy

Download PDF

Related papers

Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI 2024

Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables 2024

Suppressing Uncertainty in Gaze Estimation 2024

Mask-Homo: Pseudo Plane Mask-Guided Unsupervised Multi-Homography Estimation 2024

Heterogeneous Test-Time Training for Multi-Modal Person Re-identification 2024