Multiple Mean-Payoff Optimization Under Local Stability Constraints

David Klaška; Antonín Kučera; Vojtěch Kůr; Vít Musil; Vojtěch Řehák

2025 AAAI AAAI 2025

Multiple Mean-Payoff Optimization Under Local Stability Constraints

Abstract

Abstract The long-run average payoff per transition (mean payoff) is the main tool for specifying the performance and dependability properties of discrete systems. The problem of constructing a controller (strategy) simultaneously optimizing several mean payoffs has been deeply studied for stochastic and game-theoretic models. One common issue of the constructed controllers is the instability of the mean payoffs, measured by the deviations of the average rewards per transition computed in a finite "window" sliding along a run. Unfortunately, the problem of simultaneously optimizing the mean payoffs under local stability constraints is computationally hard, and the existing works do not provide a practically usable algorithm even for non-stochastic models such as two-player games. In this paper, we design and evaluate the first efficient and scalable solution to this problem applicable to Markov decision processes.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — mean payoff optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

David Klaška , Antonín Kučera , Vojtěch Kůr , Vít Musil , Vojtěch Řehák

Topics

Artificial Intelligence > Core AI > Planning Machine Learning > Optimization & Theory > Optimization Mathematics & Optimization > Optimization > Optimization Mathematics & Optimization > Probability > Stochastic Processes Machine Learning > Learning Types > Optimization

Keywords

markov decision process sliding window controller synthesis local stability mean payoff optimization local stability constraint mean-payoff optimization discrete system

Download PDF

Related papers

BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving 2025

APIRL: Deep Reinforcement Learning for REST API Fuzzing 2025

Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation 2025

3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Detection 2025

Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics 2025