Policy Optimization with Linear Temporal Logic Constraints

Cameron Voloshin; Hoang Le; Swarat Chaudhuri; Yisong Yue

2022 NIPS NeurIPS 2022

Policy Optimization with Linear Temporal Logic Constraints

Abstract

We study the problem of policy optimization (PO) with linear temporal logic (LTL) constraints. The language of LTL allows flexible description of tasks that may be unnatural to encode as a scalar cost function. We consider LTL-constrained PO as a systematic framework, decoupling task specification from policy selection, and an alternative to the standard of cost shaping. With access to a generative model, we develop a model-based approach that enjoys a sample complexity analysis for guaranteeing both task satisfaction and cost optimality (through a reduction to a reachability problem). Empirically, our algorithm can achieve strong performance even in low sample regimes.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Mathematics & Optimization and Reinforcement Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Cameron Voloshin , Hoang Le , Swarat Chaudhuri , Yisong Yue

Topics

Artificial Intelligence > Core AI > Planning Reinforcement Learning > Methods > Deep RL Mathematics & Optimization > Optimization > Optimal Control Artificial Intelligence > Core AI > Reinforcement Learning

Keywords

policy optimization sample complexity formal methods model-based reinforcement learning linear temporal logic

Download PDF

Related papers

Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching 2022

A Theoretical View on Sparsely Activated Networks 2022

Prune and distill: similar reformatting of image information along rat visual cortex and deep neural networks 2022

Matryoshka Representation Learning 2022

Off-Policy Evaluation with Deficient Support Using Side Information 2022