π-Light: Programmatic Interpretable Reinforcement Learning for Resource-Limited Traffic Signal Control

Yin Gu; Kai Zhang; Qi Liu; Weibo Gao; Longfei Li; Jun Zhou

2024 AAAI AAAI 2024

π-Light: Programmatic Interpretable Reinforcement Learning for Resource-Limited Traffic Signal Control

Abstract

Abstract The recent advancements in Deep Reinforcement Learning (DRL) have significantly enhanced the performance of adaptive Traffic Signal Control (TSC). However, DRL policies are typically represented by neural networks, which are over-parameterized black-box models. As a result, the learned policies often lack interpretability, and cannot be deployed directly in the real-world edge hardware due to resource constraints. In addition, the DRL methods often exhibit limited generalization performance, struggling to generalize the learned policy to other geographical regions. These factors limit the practical application of learning-based approaches. To address these issues, we suggest the use of an inherently interpretable program for representing the control policy. We present a new approach, Programmatic Interpretable reinforcement learning for traffic signal control (π-light), designed to autonomously discover non-differentiable programs. Specifically, we define a Domain Specific Language (DSL) and transformation rules for constructing programs, and utilize Monte Carlo Tree Search (MCTS) to find the optimal program in a discrete space. Extensive experiments demonstrate that our method consistently outperforms baseline approaches. Moreover, π-Light exhibits superior generalization capabilities compared to DRL, enabling training and evaluation across intersections from different cities. Finally, we analyze how the learned program policies can directly deploy on edge devices with extremely limited resources.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — programmatic interpretability

🐣 Hot Topic Early Bird — edge deployment

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Yin Gu , Kai Zhang , Qi Liu , Weibo Gao , Longfei Li , Jun Zhou

Topics

Artificial Intelligence > Core AI > Interpretability Machine Learning > Application Areas > Efficient Computing Reinforcement Learning > Applications > Robotics Artificial Intelligence > Core AI > Reinforcement Learning

Keywords

monte carlo tree search generalization capability edge deployment traffic signal control domain specific language programmatic interpretability programmatic interpretable reinforcement learning

Download PDF

Related papers

Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI 2024

Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables 2024

Suppressing Uncertainty in Gaze Estimation 2024

Mask-Homo: Pseudo Plane Mask-Guided Unsupervised Multi-Homography Estimation 2024

Heterogeneous Test-Time Training for Multi-Modal Person Re-identification 2024