Abstract
Signalized intersections are managed by controllers that assign right of way (green, yellow, and red lights) to non-conflicting directions. Optimizing the actuation policy of such controllers is expected to alleviate traffic congestion and its adverse impact. Given such a safety-critical domain, the affiliated actuation policy is required to be interpretable in a way that can be understood and regulated by a human. This paper presents and analyzes several on-line optimization techniques for tuning interpretable control functions. Although these techniques are defined in a general way, this paper assumes a specific class of interpretable control functions (polynomial functions) for analysis purposes. We show that such an interpretable policy function can be as effective as a deep neural network for approximating an optimized signal actuation policy. We present empirical evidence that supports the use of value-based reinforcement learning for on-line training of the control function. Specifically, we present and study three variants of the Deep Q-learning algorithm that allow the training of an interpretable policy function. Our Deep Regulatable Hardmax Q-learning variant is shown to be particularly effective in optimizing our interpretable actuation policy, resulting in up to 19.4% reduced vehicles delay compared to commonly deployed actuated signal controllers.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 |
| Editors | Bo An, Amal El Fallah Seghrouchni, Gita Sukthankar |
| Publisher | International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS) |
| Pages | 88-96 |
| Number of pages | 9 |
| ISBN (Electronic) | 9781450375184 |
| State | Published - 1 Jan 2020 |
| Externally published | Yes |
| Event | 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 - Virtual, Auckland, New Zealand Duration: 19 May 2020 → … |
Publication series
| Name | Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS |
|---|---|
| Volume | 2020-May |
| ISSN (Print) | 1548-8403 |
| ISSN (Electronic) | 1558-2914 |
Conference
| Conference | 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 |
|---|---|
| Country/Territory | New Zealand |
| City | Virtual, Auckland |
| Period | 19/05/20 → … |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 11 Sustainable Cities and Communities
Keywords
- Deep reinforcement learning
- Intelligent transportation
- Interpretable
ASJC Scopus subject areas
- Artificial Intelligence
- Software
- Control and Systems Engineering
Fingerprint
Dive into the research topics of 'Learning an interpretable traffic signal control policy'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver