Enhancing the Robustness of Traffic Signal Control with StageLight: A Multiscale Learning Approach

Eng Pub Date : 2024-01-08 DOI:10.3390/eng5010007
Gang Su, Jidong J. Yang
{"title":"Enhancing the Robustness of Traffic Signal Control with StageLight: A Multiscale Learning Approach","authors":"Gang Su, Jidong J. Yang","doi":"10.3390/eng5010007","DOIUrl":null,"url":null,"abstract":"The continuous evolution of artificial intelligence and cyber–physical systems has presented promising opportunities for optimizing traffic signal control in densely populated urban areas, with the aim of alleviating traffic congestion. One area that has garnered significant interest from both researchers and practitioners is the application of deep reinforcement learning (DRL) in traffic signal control. However, DRL-based algorithms often suffer from instability due to the dynamic nature of traffic flows. Discrepancies between the environments used for training and those encountered during deployment often lead to operational failures. Moreover, conventional DRL-based traffic signal control algorithms tend to reveal vulnerabilities when faced with unforeseen events, such as sensor failure. These challenges highlight the need for innovative solutions to enhance the robustness and adaptability of such systems. To address these pertinent issues, this paper introduces StageLight, a novel two-stage multiscale learning approach, which involves learning optimal timings on a coarse time scale in stage 1, while finetuning them on a finer time scale in stage 2. Our experimental results demonstrate StageLight’s remarkable capability to generalize across diverse traffic conditions and its robustness to various sensor-failure scenarios.","PeriodicalId":502660,"journal":{"name":"Eng","volume":"59 22","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Eng","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/eng5010007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The continuous evolution of artificial intelligence and cyber–physical systems has presented promising opportunities for optimizing traffic signal control in densely populated urban areas, with the aim of alleviating traffic congestion. One area that has garnered significant interest from both researchers and practitioners is the application of deep reinforcement learning (DRL) in traffic signal control. However, DRL-based algorithms often suffer from instability due to the dynamic nature of traffic flows. Discrepancies between the environments used for training and those encountered during deployment often lead to operational failures. Moreover, conventional DRL-based traffic signal control algorithms tend to reveal vulnerabilities when faced with unforeseen events, such as sensor failure. These challenges highlight the need for innovative solutions to enhance the robustness and adaptability of such systems. To address these pertinent issues, this paper introduces StageLight, a novel two-stage multiscale learning approach, which involves learning optimal timings on a coarse time scale in stage 1, while finetuning them on a finer time scale in stage 2. Our experimental results demonstrate StageLight’s remarkable capability to generalize across diverse traffic conditions and its robustness to various sensor-failure scenarios.
利用 StageLight 增强交通信号控制的鲁棒性:多尺度学习法
人工智能和网络物理系统的不断发展,为优化人口稠密城市地区的交通信号控制提供了大有可为的机会,从而达到缓解交通拥堵的目的。其中,深度强化学习(DRL)在交通信号控制中的应用引起了研究人员和从业人员的极大兴趣。然而,由于交通流的动态性质,基于 DRL 的算法往往存在不稳定性。用于训练的环境与部署过程中遇到的环境之间的差异往往会导致运行失败。此外,基于 DRL 的传统交通信号控制算法在遇到传感器故障等意外事件时往往会暴露出漏洞。这些挑战凸显了对创新解决方案的需求,以增强此类系统的鲁棒性和适应性。为了解决这些相关问题,本文介绍了一种新颖的两阶段多尺度学习方法 StageLight,即在第一阶段学习粗时间尺度上的最佳定时,同时在第二阶段对其进行更细时间尺度上的微调。我们的实验结果表明,StageLight 在各种交通条件下都具有卓越的泛化能力,并且对各种传感器故障情况都具有鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Eng
Eng
CiteScore
2.10
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信