Two-step deep reinforcement learning for traffic signal control to improve pedestrian safety using connected vehicle data

IF 6.2 1区工程技术 Q1 ERGONOMICS

Accident; analysis and prevention Pub Date : 2025-09-17 DOI:10.1016/j.aap.2025.108161

A. Dian Ren , B. Gongquan Zhang , C. Fangrong Chang , D. Helai Huang

{"title":"Two-step deep reinforcement learning for traffic signal control to improve pedestrian safety using connected vehicle data","authors":"A. Dian Ren , B. Gongquan Zhang , C. Fangrong Chang , D. Helai Huang","doi":"10.1016/j.aap.2025.108161","DOIUrl":null,"url":null,"abstract":"<div><div>The primary goal of traffic signals control (TSC) is to enhance safety and protect all traffic participants. However, there exists enhancement such as increasing safety for vulnerable road users (VRUs), especially pedestrians. This study proposes a novel two-step traffic signal control framework based on deep reinforcement learning (TSDRL-TSC) to improve pedestrian safety and overall traffic efficiency at intersections. Based on advanced communication technologies of connected vehicles (CV), the TSDRL-TSC acquires the data from real-time traffic conditions and dynamically adjusts traffic signals, aiming to minimize traffic conflicts and delays of pedestrians and vehicles. In the first step, TSDRL-TSC decides whether to use traditional four-signal phases or a modified version considering the protected/prohibited right turn (PPRT) strategy based on pedestrian conditions. In the second step, TSDRL-TSC optimizes the specific control scheme through deep reinforcement learning techniques, selecting the optimal signal phases/actions for the current intersection state to obtain long-term reward returns. The reward function considers the safety and efficiency of all traffic participant, designed to balance the requirement for pedestrian safety, pedestrian efficiency, and vehicle throughput. Simulation experiments at a representative six-lane bidirectional intersection in Changsha City validate the effectiveness of the proposed method. Results demonstrate that (1) TSDRL-TSC significantly reduces pedestrian-vehicle conflicts, jaywalking incidents, and total delays compared to adaptive traffic signal control and PPRT control; (2) TSDRL-TSC presents the potential as a robust solution to enhance pedestrian safety and traffic efficiency for complex urban traffic management.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"222 ","pages":"Article 108161"},"PeriodicalIF":6.2000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457525002477","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}

引用次数: 0

Abstract

The primary goal of traffic signals control (TSC) is to enhance safety and protect all traffic participants. However, there exists enhancement such as increasing safety for vulnerable road users (VRUs), especially pedestrians. This study proposes a novel two-step traffic signal control framework based on deep reinforcement learning (TSDRL-TSC) to improve pedestrian safety and overall traffic efficiency at intersections. Based on advanced communication technologies of connected vehicles (CV), the TSDRL-TSC acquires the data from real-time traffic conditions and dynamically adjusts traffic signals, aiming to minimize traffic conflicts and delays of pedestrians and vehicles. In the first step, TSDRL-TSC decides whether to use traditional four-signal phases or a modified version considering the protected/prohibited right turn (PPRT) strategy based on pedestrian conditions. In the second step, TSDRL-TSC optimizes the specific control scheme through deep reinforcement learning techniques, selecting the optimal signal phases/actions for the current intersection state to obtain long-term reward returns. The reward function considers the safety and efficiency of all traffic participant, designed to balance the requirement for pedestrian safety, pedestrian efficiency, and vehicle throughput. Simulation experiments at a representative six-lane bidirectional intersection in Changsha City validate the effectiveness of the proposed method. Results demonstrate that (1) TSDRL-TSC significantly reduces pedestrian-vehicle conflicts, jaywalking incidents, and total delays compared to adaptive traffic signal control and PPRT control; (2) TSDRL-TSC presents the potential as a robust solution to enhance pedestrian safety and traffic efficiency for complex urban traffic management.

查看原文本刊更多论文

基于车联网数据的交通信号控制两步深度强化学习提高行人安全。

交通信号控制（TSC）的主要目的是提高交通安全，保护所有交通参与者。然而，也存在一些改进措施，例如提高弱势道路使用者（vru），特别是行人的安全。本文提出了一种新的基于深度强化学习的两步交通信号控制框架（TSDRL-TSC），以提高十字路口的行人安全和整体交通效率。TSDRL-TSC基于先进的车联网通信技术，从实时交通状况中获取数据，并动态调整交通信号，以最大限度地减少行人和车辆的交通冲突和延误。第一步，TSDRL-TSC根据行人情况，考虑保护/禁止右转（PPRT）策略，决定是使用传统的四信号相位还是修改后的四信号相位。第二步，TSDRL-TSC通过深度强化学习技术对具体控制方案进行优化，选择当前交叉口状态下最优的信号相位/动作，以获得长期奖励回报。奖励函数考虑所有交通参与者的安全和效率，旨在平衡行人安全、行人效率和车辆吞吐量的要求。在长沙市一个具有代表性的六车道双向交叉口进行了仿真实验，验证了该方法的有效性。结果表明：(1)与自适应交通信号控制和PPRT控制相比，TSDRL-TSC控制显著降低了行人与车辆冲突、乱穿马路事件和总延误；(2) TSDRL-TSC在复杂的城市交通管理中具有提高行人安全和交通效率的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Accident; analysis and prevention Multiple-

CiteScore

11.90

自引率

16.90%

发文量

264

审稿时长

48 days

期刊介绍： Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.