Ziqian Zhang , Haojie Li , Tiantian Chen , N.N. Sze , Wenzhang Yang , Yihao Zhang , Gang Ren
{"title":"Decision-making of autonomous vehicles in interactions with jaywalkers: A risk-aware deep reinforcement learning approach","authors":"Ziqian Zhang , Haojie Li , Tiantian Chen , N.N. Sze , Wenzhang Yang , Yihao Zhang , Gang Ren","doi":"10.1016/j.aap.2024.107843","DOIUrl":null,"url":null,"abstract":"<div><div>Jaywalking, as a hazardous crossing behavior, leaves little time for drivers to anticipate and respond promptly, resulting in high crossing risks. The prevalence of Autonomous Vehicle (AV) technologies has offered new solutions for mitigating jaywalking risks. In this study, we propose a risk-aware deep reinforcement learning (DRL) approach for AVs to make decisions safely and efficiently in jaywalker-vehicle interactions. Notably, a risk prediction module is incorporated into the traditional DRL framework, making the AV agent risk-aware. Considering the complexity of jaywalker-vehicle conflicts, an encoder-decoder model is adopted as the risk prediction module, which comprehensively integrates multi-source data and predicts probabilities of the final conflict severity levels. The risk-aware DRL approach is applied in a simulated environment established in Anylogic, where the motion features of jaywalkers and vehicles are calibrated using real-world survey data.</div><div>The trained driving policies are evaluated from perspectives of safety and efficiency across three scenarios with escalading levels of jaywalker volume. Regarding safety performance, the <em>Baseline</em> policy performs the worst in “medium jaywalker volume” scenario and “high jaywalker volume” scenario, while our <em>Proposed risk-aware</em> method outperforms the other methods, with the “low TTC ratio” metric stabilizing near 0.08. Moreover, as the scenario gets more complex, the superiority of our <em>Proposed risk-aware</em> policy gets more evident. In terms of efficiency performance, our <em>Proposed risk-aware</em> policy ranks the second best, achieving an “AV delay” metric around 8.1 s in the “medium jaywalker volume” scenario and 8.5 s in the “high jaywalker volume” scenario. In practice, the proposed risk-aware DRL approach can help AV agents perceive potential risks in advance and navigate through potential jaywalking areas safely and efficiently, further enhancing pedestrian safety.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"210 ","pages":"Article 107843"},"PeriodicalIF":5.7000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457524003889","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
Jaywalking, as a hazardous crossing behavior, leaves little time for drivers to anticipate and respond promptly, resulting in high crossing risks. The prevalence of Autonomous Vehicle (AV) technologies has offered new solutions for mitigating jaywalking risks. In this study, we propose a risk-aware deep reinforcement learning (DRL) approach for AVs to make decisions safely and efficiently in jaywalker-vehicle interactions. Notably, a risk prediction module is incorporated into the traditional DRL framework, making the AV agent risk-aware. Considering the complexity of jaywalker-vehicle conflicts, an encoder-decoder model is adopted as the risk prediction module, which comprehensively integrates multi-source data and predicts probabilities of the final conflict severity levels. The risk-aware DRL approach is applied in a simulated environment established in Anylogic, where the motion features of jaywalkers and vehicles are calibrated using real-world survey data.
The trained driving policies are evaluated from perspectives of safety and efficiency across three scenarios with escalading levels of jaywalker volume. Regarding safety performance, the Baseline policy performs the worst in “medium jaywalker volume” scenario and “high jaywalker volume” scenario, while our Proposed risk-aware method outperforms the other methods, with the “low TTC ratio” metric stabilizing near 0.08. Moreover, as the scenario gets more complex, the superiority of our Proposed risk-aware policy gets more evident. In terms of efficiency performance, our Proposed risk-aware policy ranks the second best, achieving an “AV delay” metric around 8.1 s in the “medium jaywalker volume” scenario and 8.5 s in the “high jaywalker volume” scenario. In practice, the proposed risk-aware DRL approach can help AV agents perceive potential risks in advance and navigate through potential jaywalking areas safely and efficiently, further enhancing pedestrian safety.
期刊介绍:
Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.