Junjie Zhou , Lin Wang , Qiang Meng , Xiaofan Wang
{"title":"Generation of naturalistic and critical boundary scenarios: A bi-level adaptive deep reinforcement learning method","authors":"Junjie Zhou , Lin Wang , Qiang Meng , Xiaofan Wang","doi":"10.1016/j.aap.2025.108269","DOIUrl":null,"url":null,"abstract":"<div><div>The complexities of real-world driving environments, coupled with a limited availability of naturalistic and critical test scenarios, have long hindered unbiased and effective comprehensive performance evaluations. In this work, we propose a bi-level adaptive deep reinforcement learning (BADRL) framework designed to generate realistic and diverse critical boundary scenarios. The method involves training AI-driven background agents to impartially assess the overall performance of autonomous vehicles. By leveraging naturalistic driving data, these agents acquire realistic driving behaviors via a neural model that encapsulates naturalistic driving patterns. To enrich the authenticity and diversity of the test scenarios, a wide array of traffic participants, encompassing vehicles, pedestrians, and bicycles, are meticulously modeled and portrayed to engage in intricate interactive behaviors with the tested autonomous vehicles. To address the challenges of high-dimensional environments, we introduce a scenario complexity model that assesses relative complexity in real time. This model enables the upper-level neural network in BADRL to dynamically escalate scenario complexity, with the resulting scenarios subsequently processed by lower-level models to optimize the actions of primary traffic participants. The BADRL method enables online real-time generation of naturalistic and critical boundary scenarios. Extensive simulation experiments validate the effectiveness of the BADRL approach in diverse driving environments, with results indicating an improvement in the efficiency of critical boundary scenario generation by approximately 15.89 % compared to state-of-the-art methods.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"223 ","pages":"Article 108269"},"PeriodicalIF":6.2000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457525003574","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
The complexities of real-world driving environments, coupled with a limited availability of naturalistic and critical test scenarios, have long hindered unbiased and effective comprehensive performance evaluations. In this work, we propose a bi-level adaptive deep reinforcement learning (BADRL) framework designed to generate realistic and diverse critical boundary scenarios. The method involves training AI-driven background agents to impartially assess the overall performance of autonomous vehicles. By leveraging naturalistic driving data, these agents acquire realistic driving behaviors via a neural model that encapsulates naturalistic driving patterns. To enrich the authenticity and diversity of the test scenarios, a wide array of traffic participants, encompassing vehicles, pedestrians, and bicycles, are meticulously modeled and portrayed to engage in intricate interactive behaviors with the tested autonomous vehicles. To address the challenges of high-dimensional environments, we introduce a scenario complexity model that assesses relative complexity in real time. This model enables the upper-level neural network in BADRL to dynamically escalate scenario complexity, with the resulting scenarios subsequently processed by lower-level models to optimize the actions of primary traffic participants. The BADRL method enables online real-time generation of naturalistic and critical boundary scenarios. Extensive simulation experiments validate the effectiveness of the BADRL approach in diverse driving environments, with results indicating an improvement in the efficiency of critical boundary scenario generation by approximately 15.89 % compared to state-of-the-art methods.
期刊介绍:
Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.