{"title":"Critical scenarios adversarial generation method for intelligent vehicles testing based on hierarchical reinforcement architecture","authors":"Bing Zhu, Rui Tang, Jian Zhao, Peixing Zhang, Wenxu Li, Xinran Cao, Siyuan Li","doi":"10.1016/j.aap.2025.108013","DOIUrl":null,"url":null,"abstract":"<div><div>The widespread deployment of intelligent vehicles necessitates comprehensive testing across diverse driving scenarios. A significant challenge is generating critical testing scenarios to accurately evaluate vehicle performance. To overcome the limitations of existing methods, including inadequate diversity and validity, this study proposes an adversarial generation method grounded on a hierarchical reinforcement learning framework. This approach comprises three modules: a hierarchical scheduling module, a conflict prediction module, and a scenario evaluation module. The hierarchical scheduling module segments the testing procedure into guidance, adversarial, and exploration periods, effectively managing reward sparsity to promote varied scenario generation. The conflict prediction module employs kinematic conflict prediction and adaptive action strategies to enhance learning speed and efficiency, directing traffic entities in producing critical scenarios. The evaluation module assesses scenario validity and diversity by analyzing relative trajectories, temporal characteristics, and spatial configurations, in addition to employing a perception-limited model and replay testing to assess performance within the system’s operational limits. Experimental results using the HighD dataset in the highway environment demonstrate that the proposed method efficiently generates varied critical test scenarios, improving the collision rate and period contributions throughout the testing process. When producing an equivalent number of critical scenarios, the overall testing resource utilization decreases by 49.49% relative to the conventional Deep Q-Network method.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"215 ","pages":"Article 108013"},"PeriodicalIF":5.7000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457525000995","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
The widespread deployment of intelligent vehicles necessitates comprehensive testing across diverse driving scenarios. A significant challenge is generating critical testing scenarios to accurately evaluate vehicle performance. To overcome the limitations of existing methods, including inadequate diversity and validity, this study proposes an adversarial generation method grounded on a hierarchical reinforcement learning framework. This approach comprises three modules: a hierarchical scheduling module, a conflict prediction module, and a scenario evaluation module. The hierarchical scheduling module segments the testing procedure into guidance, adversarial, and exploration periods, effectively managing reward sparsity to promote varied scenario generation. The conflict prediction module employs kinematic conflict prediction and adaptive action strategies to enhance learning speed and efficiency, directing traffic entities in producing critical scenarios. The evaluation module assesses scenario validity and diversity by analyzing relative trajectories, temporal characteristics, and spatial configurations, in addition to employing a perception-limited model and replay testing to assess performance within the system’s operational limits. Experimental results using the HighD dataset in the highway environment demonstrate that the proposed method efficiently generates varied critical test scenarios, improving the collision rate and period contributions throughout the testing process. When producing an equivalent number of critical scenarios, the overall testing resource utilization decreases by 49.49% relative to the conventional Deep Q-Network method.
期刊介绍:
Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.