An Guo, Yuan Zhou, Haoxiang Tian, Chunrong Fang, Yunjian Sun, Weisong Sun, Xinyu Gao, Anh Tuan Luu, Yang Liu, Zhenyu Chen
{"title":"SoVAR: Building Generalizable Scenarios from Accident Reports for Autonomous Driving Testing","authors":"An Guo, Yuan Zhou, Haoxiang Tian, Chunrong Fang, Yunjian Sun, Weisong Sun, Xinyu Gao, Anh Tuan Luu, Yang Liu, Zhenyu Chen","doi":"arxiv-2409.08081","DOIUrl":null,"url":null,"abstract":"Autonomous driving systems (ADSs) have undergone remarkable development and\nare increasingly employed in safety-critical applications. However, recently\nreported data on fatal accidents involving ADSs suggests that the desired level\nof safety has not yet been fully achieved. Consequently, there is a growing\nneed for more comprehensive and targeted testing approaches to ensure safe\ndriving. Scenarios from real-world accident reports provide valuable resources\nfor ADS testing, including critical scenarios and high-quality seeds. However,\nexisting scenario reconstruction methods from accident reports often exhibit\nlimited accuracy in information extraction. Moreover, due to the diversity and\ncomplexity of road environments, matching current accident information with the\nsimulation map data for reconstruction poses significant challenges. In this\npaper, we design and implement SoVAR, a tool for automatically generating\nroad-generalizable scenarios from accident reports. SoVAR utilizes\nwell-designed prompts with linguistic patterns to guide the large language\nmodel in extracting accident information from textual data. Subsequently, it\nformulates and solves accident-related constraints in conjunction with the\nextracted accident information to generate accident trajectories. Finally,\nSoVAR reconstructs accident scenarios on various map structures and converts\nthem into test scenarios to evaluate its capability to detect defects in\nindustrial ADSs. We experiment with SoVAR, using accident reports from the\nNational Highway Traffic Safety Administration's database to generate test\nscenarios for the industrial-grade ADS Apollo. The experimental findings\ndemonstrate that SoVAR can effectively generate generalized accident scenarios\nacross different road structures. Furthermore, the results confirm that SoVAR\nidentified 5 distinct safety violation types that contributed to the crash of\nBaidu Apollo.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08081","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Autonomous driving systems (ADSs) have undergone remarkable development and
are increasingly employed in safety-critical applications. However, recently
reported data on fatal accidents involving ADSs suggests that the desired level
of safety has not yet been fully achieved. Consequently, there is a growing
need for more comprehensive and targeted testing approaches to ensure safe
driving. Scenarios from real-world accident reports provide valuable resources
for ADS testing, including critical scenarios and high-quality seeds. However,
existing scenario reconstruction methods from accident reports often exhibit
limited accuracy in information extraction. Moreover, due to the diversity and
complexity of road environments, matching current accident information with the
simulation map data for reconstruction poses significant challenges. In this
paper, we design and implement SoVAR, a tool for automatically generating
road-generalizable scenarios from accident reports. SoVAR utilizes
well-designed prompts with linguistic patterns to guide the large language
model in extracting accident information from textual data. Subsequently, it
formulates and solves accident-related constraints in conjunction with the
extracted accident information to generate accident trajectories. Finally,
SoVAR reconstructs accident scenarios on various map structures and converts
them into test scenarios to evaluate its capability to detect defects in
industrial ADSs. We experiment with SoVAR, using accident reports from the
National Highway Traffic Safety Administration's database to generate test
scenarios for the industrial-grade ADS Apollo. The experimental findings
demonstrate that SoVAR can effectively generate generalized accident scenarios
across different road structures. Furthermore, the results confirm that SoVAR
identified 5 distinct safety violation types that contributed to the crash of
Baidu Apollo.