SECRM-2D：基于rl的高效舒适路线跟随自动驾驶与安全分析保证

IF 2.5 4区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

IET Intelligent Transport Systems Pub Date : 2025-03-10 DOI:10.1049/itr2.70013

Tianyu Shi, Ilia Smirnov, Omar ElSamadisy, Baher Abdulhai

{"title":"SECRM-2D：基于rl的高效舒适路线跟随自动驾驶与安全分析保证","authors":"Tianyu Shi, Ilia Smirnov, Omar ElSamadisy, Baher Abdulhai","doi":"10.1049/itr2.70013","DOIUrl":null,"url":null,"abstract":"<p>Over the last decade, there has been increasing interest in autonomous driving systems. Reinforcement learning (RL) shows great promise for training autonomous driving controllers, being able to directly optimize a combination of criteria such as efficiency comfort, and stability. However, RL-based controllers typically offer no safety guarantees, making their readiness for real deployment questionable. In this paper, we propose SECRM-2D (the safe, efficient and comfortable RL-based driving model with lane-changing), an RL autonomous driving controller (both longitudinal and lateral) that balances optimization of efficiency and comfort and follows a fixed route, while being subject to hard analytic safety constraints. The aforementioned safety constraints are derived from the criterion that the follower vehicle must have sufficient headway to be able to avoid a crash if the leader vehicle brakes suddenly. We evaluate SECRM-2D against several learning and non-learning baselines in simulated test scenarios, including freeway driving, exiting, merging, and emergency braking. Our results confirm that representative previously published RL AV controllers may crash in both training and testing, even if they are optimizing a safety objective. By contrast, our controller SECRM-2D is successful in avoiding crashes during both training and testing, improves over the baselines in measures of efficiency and comfort, and is more faithful in following the prescribed route. In addition, we achieve a good theoretical understanding of the longitudinal steady-state of a collection of SECRM-2D vehicles.</p>","PeriodicalId":50381,"journal":{"name":"IET Intelligent Transport Systems","volume":"19 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/itr2.70013","citationCount":"0","resultStr":"{\"title\":\"SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving With Analytic Safety Guarantees\",\"authors\":\"Tianyu Shi, Ilia Smirnov, Omar ElSamadisy, Baher Abdulhai\",\"doi\":\"10.1049/itr2.70013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Over the last decade, there has been increasing interest in autonomous driving systems. Reinforcement learning (RL) shows great promise for training autonomous driving controllers, being able to directly optimize a combination of criteria such as efficiency comfort, and stability. However, RL-based controllers typically offer no safety guarantees, making their readiness for real deployment questionable. In this paper, we propose SECRM-2D (the safe, efficient and comfortable RL-based driving model with lane-changing), an RL autonomous driving controller (both longitudinal and lateral) that balances optimization of efficiency and comfort and follows a fixed route, while being subject to hard analytic safety constraints. The aforementioned safety constraints are derived from the criterion that the follower vehicle must have sufficient headway to be able to avoid a crash if the leader vehicle brakes suddenly. We evaluate SECRM-2D against several learning and non-learning baselines in simulated test scenarios, including freeway driving, exiting, merging, and emergency braking. Our results confirm that representative previously published RL AV controllers may crash in both training and testing, even if they are optimizing a safety objective. By contrast, our controller SECRM-2D is successful in avoiding crashes during both training and testing, improves over the baselines in measures of efficiency and comfort, and is more faithful in following the prescribed route. In addition, we achieve a good theoretical understanding of the longitudinal steady-state of a collection of SECRM-2D vehicles.</p>\",\"PeriodicalId\":50381,\"journal\":{\"name\":\"IET Intelligent Transport Systems\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/itr2.70013\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Intelligent Transport Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/itr2.70013\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Intelligent Transport Systems","FirstCategoryId":"5","ListUrlMain":"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/itr2.70013","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

在过去的十年里，人们对自动驾驶系统的兴趣越来越大。强化学习（RL）在训练自动驾驶控制器方面显示出巨大的前景，能够直接优化效率、舒适性和稳定性等标准的组合。然而，基于rl的控制器通常不提供安全保证，这使得它们对实际部署的准备工作值得怀疑。在本文中，我们提出了SECRM-2D（安全，高效和舒适的基于RL的驾驶模型和变道），这是一种平衡效率和舒适性优化的RL自动驾驶控制器（纵向和横向），遵循固定路线，同时受到硬分析安全约束。上述安全约束来源于以下准则：如果前导车辆突然刹车，跟随车辆必须有足够的车头时距以避免碰撞。我们在模拟测试场景（包括高速公路驾驶、出口、合并和紧急制动）中，针对几种学习基线和非学习基线对SECRM-2D进行了评估。我们的研究结果证实，即使在优化安全目标时，具有代表性的RL AV控制器也可能在训练和测试中崩溃。相比之下，我们的控制器SECRM-2D在训练和测试期间都成功地避免了碰撞，在效率和舒适度方面比基线有所提高，并且更忠实地遵循规定的路线。此外，我们对一组SECRM-2D车辆的纵向稳态有了很好的理论理解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving With Analytic Safety Guarantees

查看原文本刊更多论文

SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving With Analytic Safety Guarantees

Over the last decade, there has been increasing interest in autonomous driving systems. Reinforcement learning (RL) shows great promise for training autonomous driving controllers, being able to directly optimize a combination of criteria such as efficiency comfort, and stability. However, RL-based controllers typically offer no safety guarantees, making their readiness for real deployment questionable. In this paper, we propose SECRM-2D (the safe, efficient and comfortable RL-based driving model with lane-changing), an RL autonomous driving controller (both longitudinal and lateral) that balances optimization of efficiency and comfort and follows a fixed route, while being subject to hard analytic safety constraints. The aforementioned safety constraints are derived from the criterion that the follower vehicle must have sufficient headway to be able to avoid a crash if the leader vehicle brakes suddenly. We evaluate SECRM-2D against several learning and non-learning baselines in simulated test scenarios, including freeway driving, exiting, merging, and emergency braking. Our results confirm that representative previously published RL AV controllers may crash in both training and testing, even if they are optimizing a safety objective. By contrast, our controller SECRM-2D is successful in avoiding crashes during both training and testing, improves over the baselines in measures of efficiency and comfort, and is more faithful in following the prescribed route. In addition, we achieve a good theoretical understanding of the longitudinal steady-state of a collection of SECRM-2D vehicles.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IET Intelligent Transport Systems 工程技术-运输科技

CiteScore

6.50

自引率

7.40%

发文量

159

审稿时长

3 months

期刊介绍： IET Intelligent Transport Systems is an interdisciplinary journal devoted to research into the practical applications of ITS and infrastructures. The scope of the journal includes the following: Sustainable traffic solutions Deployments with enabling technologies Pervasive monitoring Applications; demonstrations and evaluation Economic and behavioural analyses of ITS services and scenario Data Integration and analytics Information collection and processing; image processing applications in ITS ITS aspects of electric vehicles Autonomous vehicles; connected vehicle systems; In-vehicle ITS, safety and vulnerable road user aspects Mobility as a service systems Traffic management and control Public transport systems technologies Fleet and public transport logistics Emergency and incident management Demand management and electronic payment systems Traffic related air pollution management Policy and institutional issues Interoperability, standards and architectures Funding scenarios Enforcement Human machine interaction Education, training and outreach Current Special Issue Call for papers: Intelligent Transportation Systems in Smart Cities for Sustainable Environment - https://digital-library.theiet.org/files/IET_ITS_CFP_ITSSCSE.pdf Sustainably Intelligent Mobility (SIM) - https://digital-library.theiet.org/files/IET_ITS_CFP_SIM.pdf Traffic Theory and Modelling in the Era of Artificial Intelligence and Big Data (in collaboration with World Congress for Transport Research, WCTR 2019) - https://digital-library.theiet.org/files/IET_ITS_CFP_WCTR.pdf