Hongqing Chu , Heng Wang , Yifan Cheng , Aoyong Li , Wei Tian , Bingzhao Gao , Hong Chen
{"title":"Decision making for autonomous vehicles: A mixed curriculum reinforcement learning approach and a novel safety intervention method","authors":"Hongqing Chu , Heng Wang , Yifan Cheng , Aoyong Li , Wei Tian , Bingzhao Gao , Hong Chen","doi":"10.1016/j.trc.2025.105369","DOIUrl":null,"url":null,"abstract":"<div><div>Reinforcement learning is considered one of the most promising approaches for decision-making in autonomous vehicles within interactive scenarios. However, its implementation faces challenges of insufficient safety and limited learning efficiency due to the stochastic nature of exploration and the complexity of the exploration space. In this paper, a mixed curriculum learning (MCL) approach, incorporating an intervention method called discrepancy-directed Bernoulli intervention (DDBI), is proposed to address these challenges in reinforcement learning. Firstly, the algorithm divides the training process into a safety phase and a performance phase. The agent focuses on accomplishing the safety task first, which is followed by the performance task. Secondly, DDBI introduces an additional safety agent to intervene in hazardous situations using a novel probability-based method, thereby enhancing the safety of the training process while preserving the exploratory nature of reinforcement learning. Finally, the proposed approach is evaluated in a lane change scenario with random traffic flow. Comprehensive comparative experiments with other algorithms demonstrate that the proposed approach outperforms in both safety and learning efficiency.</div></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":"181 ","pages":"Article 105369"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X25003730","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Reinforcement learning is considered one of the most promising approaches for decision-making in autonomous vehicles within interactive scenarios. However, its implementation faces challenges of insufficient safety and limited learning efficiency due to the stochastic nature of exploration and the complexity of the exploration space. In this paper, a mixed curriculum learning (MCL) approach, incorporating an intervention method called discrepancy-directed Bernoulli intervention (DDBI), is proposed to address these challenges in reinforcement learning. Firstly, the algorithm divides the training process into a safety phase and a performance phase. The agent focuses on accomplishing the safety task first, which is followed by the performance task. Secondly, DDBI introduces an additional safety agent to intervene in hazardous situations using a novel probability-based method, thereby enhancing the safety of the training process while preserving the exploratory nature of reinforcement learning. Finally, the proposed approach is evaluated in a lane change scenario with random traffic flow. Comprehensive comparative experiments with other algorithms demonstrate that the proposed approach outperforms in both safety and learning efficiency.
期刊介绍:
Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.