{"title":"Reinforcement learning based overtaking decision-making for highway autonomous driving","authors":"Xin Li, Xin Xu, L. Zuo","doi":"10.1109/ICICIP.2015.7388193","DOIUrl":null,"url":null,"abstract":"In this paper, we develop an intelligent overtaking decision-making method for highway autonomous driving. The key idea is to use reinforcement learning algorithms to learn an optimized policy via a series of simulated driving scenarios. A vehicle model based on data fitting of real vehicles as well as a traffic model is established to simulate driving scenarios and validation tests of obtained policies. Human driving experiences are considered in designing the reward function. A reinforcement learning method called the Q-learning algorithm is used to learn overtaking decision-making policies. Simulations show that our method can learn feasible overtaking policies in different traffic environments and the performance is comparable or even better than manually designed decision rules.","PeriodicalId":265426,"journal":{"name":"2015 Sixth International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"45","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Sixth International Conference on Intelligent Control and Information Processing (ICICIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICIP.2015.7388193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 45
Abstract
In this paper, we develop an intelligent overtaking decision-making method for highway autonomous driving. The key idea is to use reinforcement learning algorithms to learn an optimized policy via a series of simulated driving scenarios. A vehicle model based on data fitting of real vehicles as well as a traffic model is established to simulate driving scenarios and validation tests of obtained policies. Human driving experiences are considered in designing the reward function. A reinforcement learning method called the Q-learning algorithm is used to learn overtaking decision-making policies. Simulations show that our method can learn feasible overtaking policies in different traffic environments and the performance is comparable or even better than manually designed decision rules.