{"title":"基于强化学习的高速公路自动驾驶超车决策","authors":"Xin Li, Xin Xu, L. Zuo","doi":"10.1109/ICICIP.2015.7388193","DOIUrl":null,"url":null,"abstract":"In this paper, we develop an intelligent overtaking decision-making method for highway autonomous driving. The key idea is to use reinforcement learning algorithms to learn an optimized policy via a series of simulated driving scenarios. A vehicle model based on data fitting of real vehicles as well as a traffic model is established to simulate driving scenarios and validation tests of obtained policies. Human driving experiences are considered in designing the reward function. A reinforcement learning method called the Q-learning algorithm is used to learn overtaking decision-making policies. Simulations show that our method can learn feasible overtaking policies in different traffic environments and the performance is comparable or even better than manually designed decision rules.","PeriodicalId":265426,"journal":{"name":"2015 Sixth International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"45","resultStr":"{\"title\":\"Reinforcement learning based overtaking decision-making for highway autonomous driving\",\"authors\":\"Xin Li, Xin Xu, L. Zuo\",\"doi\":\"10.1109/ICICIP.2015.7388193\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we develop an intelligent overtaking decision-making method for highway autonomous driving. The key idea is to use reinforcement learning algorithms to learn an optimized policy via a series of simulated driving scenarios. A vehicle model based on data fitting of real vehicles as well as a traffic model is established to simulate driving scenarios and validation tests of obtained policies. Human driving experiences are considered in designing the reward function. A reinforcement learning method called the Q-learning algorithm is used to learn overtaking decision-making policies. Simulations show that our method can learn feasible overtaking policies in different traffic environments and the performance is comparable or even better than manually designed decision rules.\",\"PeriodicalId\":265426,\"journal\":{\"name\":\"2015 Sixth International Conference on Intelligent Control and Information Processing (ICICIP)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"45\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 Sixth International Conference on Intelligent Control and Information Processing (ICICIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICICIP.2015.7388193\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Sixth International Conference on Intelligent Control and Information Processing (ICICIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICIP.2015.7388193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement learning based overtaking decision-making for highway autonomous driving
In this paper, we develop an intelligent overtaking decision-making method for highway autonomous driving. The key idea is to use reinforcement learning algorithms to learn an optimized policy via a series of simulated driving scenarios. A vehicle model based on data fitting of real vehicles as well as a traffic model is established to simulate driving scenarios and validation tests of obtained policies. Human driving experiences are considered in designing the reward function. A reinforcement learning method called the Q-learning algorithm is used to learn overtaking decision-making policies. Simulations show that our method can learn feasible overtaking policies in different traffic environments and the performance is comparable or even better than manually designed decision rules.