{"title":"基于安全强化学习的复杂场景决策","authors":"Jie Xu, Xiaofei Pei, Kexuan Lv","doi":"10.1109/CVCI51460.2020.9338584","DOIUrl":null,"url":null,"abstract":"In recent years, machine learning is widely used in many fields. Compared with the rule-based method, machine learning plays a more excellent role in the decision-making of the autonomous vehicle. Some complex situations are often met in our daily life. To this end, Safe reinforcement learning(RL) is introduced to ensure that safer actions are selected. Constant Turn Rate and Acceleration(CTRA) model is first used to predict the future trajectories of surrounding vehicles. Then Double Deep Q-Learning(DDQN) method is used to make decisions and ensure the autonomous vehicle can move at the desired speed as much as possible. In order to achieve a safer decision-making, some safety rules are introduced. Finally, the algorithm is demonstrated in Simulation of Urban Mobility(SUMO) and has been proved to have an outstanding performance on such a complex scenario.","PeriodicalId":119721,"journal":{"name":"2020 4th CAA International Conference on Vehicular Control and Intelligence (CVCI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Decision-Making for Complex Scenario using Safe Reinforcement Learning\",\"authors\":\"Jie Xu, Xiaofei Pei, Kexuan Lv\",\"doi\":\"10.1109/CVCI51460.2020.9338584\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, machine learning is widely used in many fields. Compared with the rule-based method, machine learning plays a more excellent role in the decision-making of the autonomous vehicle. Some complex situations are often met in our daily life. To this end, Safe reinforcement learning(RL) is introduced to ensure that safer actions are selected. Constant Turn Rate and Acceleration(CTRA) model is first used to predict the future trajectories of surrounding vehicles. Then Double Deep Q-Learning(DDQN) method is used to make decisions and ensure the autonomous vehicle can move at the desired speed as much as possible. In order to achieve a safer decision-making, some safety rules are introduced. Finally, the algorithm is demonstrated in Simulation of Urban Mobility(SUMO) and has been proved to have an outstanding performance on such a complex scenario.\",\"PeriodicalId\":119721,\"journal\":{\"name\":\"2020 4th CAA International Conference on Vehicular Control and Intelligence (CVCI)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 4th CAA International Conference on Vehicular Control and Intelligence (CVCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CVCI51460.2020.9338584\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 4th CAA International Conference on Vehicular Control and Intelligence (CVCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVCI51460.2020.9338584","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Decision-Making for Complex Scenario using Safe Reinforcement Learning
In recent years, machine learning is widely used in many fields. Compared with the rule-based method, machine learning plays a more excellent role in the decision-making of the autonomous vehicle. Some complex situations are often met in our daily life. To this end, Safe reinforcement learning(RL) is introduced to ensure that safer actions are selected. Constant Turn Rate and Acceleration(CTRA) model is first used to predict the future trajectories of surrounding vehicles. Then Double Deep Q-Learning(DDQN) method is used to make decisions and ensure the autonomous vehicle can move at the desired speed as much as possible. In order to achieve a safer decision-making, some safety rules are introduced. Finally, the algorithm is demonstrated in Simulation of Urban Mobility(SUMO) and has been proved to have an outstanding performance on such a complex scenario.