{"title":"基于深度强化学习的静态和动态环境四旋翼无地图导航","authors":"Tsung-Hsi Tsai, Qing Li","doi":"10.1109/IAI53119.2021.9619200","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a mapless autonomous navigation planner which plans a collision-free trajectory for quadrotor without any manual operations. Deep Reinforcement Learning (DRL) can optimize the policy by trial and error without knowing the prior information of the environment. The designed reward function has better convergence which compares to the benchmark method. The learned policy makes a real time collision free trajectory which can cope with the dynamic obstacles under different scenarios. The evaluation result shows that the trained model can be applied directly to the unknown environment without retraining the agent.","PeriodicalId":106675,"journal":{"name":"2021 3rd International Conference on Industrial Artificial Intelligence (IAI)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Quadrotor Mapless Navigation in Static and Dynamic Environments based on Deep Reinforcement Learning\",\"authors\":\"Tsung-Hsi Tsai, Qing Li\",\"doi\":\"10.1109/IAI53119.2021.9619200\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a mapless autonomous navigation planner which plans a collision-free trajectory for quadrotor without any manual operations. Deep Reinforcement Learning (DRL) can optimize the policy by trial and error without knowing the prior information of the environment. The designed reward function has better convergence which compares to the benchmark method. The learned policy makes a real time collision free trajectory which can cope with the dynamic obstacles under different scenarios. The evaluation result shows that the trained model can be applied directly to the unknown environment without retraining the agent.\",\"PeriodicalId\":106675,\"journal\":{\"name\":\"2021 3rd International Conference on Industrial Artificial Intelligence (IAI)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 3rd International Conference on Industrial Artificial Intelligence (IAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IAI53119.2021.9619200\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 3rd International Conference on Industrial Artificial Intelligence (IAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAI53119.2021.9619200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Quadrotor Mapless Navigation in Static and Dynamic Environments based on Deep Reinforcement Learning
In this paper, we propose a mapless autonomous navigation planner which plans a collision-free trajectory for quadrotor without any manual operations. Deep Reinforcement Learning (DRL) can optimize the policy by trial and error without knowing the prior information of the environment. The designed reward function has better convergence which compares to the benchmark method. The learned policy makes a real time collision free trajectory which can cope with the dynamic obstacles under different scenarios. The evaluation result shows that the trained model can be applied directly to the unknown environment without retraining the agent.