{"title":"基于深度强化学习的无人机充电任务规划","authors":"Yanfan Zhang, Hongyuan Zheng, X. Zhai","doi":"10.1145/3603781.3603897","DOIUrl":null,"url":null,"abstract":"This paper focuses on UAV mission planning in an environment with charging using deep reinforcement learning, firstly proposing an improved actor-critic algorithm and constructing a reward function related to the UAV energy as well as the mission to guide the UAV behavior to achieve the completion of the mission, as well as the penalty of the action and the smoothing of the trajectory, for improving the UAV's ability to complete the mission, and secondly adding a charging module to ensure the balance of UAV energy. The simulation results show that the method can ensure the stability of UAV energy and when the energy module is added, the UAV energy level smoothes out and the mission completion is higher than before.","PeriodicalId":391180,"journal":{"name":"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Reinforcement Learning Based UAV Mission Planning with Charging Module\",\"authors\":\"Yanfan Zhang, Hongyuan Zheng, X. Zhai\",\"doi\":\"10.1145/3603781.3603897\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper focuses on UAV mission planning in an environment with charging using deep reinforcement learning, firstly proposing an improved actor-critic algorithm and constructing a reward function related to the UAV energy as well as the mission to guide the UAV behavior to achieve the completion of the mission, as well as the penalty of the action and the smoothing of the trajectory, for improving the UAV's ability to complete the mission, and secondly adding a charging module to ensure the balance of UAV energy. The simulation results show that the method can ensure the stability of UAV energy and when the energy module is added, the UAV energy level smoothes out and the mission completion is higher than before.\",\"PeriodicalId\":391180,\"journal\":{\"name\":\"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things\",\"volume\":\"49 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3603781.3603897\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3603781.3603897","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Reinforcement Learning Based UAV Mission Planning with Charging Module
This paper focuses on UAV mission planning in an environment with charging using deep reinforcement learning, firstly proposing an improved actor-critic algorithm and constructing a reward function related to the UAV energy as well as the mission to guide the UAV behavior to achieve the completion of the mission, as well as the penalty of the action and the smoothing of the trajectory, for improving the UAV's ability to complete the mission, and secondly adding a charging module to ensure the balance of UAV energy. The simulation results show that the method can ensure the stability of UAV energy and when the energy module is added, the UAV energy level smoothes out and the mission completion is higher than before.