El Mehdi Ben Laoula, Omar Elfahim, M. Youssfi, O. Bouattane
{"title":"基于q -学习算法的复杂环境下无人机路径优化","authors":"El Mehdi Ben Laoula, Omar Elfahim, M. Youssfi, O. Bouattane","doi":"10.1109/ISCV54655.2022.9806077","DOIUrl":null,"url":null,"abstract":"Path planning of intelligent agents in an emergency context is one of the most popular issues within nowadays context. This work proposes an environment acquisition and a path optimization solution based on reinforcement learning. The proposed solution implements Q-Learning algorithm and enables the agent to choose the path that maximizes the reward and minimizes the penalty. When tested in an experiment grid and compared to other solutions the proposed solution proved to be more stable and more efficient.","PeriodicalId":426665,"journal":{"name":"2022 International Conference on Intelligent Systems and Computer Vision (ISCV)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Drone path optimization in complex environment based on Q-learning algorithm\",\"authors\":\"El Mehdi Ben Laoula, Omar Elfahim, M. Youssfi, O. Bouattane\",\"doi\":\"10.1109/ISCV54655.2022.9806077\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Path planning of intelligent agents in an emergency context is one of the most popular issues within nowadays context. This work proposes an environment acquisition and a path optimization solution based on reinforcement learning. The proposed solution implements Q-Learning algorithm and enables the agent to choose the path that maximizes the reward and minimizes the penalty. When tested in an experiment grid and compared to other solutions the proposed solution proved to be more stable and more efficient.\",\"PeriodicalId\":426665,\"journal\":{\"name\":\"2022 International Conference on Intelligent Systems and Computer Vision (ISCV)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Intelligent Systems and Computer Vision (ISCV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCV54655.2022.9806077\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Intelligent Systems and Computer Vision (ISCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCV54655.2022.9806077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Drone path optimization in complex environment based on Q-learning algorithm
Path planning of intelligent agents in an emergency context is one of the most popular issues within nowadays context. This work proposes an environment acquisition and a path optimization solution based on reinforcement learning. The proposed solution implements Q-Learning algorithm and enables the agent to choose the path that maximizes the reward and minimizes the penalty. When tested in an experiment grid and compared to other solutions the proposed solution proved to be more stable and more efficient.