Wuyi Luo, Jun Zhang, Xuhui Huang, Zhaolei Wang, Chenhui Jia
{"title":"一种改进的带有位置约束的模糊行为-评价学习方法","authors":"Wuyi Luo, Jun Zhang, Xuhui Huang, Zhaolei Wang, Chenhui Jia","doi":"10.1145/3510362.3510364","DOIUrl":null,"url":null,"abstract":"In the last decades, path planning with position constraints attracts many attentions. In this paper, we propose an innovative approach named improved fuzzy actor-critic learning (IFACL) to solve this problem without modelling the map containing obstacles and complex calculation. Specifically, only the initial position, target position and obstacle position are needed as inputs for the algorithm to learn a desired path. Based on FACL, a penalty factor is added to enable agents obtaining the ability to avoid obstacles through punishing agents when exceeding position constraints. Then, to optimize the path planned, an excessive coordinate point which can be updated iteratively during the training process is utilized to calculate the reward with penalty factor. The simulation results prove the superiority and effectiveness of this algorithm in different scenarios with regular obstacles and hypothetical irregular obstacles. Due to the flexibility of FACL, this approach may be easily extended to path planning with velocity constraints and dynamic constraints.","PeriodicalId":407010,"journal":{"name":"Proceedings of the 2021 6th International Conference on Systems, Control and Communications","volume":"29 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Improved Fuzzy Actor-critic Learning Approach for Path Planning with Position Constraints\",\"authors\":\"Wuyi Luo, Jun Zhang, Xuhui Huang, Zhaolei Wang, Chenhui Jia\",\"doi\":\"10.1145/3510362.3510364\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the last decades, path planning with position constraints attracts many attentions. In this paper, we propose an innovative approach named improved fuzzy actor-critic learning (IFACL) to solve this problem without modelling the map containing obstacles and complex calculation. Specifically, only the initial position, target position and obstacle position are needed as inputs for the algorithm to learn a desired path. Based on FACL, a penalty factor is added to enable agents obtaining the ability to avoid obstacles through punishing agents when exceeding position constraints. Then, to optimize the path planned, an excessive coordinate point which can be updated iteratively during the training process is utilized to calculate the reward with penalty factor. The simulation results prove the superiority and effectiveness of this algorithm in different scenarios with regular obstacles and hypothetical irregular obstacles. Due to the flexibility of FACL, this approach may be easily extended to path planning with velocity constraints and dynamic constraints.\",\"PeriodicalId\":407010,\"journal\":{\"name\":\"Proceedings of the 2021 6th International Conference on Systems, Control and Communications\",\"volume\":\"29 6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2021 6th International Conference on Systems, Control and Communications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3510362.3510364\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 6th International Conference on Systems, Control and Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3510362.3510364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Improved Fuzzy Actor-critic Learning Approach for Path Planning with Position Constraints
In the last decades, path planning with position constraints attracts many attentions. In this paper, we propose an innovative approach named improved fuzzy actor-critic learning (IFACL) to solve this problem without modelling the map containing obstacles and complex calculation. Specifically, only the initial position, target position and obstacle position are needed as inputs for the algorithm to learn a desired path. Based on FACL, a penalty factor is added to enable agents obtaining the ability to avoid obstacles through punishing agents when exceeding position constraints. Then, to optimize the path planned, an excessive coordinate point which can be updated iteratively during the training process is utilized to calculate the reward with penalty factor. The simulation results prove the superiority and effectiveness of this algorithm in different scenarios with regular obstacles and hypothetical irregular obstacles. Due to the flexibility of FACL, this approach may be easily extended to path planning with velocity constraints and dynamic constraints.