一种改进的带有位置约束的模糊行为-评价学习方法

Proceedings of the 2021 6th International Conference on Systems, Control and Communications Pub Date : 2021-10-15 DOI:10.1145/3510362.3510364

Wuyi Luo, Jun Zhang, Xuhui Huang, Zhaolei Wang, Chenhui Jia

{"title":"一种改进的带有位置约束的模糊行为-评价学习方法","authors":"Wuyi Luo, Jun Zhang, Xuhui Huang, Zhaolei Wang, Chenhui Jia","doi":"10.1145/3510362.3510364","DOIUrl":null,"url":null,"abstract":"In the last decades, path planning with position constraints attracts many attentions. In this paper, we propose an innovative approach named improved fuzzy actor-critic learning (IFACL) to solve this problem without modelling the map containing obstacles and complex calculation. Specifically, only the initial position, target position and obstacle position are needed as inputs for the algorithm to learn a desired path. Based on FACL, a penalty factor is added to enable agents obtaining the ability to avoid obstacles through punishing agents when exceeding position constraints. Then, to optimize the path planned, an excessive coordinate point which can be updated iteratively during the training process is utilized to calculate the reward with penalty factor. The simulation results prove the superiority and effectiveness of this algorithm in different scenarios with regular obstacles and hypothetical irregular obstacles. Due to the flexibility of FACL, this approach may be easily extended to path planning with velocity constraints and dynamic constraints.","PeriodicalId":407010,"journal":{"name":"Proceedings of the 2021 6th International Conference on Systems, Control and Communications","volume":"29 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Improved Fuzzy Actor-critic Learning Approach for Path Planning with Position Constraints\",\"authors\":\"Wuyi Luo, Jun Zhang, Xuhui Huang, Zhaolei Wang, Chenhui Jia\",\"doi\":\"10.1145/3510362.3510364\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the last decades, path planning with position constraints attracts many attentions. In this paper, we propose an innovative approach named improved fuzzy actor-critic learning (IFACL) to solve this problem without modelling the map containing obstacles and complex calculation. Specifically, only the initial position, target position and obstacle position are needed as inputs for the algorithm to learn a desired path. Based on FACL, a penalty factor is added to enable agents obtaining the ability to avoid obstacles through punishing agents when exceeding position constraints. Then, to optimize the path planned, an excessive coordinate point which can be updated iteratively during the training process is utilized to calculate the reward with penalty factor. The simulation results prove the superiority and effectiveness of this algorithm in different scenarios with regular obstacles and hypothetical irregular obstacles. Due to the flexibility of FACL, this approach may be easily extended to path planning with velocity constraints and dynamic constraints.\",\"PeriodicalId\":407010,\"journal\":{\"name\":\"Proceedings of the 2021 6th International Conference on Systems, Control and Communications\",\"volume\":\"29 6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2021 6th International Conference on Systems, Control and Communications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3510362.3510364\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 6th International Conference on Systems, Control and Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3510362.3510364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在过去的几十年中，具有位置约束的路径规划受到了广泛的关注。在本文中，我们提出了一种创新的方法，称为改进模糊行为-批评学习(IFACL)来解决这个问题，而不需要对包含障碍物的地图进行建模和复杂的计算。具体来说，算法只需要初始位置、目标位置和障碍物位置作为输入，就可以学习到期望路径。在FACL的基础上，增加了惩罚因子，使agent在超过位置约束时通过惩罚agent获得避障能力。然后，利用训练过程中可迭代更新的多余坐标点计算带惩罚因子的奖励，对规划的路径进行优化。仿真结果证明了该算法在规则障碍物和假设不规则障碍物情况下的优越性和有效性。由于FACL的灵活性，该方法可以很容易地扩展到具有速度约束和动态约束的路径规划。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An Improved Fuzzy Actor-critic Learning Approach for Path Planning with Position Constraints

In the last decades, path planning with position constraints attracts many attentions. In this paper, we propose an innovative approach named improved fuzzy actor-critic learning (IFACL) to solve this problem without modelling the map containing obstacles and complex calculation. Specifically, only the initial position, target position and obstacle position are needed as inputs for the algorithm to learn a desired path. Based on FACL, a penalty factor is added to enable agents obtaining the ability to avoid obstacles through punishing agents when exceeding position constraints. Then, to optimize the path planned, an excessive coordinate point which can be updated iteratively during the training process is utilized to calculate the reward with penalty factor. The simulation results prove the superiority and effectiveness of this algorithm in different scenarios with regular obstacles and hypothetical irregular obstacles. Due to the flexibility of FACL, this approach may be easily extended to path planning with velocity constraints and dynamic constraints.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2021 6th International Conference on Systems, Control and Communications

自引率

0.00%

发文量