网络约束环境下多无人机群体导航与避障研究

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Internet of Things Journal Pub Date : 2024-11-29 DOI:10.1109/JIOT.2024.3507782

Junling Shi;Guoyu Zhu;Hanyu Li;Ammar Hawbani;Jiehong Wu;Na Lin;Liang Zhao

{"title":"网络约束环境下多无人机群体导航与避障研究","authors":"Junling Shi;Guoyu Zhu;Hanyu Li;Ammar Hawbani;Jiehong Wu;Na Lin;Liang Zhao","doi":"10.1109/JIOT.2024.3507782","DOIUrl":null,"url":null,"abstract":"The flocking movement is a fundamental and crucial operation in multi-AAVs systems, encompassing navigation and obstacle avoidance. However, traditional flocking algorithms typically rely on rigid rules and exhibit limited adaptability to diverse environments. Reinforcement learning (RL) effectively addresses this issue as a flexible and model-free framework. In this article, RL techniques are utilized to achieve navigation and obstacle avoidance for a swarm of AAVs. To enhance training efficiency, we propose an improved algorithm called heuristic guides TD3 (HGTD3) by integrating heuristic guides with the twin delayed deep deterministic policy gradient (TD3), aiming to address the protracted learning periods commonly observed in traditional RL methods. Considering the network-constrained environment, we propose the negative interference flocking algorithm (NIFA): the network interference flocking algorithm and an AAV flocking algorithm designed based on the sparrow search algorithm. NIFA can guide the losing AAV to follow the swarm and at the same time maintain the overall navigation and avoidance efficiency. Finally, we demonstrate the scalability and adaptability of HGTD3-NIFA in a simulation experiment in terms of multi-AAVs flocking and navigation.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 7","pages":"8931-8946"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-AAVs Flocking for Navigation and Obstacle Avoidance in Network-Constrained Environments\",\"authors\":\"Junling Shi;Guoyu Zhu;Hanyu Li;Ammar Hawbani;Jiehong Wu;Na Lin;Liang Zhao\",\"doi\":\"10.1109/JIOT.2024.3507782\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The flocking movement is a fundamental and crucial operation in multi-AAVs systems, encompassing navigation and obstacle avoidance. However, traditional flocking algorithms typically rely on rigid rules and exhibit limited adaptability to diverse environments. Reinforcement learning (RL) effectively addresses this issue as a flexible and model-free framework. In this article, RL techniques are utilized to achieve navigation and obstacle avoidance for a swarm of AAVs. To enhance training efficiency, we propose an improved algorithm called heuristic guides TD3 (HGTD3) by integrating heuristic guides with the twin delayed deep deterministic policy gradient (TD3), aiming to address the protracted learning periods commonly observed in traditional RL methods. Considering the network-constrained environment, we propose the negative interference flocking algorithm (NIFA): the network interference flocking algorithm and an AAV flocking algorithm designed based on the sparrow search algorithm. NIFA can guide the losing AAV to follow the swarm and at the same time maintain the overall navigation and avoidance efficiency. Finally, we demonstrate the scalability and adaptability of HGTD3-NIFA in a simulation experiment in terms of multi-AAVs flocking and navigation.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 7\",\"pages\":\"8931-8946\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2024-11-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10771954/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10771954/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

在多aav系统中，群集运动是一项基本而关键的操作，包括导航和避障。然而，传统的群集算法通常依赖于严格的规则，对不同环境的适应性有限。强化学习（RL）作为一个灵活且无模型的框架有效地解决了这个问题。在本文中，RL技术被用于实现一群aav的导航和避障。为了提高训练效率，我们提出了一种改进的启发式指南TD3 （HGTD3）算法，该算法将启发式指南与双延迟深度确定性策略梯度（TD3）相结合，旨在解决传统RL方法中常见的学习周期延长的问题。考虑到网络约束环境，提出了负干扰群集算法（NIFA）：网络干扰群集算法和基于麻雀搜索算法设计的AAV群集算法。NIFA可以引导丢失的AAV跟随群体，同时保持整体的导航和回避效率。最后，通过仿真实验验证了HGTD3-NIFA在多自动驾驶飞行器群集和导航方面的可扩展性和适应性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multi-AAVs Flocking for Navigation and Obstacle Avoidance in Network-Constrained Environments

The flocking movement is a fundamental and crucial operation in multi-AAVs systems, encompassing navigation and obstacle avoidance. However, traditional flocking algorithms typically rely on rigid rules and exhibit limited adaptability to diverse environments. Reinforcement learning (RL) effectively addresses this issue as a flexible and model-free framework. In this article, RL techniques are utilized to achieve navigation and obstacle avoidance for a swarm of AAVs. To enhance training efficiency, we propose an improved algorithm called heuristic guides TD3 (HGTD3) by integrating heuristic guides with the twin delayed deep deterministic policy gradient (TD3), aiming to address the protracted learning periods commonly observed in traditional RL methods. Considering the network-constrained environment, we propose the negative interference flocking algorithm (NIFA): the network interference flocking algorithm and an AAV flocking algorithm designed based on the sparrow search algorithm. NIFA can guide the losing AAV to follow the swarm and at the same time maintain the overall navigation and avoidance efficiency. Finally, we demonstrate the scalability and adaptability of HGTD3-NIFA in a simulation experiment in terms of multi-AAVs flocking and navigation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Internet of Things Journal Computer Science-Information Systems

CiteScore

17.60

自引率

13.20%

发文量

1982

期刊介绍： The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.