基于共识的无人机差异化编队控制强化学习方法

IF 7.1 2区计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Transactions on Vehicular Technology Pub Date : 2025-02-14 DOI:10.1109/TVT.2024.3473969

Xingyu He;DongXu Luo;Zekun Chen;Guisong Yang;Yunhuai Liu

{"title":"基于共识的无人机差异化编队控制强化学习方法","authors":"Xingyu He;DongXu Luo;Zekun Chen;Guisong Yang;Yunhuai Liu","doi":"10.1109/TVT.2024.3473969","DOIUrl":null,"url":null,"abstract":"Obstacle avoidance is the crux of formation control for a UAV swarm, no matter in urban or wild application environments. But there is a dilemma to promote both of obstacle avoidance flexibility and consistency in UAV formation control, which has not been solved effectively by existing researches. In view of this, this paper proposes a consensus based reinforcement learning method for differentiated formation control (DFC) of UAVs, to balance the obstacle avoidance flexibility and consistency. In this method, to promote obstacle avoidance flexibility of a UAV swarm, a consensus mechanism with differentiated formation control strategies is designed, it allows each UAV in a swarm changes its formation control strategy among aggregation, formation keeping and obstacle avoidance according to its local environment, and calculates its own current subgoal based on the selected strategy. Further, to improve the flight efficiency of the UAV swarm, a reinforcement learning model is provided to generate the optimal offset vector for each UAV according to its current subgoal. Moreover, to enhance the obstacle avoidance consistency of the UAV swarm, a collaborative obstacle avoidance algorithm is designed in the obstacle avoidance strategy, it requires UAVs to share their obstacle information and obstacle avoidance actions, and provides obstacle avoidance consensus rules to help UAVs to choose consistent obstacle avoidance directions. The experiment results show that the proposed method can combine obstacle avoidance flexibility and consistency of UAVs, thereby achieving higher flight efficiency and maintaining stable network connectivity.","PeriodicalId":13421,"journal":{"name":"IEEE Transactions on Vehicular Technology","volume":"74 2","pages":"2429-2442"},"PeriodicalIF":7.1000,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Consensus Based Reinforcement Learning Method for Differentiated Formation Control of UAVs\",\"authors\":\"Xingyu He;DongXu Luo;Zekun Chen;Guisong Yang;Yunhuai Liu\",\"doi\":\"10.1109/TVT.2024.3473969\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Obstacle avoidance is the crux of formation control for a UAV swarm, no matter in urban or wild application environments. But there is a dilemma to promote both of obstacle avoidance flexibility and consistency in UAV formation control, which has not been solved effectively by existing researches. In view of this, this paper proposes a consensus based reinforcement learning method for differentiated formation control (DFC) of UAVs, to balance the obstacle avoidance flexibility and consistency. In this method, to promote obstacle avoidance flexibility of a UAV swarm, a consensus mechanism with differentiated formation control strategies is designed, it allows each UAV in a swarm changes its formation control strategy among aggregation, formation keeping and obstacle avoidance according to its local environment, and calculates its own current subgoal based on the selected strategy. Further, to improve the flight efficiency of the UAV swarm, a reinforcement learning model is provided to generate the optimal offset vector for each UAV according to its current subgoal. Moreover, to enhance the obstacle avoidance consistency of the UAV swarm, a collaborative obstacle avoidance algorithm is designed in the obstacle avoidance strategy, it requires UAVs to share their obstacle information and obstacle avoidance actions, and provides obstacle avoidance consensus rules to help UAVs to choose consistent obstacle avoidance directions. The experiment results show that the proposed method can combine obstacle avoidance flexibility and consistency of UAVs, thereby achieving higher flight efficiency and maintaining stable network connectivity.\",\"PeriodicalId\":13421,\"journal\":{\"name\":\"IEEE Transactions on Vehicular Technology\",\"volume\":\"74 2\",\"pages\":\"2429-2442\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2025-02-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Vehicular Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10887374/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Vehicular Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10887374/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

无论在城市还是野外环境下，避障是无人机编队控制的关键。但在无人机编队控制中存在着既要兼顾避障灵活性又要兼顾一致性的两难问题，现有研究尚未有效解决。鉴于此，本文提出了一种基于共识的无人机差异化编队控制（DFC）强化学习方法，以平衡避障灵活性和一致性。该方法为提高无人机群避障的灵活性，设计了一种具有差异化编队控制策略的共识机制，允许每架无人机根据其所在的局部环境，在聚集、编队保持和避障之间改变编队控制策略，并根据所选择的策略计算出自己当前的子目标。为了提高无人机群的飞行效率，提出了一种强化学习模型，根据每架无人机当前的子目标生成最优偏移向量。此外，为了增强无人机群避障一致性，在避障策略中设计了一种协同避障算法，要求无人机共享其障碍物信息和避障动作，并提供避障共识规则，帮助无人机选择一致的避障方向。实验结果表明，该方法可以将无人机避障灵活性和一致性结合起来，从而实现更高的飞行效率，并保持稳定的网络连通性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Consensus Based Reinforcement Learning Method for Differentiated Formation Control of UAVs

Obstacle avoidance is the crux of formation control for a UAV swarm, no matter in urban or wild application environments. But there is a dilemma to promote both of obstacle avoidance flexibility and consistency in UAV formation control, which has not been solved effectively by existing researches. In view of this, this paper proposes a consensus based reinforcement learning method for differentiated formation control (DFC) of UAVs, to balance the obstacle avoidance flexibility and consistency. In this method, to promote obstacle avoidance flexibility of a UAV swarm, a consensus mechanism with differentiated formation control strategies is designed, it allows each UAV in a swarm changes its formation control strategy among aggregation, formation keeping and obstacle avoidance according to its local environment, and calculates its own current subgoal based on the selected strategy. Further, to improve the flight efficiency of the UAV swarm, a reinforcement learning model is provided to generate the optimal offset vector for each UAV according to its current subgoal. Moreover, to enhance the obstacle avoidance consistency of the UAV swarm, a collaborative obstacle avoidance algorithm is designed in the obstacle avoidance strategy, it requires UAVs to share their obstacle information and obstacle avoidance actions, and provides obstacle avoidance consensus rules to help UAVs to choose consistent obstacle avoidance directions. The experiment results show that the proposed method can combine obstacle avoidance flexibility and consistency of UAVs, thereby achieving higher flight efficiency and maintaining stable network connectivity.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Vehicular Technology 工程技术-电信学

CiteScore

6.00

自引率

8.80%

发文量

1245

审稿时长

6.3 months

期刊介绍： The scope of the Transactions is threefold (which was approved by the IEEE Periodicals Committee in 1967) and is published on the journal website as follows: Communications: The use of mobile radio on land, sea, and air, including cellular radio, two-way radio, and one-way radio, with applications to dispatch and control vehicles, mobile radiotelephone, radio paging, and status monitoring and reporting. Related areas include spectrum usage, component radio equipment such as cavities and antennas, compute control for radio systems, digital modulation and transmission techniques, mobile radio circuit design, radio propagation for vehicular communications, effects of ignition noise and radio frequency interference, and consideration of the vehicle as part of the radio operating environment. Transportation Systems: The use of electronic technology for the control of ground transportation systems including, but not limited to, traffic aid systems; traffic control systems; automatic vehicle identification, location, and monitoring systems; automated transport systems, with single and multiple vehicle control; and moving walkways or people-movers. Vehicular Electronics: The use of electronic or electrical components and systems for control, propulsion, or auxiliary functions, including but not limited to, electronic controls for engineer, drive train, convenience, safety, and other vehicle systems; sensors, actuators, and microprocessors for onboard use; electronic fuel control systems; vehicle electrical components and systems collision avoidance systems; electromagnetic compatibility in the vehicle environment; and electric vehicles and controls.