空中走廊无人机控制的聚类联邦强化学习

IF 4.8 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Open Journal of Vehicular Technology Pub Date : 2025-03-26 DOI:10.1109/OJVT.2025.3573647

Meng Xiang Xuan;Liangkun Yu;Xiang Sun;Sudharman K. Jayaweera

{"title":"空中走廊无人机控制的聚类联邦强化学习","authors":"Meng Xiang Xuan;Liangkun Yu;Xiang Sun;Sudharman K. Jayaweera","doi":"10.1109/OJVT.2025.3573647","DOIUrl":null,"url":null,"abstract":"Advanced Air Mobility (AAM) aims to integrate unmanned aerial vehicles (UAVs) into urban airspace for efficient cargo and passenger transport, relying on autonomous navigation within designated 3D air corridors. Deep reinforcement learning (DRL) has demonstrated significant potential for autonomous UAV control in complex environments, particularly when trained with sufficient flight data. However, the performance of DRL-based models can degrade when real-world conditions differ from their training environments, leading to increased collision risks and boundary violations. To address this challenge, we propose CLustered fEderAted Reinforcement Learning (CLEAR), a novel approach that enables UAVs to collaboratively fine-tune their DRL models in real time using flight data from their operational environment. Unlike traditional federated reinforcement learning (FRL) frameworks that assume clients have pre-existing local datasets, CLEAR organizes UAVs into clusters, where each cluster head aggregates flight data from its members to perform local training before contributing to a global model. This adaptive learning process enhances UAV control in dynamic airspace while maintaining decentralized autonomy. Simulation results show that CLEAR significantly outperforms HTransRL—which lacks model fine-tuning—in terms of arrival rates and scalability as the number of UAVs increases. These findings underscore CLEAR's effectiveness in enabling real-time DRL adaptation, positioning it as a promising solution for robust UAV navigation in AAM ecosystems.","PeriodicalId":34270,"journal":{"name":"IEEE Open Journal of Vehicular Technology","volume":"6 ","pages":"1582-1592"},"PeriodicalIF":4.8000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11015557","citationCount":"0","resultStr":"{\"title\":\"Clustered Federated Reinforcement Learning for Autonomous UAV Control in Air Corridors\",\"authors\":\"Meng Xiang Xuan;Liangkun Yu;Xiang Sun;Sudharman K. Jayaweera\",\"doi\":\"10.1109/OJVT.2025.3573647\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Advanced Air Mobility (AAM) aims to integrate unmanned aerial vehicles (UAVs) into urban airspace for efficient cargo and passenger transport, relying on autonomous navigation within designated 3D air corridors. Deep reinforcement learning (DRL) has demonstrated significant potential for autonomous UAV control in complex environments, particularly when trained with sufficient flight data. However, the performance of DRL-based models can degrade when real-world conditions differ from their training environments, leading to increased collision risks and boundary violations. To address this challenge, we propose CLustered fEderAted Reinforcement Learning (CLEAR), a novel approach that enables UAVs to collaboratively fine-tune their DRL models in real time using flight data from their operational environment. Unlike traditional federated reinforcement learning (FRL) frameworks that assume clients have pre-existing local datasets, CLEAR organizes UAVs into clusters, where each cluster head aggregates flight data from its members to perform local training before contributing to a global model. This adaptive learning process enhances UAV control in dynamic airspace while maintaining decentralized autonomy. Simulation results show that CLEAR significantly outperforms HTransRL—which lacks model fine-tuning—in terms of arrival rates and scalability as the number of UAVs increases. These findings underscore CLEAR's effectiveness in enabling real-time DRL adaptation, positioning it as a promising solution for robust UAV navigation in AAM ecosystems.\",\"PeriodicalId\":34270,\"journal\":{\"name\":\"IEEE Open Journal of Vehicular Technology\",\"volume\":\"6 \",\"pages\":\"1582-1592\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-03-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11015557\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Open Journal of Vehicular Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11015557/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of Vehicular Technology","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11015557/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

先进空中机动（AAM）旨在将无人机（uav）整合到城市空域，依靠指定3D空中走廊内的自主导航，实现高效的货运和客运。深度强化学习（DRL）已经证明了在复杂环境下自主无人机控制的巨大潜力，特别是在有足够飞行数据的训练下。然而，当真实条件与其训练环境不同时，基于drl的模型的性能可能会下降，从而导致碰撞风险和边界违反的增加。为了应对这一挑战，我们提出了集群联邦强化学习（CLEAR），这是一种新颖的方法，使无人机能够使用来自其操作环境的飞行数据实时协作微调其DRL模型。与传统的联邦强化学习（FRL）框架不同的是，它假设客户端已经存在本地数据集，CLEAR将无人机组织成集群，每个簇头聚集其成员的飞行数据，在为全局模型做出贡献之前进行本地训练。这种自适应学习过程增强了无人机在动态空域的控制能力，同时保持了分散的自主性。仿真结果表明，随着无人机数量的增加，CLEAR在到达率和可扩展性方面明显优于缺乏模型微调的htransrl。这些发现强调了CLEAR在实现实时DRL适应方面的有效性，将其定位为AAM生态系统中强大的无人机导航的有前途的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Clustered Federated Reinforcement Learning for Autonomous UAV Control in Air Corridors

Advanced Air Mobility (AAM) aims to integrate unmanned aerial vehicles (UAVs) into urban airspace for efficient cargo and passenger transport, relying on autonomous navigation within designated 3D air corridors. Deep reinforcement learning (DRL) has demonstrated significant potential for autonomous UAV control in complex environments, particularly when trained with sufficient flight data. However, the performance of DRL-based models can degrade when real-world conditions differ from their training environments, leading to increased collision risks and boundary violations. To address this challenge, we propose CLustered fEderAted Reinforcement Learning (CLEAR), a novel approach that enables UAVs to collaboratively fine-tune their DRL models in real time using flight data from their operational environment. Unlike traditional federated reinforcement learning (FRL) frameworks that assume clients have pre-existing local datasets, CLEAR organizes UAVs into clusters, where each cluster head aggregates flight data from its members to perform local training before contributing to a global model. This adaptive learning process enhances UAV control in dynamic airspace while maintaining decentralized autonomy. Simulation results show that CLEAR significantly outperforms HTransRL—which lacks model fine-tuning—in terms of arrival rates and scalability as the number of UAVs increases. These findings underscore CLEAR's effectiveness in enabling real-time DRL adaptation, positioning it as a promising solution for robust UAV navigation in AAM ecosystems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Open Journal of Vehicular Technology Multiple-

CiteScore

9.60

自引率

0.00%

发文量

审稿时长

10 weeks