Meng Xiang Xuan;Liangkun Yu;Xiang Sun;Sudharman K. Jayaweera
{"title":"空中走廊无人机控制的聚类联邦强化学习","authors":"Meng Xiang Xuan;Liangkun Yu;Xiang Sun;Sudharman K. Jayaweera","doi":"10.1109/OJVT.2025.3573647","DOIUrl":null,"url":null,"abstract":"Advanced Air Mobility (AAM) aims to integrate unmanned aerial vehicles (UAVs) into urban airspace for efficient cargo and passenger transport, relying on autonomous navigation within designated 3D air corridors. Deep reinforcement learning (DRL) has demonstrated significant potential for autonomous UAV control in complex environments, particularly when trained with sufficient flight data. However, the performance of DRL-based models can degrade when real-world conditions differ from their training environments, leading to increased collision risks and boundary violations. To address this challenge, we propose CLustered fEderAted Reinforcement Learning (CLEAR), a novel approach that enables UAVs to collaboratively fine-tune their DRL models in real time using flight data from their operational environment. Unlike traditional federated reinforcement learning (FRL) frameworks that assume clients have pre-existing local datasets, CLEAR organizes UAVs into clusters, where each cluster head aggregates flight data from its members to perform local training before contributing to a global model. This adaptive learning process enhances UAV control in dynamic airspace while maintaining decentralized autonomy. Simulation results show that CLEAR significantly outperforms HTransRL—which lacks model fine-tuning—in terms of arrival rates and scalability as the number of UAVs increases. These findings underscore CLEAR's effectiveness in enabling real-time DRL adaptation, positioning it as a promising solution for robust UAV navigation in AAM ecosystems.","PeriodicalId":34270,"journal":{"name":"IEEE Open Journal of Vehicular Technology","volume":"6 ","pages":"1582-1592"},"PeriodicalIF":4.8000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11015557","citationCount":"0","resultStr":"{\"title\":\"Clustered Federated Reinforcement Learning for Autonomous UAV Control in Air Corridors\",\"authors\":\"Meng Xiang Xuan;Liangkun Yu;Xiang Sun;Sudharman K. Jayaweera\",\"doi\":\"10.1109/OJVT.2025.3573647\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Advanced Air Mobility (AAM) aims to integrate unmanned aerial vehicles (UAVs) into urban airspace for efficient cargo and passenger transport, relying on autonomous navigation within designated 3D air corridors. Deep reinforcement learning (DRL) has demonstrated significant potential for autonomous UAV control in complex environments, particularly when trained with sufficient flight data. However, the performance of DRL-based models can degrade when real-world conditions differ from their training environments, leading to increased collision risks and boundary violations. To address this challenge, we propose CLustered fEderAted Reinforcement Learning (CLEAR), a novel approach that enables UAVs to collaboratively fine-tune their DRL models in real time using flight data from their operational environment. Unlike traditional federated reinforcement learning (FRL) frameworks that assume clients have pre-existing local datasets, CLEAR organizes UAVs into clusters, where each cluster head aggregates flight data from its members to perform local training before contributing to a global model. This adaptive learning process enhances UAV control in dynamic airspace while maintaining decentralized autonomy. Simulation results show that CLEAR significantly outperforms HTransRL—which lacks model fine-tuning—in terms of arrival rates and scalability as the number of UAVs increases. These findings underscore CLEAR's effectiveness in enabling real-time DRL adaptation, positioning it as a promising solution for robust UAV navigation in AAM ecosystems.\",\"PeriodicalId\":34270,\"journal\":{\"name\":\"IEEE Open Journal of Vehicular Technology\",\"volume\":\"6 \",\"pages\":\"1582-1592\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-03-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11015557\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Open Journal of Vehicular Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11015557/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of Vehicular Technology","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11015557/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Clustered Federated Reinforcement Learning for Autonomous UAV Control in Air Corridors
Advanced Air Mobility (AAM) aims to integrate unmanned aerial vehicles (UAVs) into urban airspace for efficient cargo and passenger transport, relying on autonomous navigation within designated 3D air corridors. Deep reinforcement learning (DRL) has demonstrated significant potential for autonomous UAV control in complex environments, particularly when trained with sufficient flight data. However, the performance of DRL-based models can degrade when real-world conditions differ from their training environments, leading to increased collision risks and boundary violations. To address this challenge, we propose CLustered fEderAted Reinforcement Learning (CLEAR), a novel approach that enables UAVs to collaboratively fine-tune their DRL models in real time using flight data from their operational environment. Unlike traditional federated reinforcement learning (FRL) frameworks that assume clients have pre-existing local datasets, CLEAR organizes UAVs into clusters, where each cluster head aggregates flight data from its members to perform local training before contributing to a global model. This adaptive learning process enhances UAV control in dynamic airspace while maintaining decentralized autonomy. Simulation results show that CLEAR significantly outperforms HTransRL—which lacks model fine-tuning—in terms of arrival rates and scalability as the number of UAVs increases. These findings underscore CLEAR's effectiveness in enabling real-time DRL adaptation, positioning it as a promising solution for robust UAV navigation in AAM ecosystems.