{"title":"无人机支持的高能效空中计算:一种联合深度强化学习方法","authors":"Qianqian Wu;Qiang Liu;Ying He;Zefan Wu","doi":"10.1109/TR.2024.3480117","DOIUrl":null,"url":null,"abstract":"Aerial computing paradigms, particularly those involving UAVs as access points and radio towers, show significant promise for local data analysis and real-time service provision in aerial access networks. However, the limited battery life of UAVs, compounded by the high-energy demands of communication tasks and prolonged computation delays, presents a significant challenge. Deep reinforcement learning (DRL) enables UAVs to autonomously optimize their operations, reducing both energy consumption and latency. Nevertheless, the prolonged learning process of DRL can lead to inefficiencies, especially in dynamic environments where the equitable participation of UAVs is crucial. To address these issues, we introduce a fairness-oriented federated learning (FL) scheme that employs importance sampling to select UAVs for training, ensuring equitable utilization of each UAV's data. Furthermore, we integrate this FL fairness scheme into the design of DRL algorithms, termed FedDRL. This algorithm jointly optimizes the computation capabilities and bandwidth allocation of UAVs to minimize system costs. Numerical results demonstrate the fairness of FedDRL in fFL networks. Specifically, compared to other state-of-the-art DRL algorithms (e.g., TD3 and DDPG), FedDRL reduces system costs by 43.82% and 49.45%, respectively.","PeriodicalId":56305,"journal":{"name":"IEEE Transactions on Reliability","volume":"74 3","pages":"3559-3572"},"PeriodicalIF":5.7000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"UAV-Enabled Energy-Efficient Aerial Computing: A Federated Deep Reinforcement Learning Approach\",\"authors\":\"Qianqian Wu;Qiang Liu;Ying He;Zefan Wu\",\"doi\":\"10.1109/TR.2024.3480117\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aerial computing paradigms, particularly those involving UAVs as access points and radio towers, show significant promise for local data analysis and real-time service provision in aerial access networks. However, the limited battery life of UAVs, compounded by the high-energy demands of communication tasks and prolonged computation delays, presents a significant challenge. Deep reinforcement learning (DRL) enables UAVs to autonomously optimize their operations, reducing both energy consumption and latency. Nevertheless, the prolonged learning process of DRL can lead to inefficiencies, especially in dynamic environments where the equitable participation of UAVs is crucial. To address these issues, we introduce a fairness-oriented federated learning (FL) scheme that employs importance sampling to select UAVs for training, ensuring equitable utilization of each UAV's data. Furthermore, we integrate this FL fairness scheme into the design of DRL algorithms, termed FedDRL. This algorithm jointly optimizes the computation capabilities and bandwidth allocation of UAVs to minimize system costs. Numerical results demonstrate the fairness of FedDRL in fFL networks. Specifically, compared to other state-of-the-art DRL algorithms (e.g., TD3 and DDPG), FedDRL reduces system costs by 43.82% and 49.45%, respectively.\",\"PeriodicalId\":56305,\"journal\":{\"name\":\"IEEE Transactions on Reliability\",\"volume\":\"74 3\",\"pages\":\"3559-3572\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2024-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Reliability\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10737313/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Reliability","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10737313/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
UAV-Enabled Energy-Efficient Aerial Computing: A Federated Deep Reinforcement Learning Approach
Aerial computing paradigms, particularly those involving UAVs as access points and radio towers, show significant promise for local data analysis and real-time service provision in aerial access networks. However, the limited battery life of UAVs, compounded by the high-energy demands of communication tasks and prolonged computation delays, presents a significant challenge. Deep reinforcement learning (DRL) enables UAVs to autonomously optimize their operations, reducing both energy consumption and latency. Nevertheless, the prolonged learning process of DRL can lead to inefficiencies, especially in dynamic environments where the equitable participation of UAVs is crucial. To address these issues, we introduce a fairness-oriented federated learning (FL) scheme that employs importance sampling to select UAVs for training, ensuring equitable utilization of each UAV's data. Furthermore, we integrate this FL fairness scheme into the design of DRL algorithms, termed FedDRL. This algorithm jointly optimizes the computation capabilities and bandwidth allocation of UAVs to minimize system costs. Numerical results demonstrate the fairness of FedDRL in fFL networks. Specifically, compared to other state-of-the-art DRL algorithms (e.g., TD3 and DDPG), FedDRL reduces system costs by 43.82% and 49.45%, respectively.
期刊介绍:
IEEE Transactions on Reliability is a refereed journal for the reliability and allied disciplines including, but not limited to, maintainability, physics of failure, life testing, prognostics, design and manufacture for reliability, reliability for systems of systems, network availability, mission success, warranty, safety, and various measures of effectiveness. Topics eligible for publication range from hardware to software, from materials to systems, from consumer and industrial devices to manufacturing plants, from individual items to networks, from techniques for making things better to ways of predicting and measuring behavior in the field. As an engineering subject that supports new and existing technologies, we constantly expand into new areas of the assurance sciences.