基于多智能体联合强化学习的无人机群体感知联合轨迹与激励优化

IF 6 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Chaoyang Zhu , Xiao Zhu , Tuanfa Qin
{"title":"基于多智能体联合强化学习的无人机群体感知联合轨迹与激励优化","authors":"Chaoyang Zhu ,&nbsp;Xiao Zhu ,&nbsp;Tuanfa Qin","doi":"10.1016/j.iot.2025.101689","DOIUrl":null,"url":null,"abstract":"<div><div>UAV-assisted mobile crowdsensing (MCS) presents a promising paradigm for enhancing data collection in smart city environments, but faces a critical systems challenge: the intricate coupling between spatial trajectory planning, economic incentive mechanisms, and information-theoretic privacy guarantees. Traditional approaches addressing these dimensions in isolation often lead to suboptimal performance. In this paper, we propose <strong>PRISM</strong>, a <em>unified framework</em> designed around a holistic optimization approach with three components. First, PRISM models the strategic interactions among Service Providers, UAVs, and ground devices through a multi-level Stackelberg game, capturing the hierarchical economic dynamics that influence participation decisions. Second, it incorporates a privacy-aware incentive mechanism that explicitly links UAV navigation decisions to privacy constraints, dynamically managing the privacy-utility trade-off based on data quality. Third, to address the resulting multi-objective optimization across distinct decision timescales, we introduce <strong>TMFR</strong>, a Two-timescale Multi-agent Federated Reinforcement Learning algorithm that enables UAVs to collaboratively learn policies for both spatial navigation and incentive allocation. Experimental evaluations demonstrate that TMFR achieves 30% faster convergence and 20% higher data quality compared to baselines that optimize only subsets of the problem. These results highlight its suitability for next-generation smart city applications.</div></div>","PeriodicalId":29968,"journal":{"name":"Internet of Things","volume":"33 ","pages":"Article 101689"},"PeriodicalIF":6.0000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Joint trajectory and incentive optimization for privacy-preserving UAV crowdsensing via multi-agent federated reinforcement learning\",\"authors\":\"Chaoyang Zhu ,&nbsp;Xiao Zhu ,&nbsp;Tuanfa Qin\",\"doi\":\"10.1016/j.iot.2025.101689\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>UAV-assisted mobile crowdsensing (MCS) presents a promising paradigm for enhancing data collection in smart city environments, but faces a critical systems challenge: the intricate coupling between spatial trajectory planning, economic incentive mechanisms, and information-theoretic privacy guarantees. Traditional approaches addressing these dimensions in isolation often lead to suboptimal performance. In this paper, we propose <strong>PRISM</strong>, a <em>unified framework</em> designed around a holistic optimization approach with three components. First, PRISM models the strategic interactions among Service Providers, UAVs, and ground devices through a multi-level Stackelberg game, capturing the hierarchical economic dynamics that influence participation decisions. Second, it incorporates a privacy-aware incentive mechanism that explicitly links UAV navigation decisions to privacy constraints, dynamically managing the privacy-utility trade-off based on data quality. Third, to address the resulting multi-objective optimization across distinct decision timescales, we introduce <strong>TMFR</strong>, a Two-timescale Multi-agent Federated Reinforcement Learning algorithm that enables UAVs to collaboratively learn policies for both spatial navigation and incentive allocation. Experimental evaluations demonstrate that TMFR achieves 30% faster convergence and 20% higher data quality compared to baselines that optimize only subsets of the problem. These results highlight its suitability for next-generation smart city applications.</div></div>\",\"PeriodicalId\":29968,\"journal\":{\"name\":\"Internet of Things\",\"volume\":\"33 \",\"pages\":\"Article 101689\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2025-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Internet of Things\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2542660525002033\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet of Things","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2542660525002033","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

无人机辅助移动众感(MCS)为增强智慧城市环境中的数据收集提供了一个有前途的范例,但面临着一个关键的系统挑战:空间轨迹规划、经济激励机制和信息论隐私保障之间的复杂耦合。单独处理这些维度的传统方法通常会导致性能不理想。在本文中,我们提出了PRISM,这是一个围绕整体优化方法设计的统一框架,包含三个组件。首先,PRISM通过多层次的Stackelberg博弈模拟了服务提供商、无人机和地面设备之间的战略互动,捕捉了影响参与决策的分层经济动态。其次,它结合了隐私感知激励机制,将无人机导航决策明确地与隐私约束联系起来,动态地管理基于数据质量的隐私-效用权衡。第三,为了解决在不同决策时间尺度上产生的多目标优化问题,我们引入了TMFR,一种双时间尺度多智能体联邦强化学习算法,使无人机能够协同学习空间导航和激励分配的策略。实验评估表明,与只优化问题子集的基线相比,TMFR的收敛速度提高了30%,数据质量提高了20%。这些结果突出了它对下一代智慧城市应用的适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Joint trajectory and incentive optimization for privacy-preserving UAV crowdsensing via multi-agent federated reinforcement learning
UAV-assisted mobile crowdsensing (MCS) presents a promising paradigm for enhancing data collection in smart city environments, but faces a critical systems challenge: the intricate coupling between spatial trajectory planning, economic incentive mechanisms, and information-theoretic privacy guarantees. Traditional approaches addressing these dimensions in isolation often lead to suboptimal performance. In this paper, we propose PRISM, a unified framework designed around a holistic optimization approach with three components. First, PRISM models the strategic interactions among Service Providers, UAVs, and ground devices through a multi-level Stackelberg game, capturing the hierarchical economic dynamics that influence participation decisions. Second, it incorporates a privacy-aware incentive mechanism that explicitly links UAV navigation decisions to privacy constraints, dynamically managing the privacy-utility trade-off based on data quality. Third, to address the resulting multi-objective optimization across distinct decision timescales, we introduce TMFR, a Two-timescale Multi-agent Federated Reinforcement Learning algorithm that enables UAVs to collaboratively learn policies for both spatial navigation and incentive allocation. Experimental evaluations demonstrate that TMFR achieves 30% faster convergence and 20% higher data quality compared to baselines that optimize only subsets of the problem. These results highlight its suitability for next-generation smart city applications.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Internet of Things
Internet of Things Multiple-
CiteScore
3.60
自引率
5.10%
发文量
115
审稿时长
37 days
期刊介绍: Internet of Things; Engineering Cyber Physical Human Systems is a comprehensive journal encouraging cross collaboration between researchers, engineers and practitioners in the field of IoT & Cyber Physical Human Systems. The journal offers a unique platform to exchange scientific information on the entire breadth of technology, science, and societal applications of the IoT. The journal will place a high priority on timely publication, and provide a home for high quality. Furthermore, IOT is interested in publishing topical Special Issues on any aspect of IOT.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信