{"title":"基于注意力的MAPPO多源定位下大规模传感器调度","authors":"Qiyue Feng, Tao Tang, Yunpu Zhang, Zhidong Wu","doi":"10.1049/rsn2.70076","DOIUrl":null,"url":null,"abstract":"<p>Large-scale sensor scheduling for multisource localisation is a critical technology in wireless communication control and navigation systems. Most existing heuristic algorithms face challenges in adapting to large-scale sensor systems. To overcome this limitation, we utilise the self-learning capabilities of deep reinforcement learning (DRL) to enable multisource localisation. This paper proposes a large-scale sensor scheduling algorithm based on the multiagent proximal policy optimisation (LSS-MAPPO) framework. We develop a multisource localisation model based on time difference of arrival (TDOA) and design a reward function grounded in the Cramér–Rao lower bound (CRLB). Our approach integrates multihead attention layers into MAPPO to improve the performance of the algorithm. In large-scale sensor scheduling systems, multihead attention mechanisms can effectively handle the high-dimensional state space associated with multisource localisation in multiagent environments. Experimental results under different environments show that LSS-MAPPO improves localisation accuracy compared to the baseline in large-scale sensor scheduling. Notably, it maintains robust performance under partial observability, addressing critical gaps in large-scale dynamic sensor scheduling.</p>","PeriodicalId":50377,"journal":{"name":"Iet Radar Sonar and Navigation","volume":"19 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/rsn2.70076","citationCount":"0","resultStr":"{\"title\":\"Attention-Based MAPPO for Large-Scale Sensor Scheduling in Multisource Localisation\",\"authors\":\"Qiyue Feng, Tao Tang, Yunpu Zhang, Zhidong Wu\",\"doi\":\"10.1049/rsn2.70076\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Large-scale sensor scheduling for multisource localisation is a critical technology in wireless communication control and navigation systems. Most existing heuristic algorithms face challenges in adapting to large-scale sensor systems. To overcome this limitation, we utilise the self-learning capabilities of deep reinforcement learning (DRL) to enable multisource localisation. This paper proposes a large-scale sensor scheduling algorithm based on the multiagent proximal policy optimisation (LSS-MAPPO) framework. We develop a multisource localisation model based on time difference of arrival (TDOA) and design a reward function grounded in the Cramér–Rao lower bound (CRLB). Our approach integrates multihead attention layers into MAPPO to improve the performance of the algorithm. In large-scale sensor scheduling systems, multihead attention mechanisms can effectively handle the high-dimensional state space associated with multisource localisation in multiagent environments. Experimental results under different environments show that LSS-MAPPO improves localisation accuracy compared to the baseline in large-scale sensor scheduling. Notably, it maintains robust performance under partial observability, addressing critical gaps in large-scale dynamic sensor scheduling.</p>\",\"PeriodicalId\":50377,\"journal\":{\"name\":\"Iet Radar Sonar and Navigation\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/rsn2.70076\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Iet Radar Sonar and Navigation\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/rsn2.70076\",\"RegionNum\":4,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Iet Radar Sonar and Navigation","FirstCategoryId":"94","ListUrlMain":"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/rsn2.70076","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Attention-Based MAPPO for Large-Scale Sensor Scheduling in Multisource Localisation
Large-scale sensor scheduling for multisource localisation is a critical technology in wireless communication control and navigation systems. Most existing heuristic algorithms face challenges in adapting to large-scale sensor systems. To overcome this limitation, we utilise the self-learning capabilities of deep reinforcement learning (DRL) to enable multisource localisation. This paper proposes a large-scale sensor scheduling algorithm based on the multiagent proximal policy optimisation (LSS-MAPPO) framework. We develop a multisource localisation model based on time difference of arrival (TDOA) and design a reward function grounded in the Cramér–Rao lower bound (CRLB). Our approach integrates multihead attention layers into MAPPO to improve the performance of the algorithm. In large-scale sensor scheduling systems, multihead attention mechanisms can effectively handle the high-dimensional state space associated with multisource localisation in multiagent environments. Experimental results under different environments show that LSS-MAPPO improves localisation accuracy compared to the baseline in large-scale sensor scheduling. Notably, it maintains robust performance under partial observability, addressing critical gaps in large-scale dynamic sensor scheduling.
期刊介绍:
IET Radar, Sonar & Navigation covers the theory and practice of systems and signals for radar, sonar, radiolocation, navigation, and surveillance purposes, in aerospace and terrestrial applications.
Examples include advances in waveform design, clutter and detection, electronic warfare, adaptive array and superresolution methods, tracking algorithms, synthetic aperture, and target recognition techniques.