{"title":"遥感小目标检测的多阶信息聚合网络","authors":"Jinli Zhong, Jianxun Zhang","doi":"10.1016/j.dsp.2025.105537","DOIUrl":null,"url":null,"abstract":"<div><div>In critical fields such as intelligent transportation and national defense security, remote sensing small target detection technology plays an essential role. To effectively overcome the complexity of remote sensing scenes and the weak response of small-scale targets, this paper proposes a lightweight Multi-order Information Aggregation Network (MIANet). MIANet mainly consists of two parts: Cross-spatial Multi-order Information Aggregation Module (CMIAM) and Multi-dimensional Information Enhancement Module (MIEM). Inspired by the research on multi-order interactions in game theory within deep learning, CMIAM can aggregate low-order, mid-order, and high-order information, effectively improving the detection accuracy of small targets in complex remote sensing scenes. Based on the design philosophy of manifolds of interest, MIEM can effectively remove redundant information, and MIEM utilizes a three-branch structure to capture cross-dimensional information interaction, enriching feature representation and achieving the effect of information enhancement. We have validated the performance of our model on multiple remote sensing small target datasets including VEDAI, DIOR, NWPU-VHR10, MVRSD, and SIMD, and achieved excellent results. In particular, for the lightweight MIANet, the accuracy metric <span><math><mtext>m</mtext><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>50</mn></mrow></msub></math></span> reached 73.7% on the VEDAI dataset, surpassing the current SOTA method SuperYOLO for remote sensing small target detection on a single modality. On the NWPU-VHR10 dataset, MIANet outperformed SuperYOLO by 2.1% in the <span><math><mtext>m</mtext><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>50</mn></mrow></msub></math></span> metric and FFCA-YOLO by 2.2%. On the DIOR dataset, with a parameter count of 9.79M, MIANet achieved an <span><math><mtext>m</mtext><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>50</mn></mrow></msub></math></span> metric of 81.3% and an <span><math><mtext>m</mtext><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>50</mn><mo>:</mo><mn>95</mn></mrow></msub></math></span> of 60.9%, which demonstrate that our model exhibits strong robustness characteristics. Our code will be made publicly available on <span><span>https://github.com/Liro-o/MIANet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"168 ","pages":"Article 105537"},"PeriodicalIF":3.0000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-order information aggregation network for remote sensing small target detection\",\"authors\":\"Jinli Zhong, Jianxun Zhang\",\"doi\":\"10.1016/j.dsp.2025.105537\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In critical fields such as intelligent transportation and national defense security, remote sensing small target detection technology plays an essential role. To effectively overcome the complexity of remote sensing scenes and the weak response of small-scale targets, this paper proposes a lightweight Multi-order Information Aggregation Network (MIANet). MIANet mainly consists of two parts: Cross-spatial Multi-order Information Aggregation Module (CMIAM) and Multi-dimensional Information Enhancement Module (MIEM). Inspired by the research on multi-order interactions in game theory within deep learning, CMIAM can aggregate low-order, mid-order, and high-order information, effectively improving the detection accuracy of small targets in complex remote sensing scenes. Based on the design philosophy of manifolds of interest, MIEM can effectively remove redundant information, and MIEM utilizes a three-branch structure to capture cross-dimensional information interaction, enriching feature representation and achieving the effect of information enhancement. We have validated the performance of our model on multiple remote sensing small target datasets including VEDAI, DIOR, NWPU-VHR10, MVRSD, and SIMD, and achieved excellent results. In particular, for the lightweight MIANet, the accuracy metric <span><math><mtext>m</mtext><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>50</mn></mrow></msub></math></span> reached 73.7% on the VEDAI dataset, surpassing the current SOTA method SuperYOLO for remote sensing small target detection on a single modality. On the NWPU-VHR10 dataset, MIANet outperformed SuperYOLO by 2.1% in the <span><math><mtext>m</mtext><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>50</mn></mrow></msub></math></span> metric and FFCA-YOLO by 2.2%. On the DIOR dataset, with a parameter count of 9.79M, MIANet achieved an <span><math><mtext>m</mtext><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>50</mn></mrow></msub></math></span> metric of 81.3% and an <span><math><mtext>m</mtext><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>50</mn><mo>:</mo><mn>95</mn></mrow></msub></math></span> of 60.9%, which demonstrate that our model exhibits strong robustness characteristics. Our code will be made publicly available on <span><span>https://github.com/Liro-o/MIANet</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"168 \",\"pages\":\"Article 105537\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425005597\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425005597","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Multi-order information aggregation network for remote sensing small target detection
In critical fields such as intelligent transportation and national defense security, remote sensing small target detection technology plays an essential role. To effectively overcome the complexity of remote sensing scenes and the weak response of small-scale targets, this paper proposes a lightweight Multi-order Information Aggregation Network (MIANet). MIANet mainly consists of two parts: Cross-spatial Multi-order Information Aggregation Module (CMIAM) and Multi-dimensional Information Enhancement Module (MIEM). Inspired by the research on multi-order interactions in game theory within deep learning, CMIAM can aggregate low-order, mid-order, and high-order information, effectively improving the detection accuracy of small targets in complex remote sensing scenes. Based on the design philosophy of manifolds of interest, MIEM can effectively remove redundant information, and MIEM utilizes a three-branch structure to capture cross-dimensional information interaction, enriching feature representation and achieving the effect of information enhancement. We have validated the performance of our model on multiple remote sensing small target datasets including VEDAI, DIOR, NWPU-VHR10, MVRSD, and SIMD, and achieved excellent results. In particular, for the lightweight MIANet, the accuracy metric reached 73.7% on the VEDAI dataset, surpassing the current SOTA method SuperYOLO for remote sensing small target detection on a single modality. On the NWPU-VHR10 dataset, MIANet outperformed SuperYOLO by 2.1% in the metric and FFCA-YOLO by 2.2%. On the DIOR dataset, with a parameter count of 9.79M, MIANet achieved an metric of 81.3% and an of 60.9%, which demonstrate that our model exhibits strong robustness characteristics. Our code will be made publicly available on https://github.com/Liro-o/MIANet.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,