{"title":"MARFPNet: Multiattention and Adaptive Reparameterized Feature Pyramid Network for Small Target Detection on Water Surfaces","authors":"Quanbo Ge;Wenjing Da;Mengmeng Wang","doi":"10.1109/TIM.2024.3485463","DOIUrl":null,"url":null,"abstract":"The images captured by unmanned aerial vehicles (UAVs) are often limited in scale and feature information, making it challenging for current detection algorithms to learn the features of objects effectively. This limitation hampers accurate identification of small objects on water surfaces. We introduce a multiattention and adaptive reparameterized feature pyramid network for small target detection on water surfaces (MARFPNet) to tackle this issue. First, to address the loss of small object features during extraction, we improved the attention mechanism based on the characteristics of small objects and proposed a multiattention module, integrating it into the feature extraction process. Second, to address the semantic information of small objects being retained mostly in shallow feature maps and not fully utilized, we introduced an adaptive reparameterized generalized feature pyramid network (Adaptive_RepGFPN). This module reorganizes features, expands the fusion scale, and incorporates adaptive weighting in the concat operation. Third, to overcome the challenge of ineffective restoration of feature map information by upsampling, we introduce the Dysample. Finally, to address the problem of the loss function being sensitive to scale changes, we propose the normalized Wasserstein distance (NWD) loss function to reduce the sudden drop in loss due to scale changes. We conducted experiments on VisDrone, SeaDronsSee, and the self-build dataset. MARFPNet showed higher accuracy compared to other detection algorithms. Notably, on the self-build dataset, mAP50 and mAP50:95 improved by 9.1% and 3.5% over the baseline network. This demonstrates MARFPNet’s effectiveness and suitability for detecting small targets in drone aerial photography on water surfaces.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"73 ","pages":"1-17"},"PeriodicalIF":5.6000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10737298/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The images captured by unmanned aerial vehicles (UAVs) are often limited in scale and feature information, making it challenging for current detection algorithms to learn the features of objects effectively. This limitation hampers accurate identification of small objects on water surfaces. We introduce a multiattention and adaptive reparameterized feature pyramid network for small target detection on water surfaces (MARFPNet) to tackle this issue. First, to address the loss of small object features during extraction, we improved the attention mechanism based on the characteristics of small objects and proposed a multiattention module, integrating it into the feature extraction process. Second, to address the semantic information of small objects being retained mostly in shallow feature maps and not fully utilized, we introduced an adaptive reparameterized generalized feature pyramid network (Adaptive_RepGFPN). This module reorganizes features, expands the fusion scale, and incorporates adaptive weighting in the concat operation. Third, to overcome the challenge of ineffective restoration of feature map information by upsampling, we introduce the Dysample. Finally, to address the problem of the loss function being sensitive to scale changes, we propose the normalized Wasserstein distance (NWD) loss function to reduce the sudden drop in loss due to scale changes. We conducted experiments on VisDrone, SeaDronsSee, and the self-build dataset. MARFPNet showed higher accuracy compared to other detection algorithms. Notably, on the self-build dataset, mAP50 and mAP50:95 improved by 9.1% and 3.5% over the baseline network. This demonstrates MARFPNet’s effectiveness and suitability for detecting small targets in drone aerial photography on water surfaces.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.