MARFPNet: Multiattention and Adaptive Reparameterized Feature Pyramid Network for Small Target Detection on Water Surfaces

IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Quanbo Ge;Wenjing Da;Mengmeng Wang
{"title":"MARFPNet: Multiattention and Adaptive Reparameterized Feature Pyramid Network for Small Target Detection on Water Surfaces","authors":"Quanbo Ge;Wenjing Da;Mengmeng Wang","doi":"10.1109/TIM.2024.3485463","DOIUrl":null,"url":null,"abstract":"The images captured by unmanned aerial vehicles (UAVs) are often limited in scale and feature information, making it challenging for current detection algorithms to learn the features of objects effectively. This limitation hampers accurate identification of small objects on water surfaces. We introduce a multiattention and adaptive reparameterized feature pyramid network for small target detection on water surfaces (MARFPNet) to tackle this issue. First, to address the loss of small object features during extraction, we improved the attention mechanism based on the characteristics of small objects and proposed a multiattention module, integrating it into the feature extraction process. Second, to address the semantic information of small objects being retained mostly in shallow feature maps and not fully utilized, we introduced an adaptive reparameterized generalized feature pyramid network (Adaptive_RepGFPN). This module reorganizes features, expands the fusion scale, and incorporates adaptive weighting in the concat operation. Third, to overcome the challenge of ineffective restoration of feature map information by upsampling, we introduce the Dysample. Finally, to address the problem of the loss function being sensitive to scale changes, we propose the normalized Wasserstein distance (NWD) loss function to reduce the sudden drop in loss due to scale changes. We conducted experiments on VisDrone, SeaDronsSee, and the self-build dataset. MARFPNet showed higher accuracy compared to other detection algorithms. Notably, on the self-build dataset, mAP50 and mAP50:95 improved by 9.1% and 3.5% over the baseline network. This demonstrates MARFPNet’s effectiveness and suitability for detecting small targets in drone aerial photography on water surfaces.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"73 ","pages":"1-17"},"PeriodicalIF":5.6000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10737298/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

The images captured by unmanned aerial vehicles (UAVs) are often limited in scale and feature information, making it challenging for current detection algorithms to learn the features of objects effectively. This limitation hampers accurate identification of small objects on water surfaces. We introduce a multiattention and adaptive reparameterized feature pyramid network for small target detection on water surfaces (MARFPNet) to tackle this issue. First, to address the loss of small object features during extraction, we improved the attention mechanism based on the characteristics of small objects and proposed a multiattention module, integrating it into the feature extraction process. Second, to address the semantic information of small objects being retained mostly in shallow feature maps and not fully utilized, we introduced an adaptive reparameterized generalized feature pyramid network (Adaptive_RepGFPN). This module reorganizes features, expands the fusion scale, and incorporates adaptive weighting in the concat operation. Third, to overcome the challenge of ineffective restoration of feature map information by upsampling, we introduce the Dysample. Finally, to address the problem of the loss function being sensitive to scale changes, we propose the normalized Wasserstein distance (NWD) loss function to reduce the sudden drop in loss due to scale changes. We conducted experiments on VisDrone, SeaDronsSee, and the self-build dataset. MARFPNet showed higher accuracy compared to other detection algorithms. Notably, on the self-build dataset, mAP50 and mAP50:95 improved by 9.1% and 3.5% over the baseline network. This demonstrates MARFPNet’s effectiveness and suitability for detecting small targets in drone aerial photography on water surfaces.
MARFPNet:用于水面小目标检测的多注意力和自适应重参数化特征金字塔网络
无人驾驶飞行器(UAV)拍摄的图像通常在比例和特征信息方面受到限制,使得目前的检测算法难以有效地学习物体的特征。这种限制阻碍了对水面上小物体的准确识别。为解决这一问题,我们引入了一种用于水面小目标检测的多关注和自适应重参数化特征金字塔网络(MARFPNet)。首先,针对小目标特征提取过程中的损失,我们根据小目标的特点改进了注意机制,提出了多注意模块,并将其集成到特征提取过程中。其次,针对小物体的语义信息大多保留在浅层特征图中而未被充分利用的问题,我们引入了自适应重参数广义特征金字塔网络(Adaptive_RepGFPN)。该模块重组了特征,扩大了融合规模,并在连接操作中加入了自适应加权。第三,为了克服上采样无法有效还原特征图信息的难题,我们引入了 Dysample。最后,为了解决损失函数对尺度变化敏感的问题,我们提出了归一化瓦瑟斯坦距离(NWD)损失函数,以减少尺度变化造成的损失骤降。我们在 VisDrone、SeaDronsSee 和自建数据集上进行了实验。与其他检测算法相比,MARFPNet 显示出更高的准确性。值得注意的是,在自建数据集上,mAP50 和 mAP50:95 比基准网络分别提高了 9.1% 和 3.5%。这证明了 MARFPNet 在检测无人机航拍水面小目标方面的有效性和适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Instrumentation and Measurement
IEEE Transactions on Instrumentation and Measurement 工程技术-工程:电子与电气
CiteScore
9.00
自引率
23.20%
发文量
1294
审稿时长
3.9 months
期刊介绍: Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信