Partial feature reparameterization and shallow-level interaction for remote sensing object detection.

IF 3.9 | Zone 2 (Multidisciplinary Journals) | Q1 MULTIDISCIPLINARY SCIENCES
Minh Tai Pham Nguyen, Quoc Duy Nam Nguyen, Hoang Viet Anh Le, Minh Khue Phan Tran, Tadashi Nakano, Thi Hong Tran
Journal: Scientific Reports, 15(1), 28629
Published: 2025-08-05 (Journal Article)
DOI: https://doi.org/10.1038/s41598-025-14035-7
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12325737/pdf/
Citations: 0

Abstract

Remote sensing object detection has recently emerged as one of the challenging topics in deep learning applications because it demands both high detection performance and computational efficiency. To address these problems, this study introduces an efficient one-stage object detector, designed mainly for detecting objects in remote sensing images, that comprises several innovations. First, an extraction block called PRepConvBlock is proposed; it leverages reparameterization convolution and partial feature utilization to reduce the complexity of convolution operations, which allows larger kernel sizes to be used to form longer-range interactions between features and significantly expand receptive fields. Second, a shallow multi-scale fusion framework called SB-FPN, based on Bi-FPN, is proposed; it utilizes the cross-interaction between shallow and deeper scales while inheriting the bidirectional connections of Bi-FPN to enhance the visual representation of features. Lastly, a Shallow-level Optimized Reparameterization Architecture Detector (SORA-DET) is proposed by combining these innovations. This detector is designed for UAV remote sensing object detection tasks and employs up to four detection heads. As a result, our proposed detector achieves competitive performance, outperforming most other large-size models and state-of-the-art (SOTA) works. Specifically, SORA-DET achieves 39.3% mAP50 on the VisDrone2019 test set and up to 84.0% mAP50 on the SeaDroneSeeV2 validation set. Furthermore, compared to other large-scale one-stage detectors, our proposed detector has nearly 88.1% fewer parameters and an inference time of only 5.4 ms.
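
The abstract names reparameterization convolution and partial feature utilization but does not describe the internals of PRepConvBlock. As a rough illustration of those two general ideas only, the toy sketch below folds a parallel 1x1 branch into a 3x3 kernel (so one convolution reproduces the two-branch output) and applies the result to just a subset of channels. Everything here is an assumption made for illustration: the single-channel per-channel kernels, the zero padding, the 3x3 and 1x1 branch sizes, and all function names are hypothetical and are not taken from the paper.

```python
"""Toy sketch: structural reparameterization plus partial channel convolution.

Assumptions (not from the paper): single-channel 3x3 and 1x1 branches, zero
padding, per-channel kernels. All function names below are hypothetical.
"""


def conv2d_same(image, kernel):
    """Naive single-channel 2D cross-correlation with zero padding (same size)."""
    h, w = len(image), len(image[0])
    k = len(kernel)
    pad = k // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(k):
                for dj in range(k):
                    y, x = i + di - pad, j + dj - pad
                    if 0 <= y < h and 0 <= x < w:
                        acc += kernel[di][dj] * image[y][x]
            out[i][j] = acc
    return out


def add_maps(a, b):
    """Element-wise sum of two feature maps of equal shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def merge_branches(kernel3, kernel1):
    """Fold a 1x1 branch into a 3x3 branch by adding it at the kernel centre."""
    merged = [row[:] for row in kernel3]
    merged[1][1] += kernel1[0][0]
    return merged


def partial_apply(channels, n_conv, kernel):
    """Convolve only the first n_conv channels; copy the rest through unchanged."""
    processed = [conv2d_same(c, kernel) for c in channels[:n_conv]]
    untouched = [[row[:] for row in c] for c in channels[n_conv:]]
    return processed + untouched


def max_abs_diff(a, b):
    """Largest absolute element-wise difference between two feature maps."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


# Two-branch view: a 3x3 branch and a 1x1 branch whose outputs are summed.
k3 = [[0.0, 1.0, 0.0], [1.0, -4.0, 1.0], [0.0, 1.0, 0.0]]
k1 = [[2.0]]
image = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
two_branch = add_maps(conv2d_same(image, k3), conv2d_same(image, k1))

# Single-branch view: one merged 3x3 kernel gives the same output.
merged_kernel = merge_branches(k3, k1)
one_branch = conv2d_same(image, merged_kernel)
print("max difference between views:", max_abs_diff(two_branch, one_branch))

# Partial feature utilization: only the first of two channels is convolved.
other = [[0.5] * 3 for _ in range(3)]
out = partial_apply([image, other], 1, merged_kernel)
print("second channel unchanged:", out[1] == other)
```

Because convolution is linear, summing the branch outputs equals convolving once with the summed kernels, which is why the merged kernel adds no cost at inference. Convolving only some channels reduces the work further, and that saving is the kind of headroom the abstract says makes larger kernels affordable.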

Source journal
Scientific Reports (Natural Science Disciplines)
CiteScore: 7.50
Self-citation rate: 4.30%
Articles published: 19567
Review time: 3.9 months
About the journal: We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections. Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021).
• Engineering: Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world's biggest challenges, helping to save lives and improve the way we live.
• Physical sciences: Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature, often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics.
• Earth and environmental sciences: Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems.
• Biological sciences: Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms and animals to plants.
• Health sciences: The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.