用于微小目标检测的多尺度增强区域明智分析仪。

IF 1.6 Q2 MULTIDISCIPLINARY SCIENCES
Thulasi Bikku, K P N V Satya Sree, Srinivasarao Thota, Malligunta Kiran Kumar, P Shanmugasundaram
{"title":"用于微小目标检测的多尺度增强区域明智分析仪。","authors":"Thulasi Bikku, K P N V Satya Sree, Srinivasarao Thota, Malligunta Kiran Kumar, P Shanmugasundaram","doi":"10.1186/s13104-025-07263-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Detecting small, faraway objects in real-time surveillance is challenging due to limited pixel representation, affecting classifier performance. Deep Learning (DL) techniques generate feature maps to enhance detection, but conventional methods suffer from high computational costs. To address this, we propose Multi-Scale Region-wise Pixel Analysis with GAN for Tiny Object Detection (MSRP-TODNet). The model is trained and tested on VisDrone VID 2019 and MS-COCO datasets. First, images undergo two-fold pre-processing using Improved Wiener Filter (IWF) for artifact removal and Adjusted Contrast Enhancement Method (ACEM) for blurring correction. The Multi-Agent Reinforcement Learning (MARL) algorithm splits the pre-processed image into four regions, analyzing each pixel to generate feature maps. These are processed by the Enhanced Feature Pyramid Network (EFPN), which merges them into a single feature map. Finally, a Generative Adversarial Network (GAN) detects objects with bounding boxes.</p><p><strong>Results: </strong>Experimental results on the DOTA dataset demonstrate that MSRP-TODNet outperforms existing state-of-the-art methods. Specifically, it achieves an mAP @0.5 of 84.2%, mAP @0.5:0.95 of 54.1%, and an F1-Score of 84.0%, surpassing improved TPH-YOLOv5, YOLOv7-Tiny, and DRDet by margins of 1.7%-6.1% in detection performance. These results demonstrate the framework's effectiveness for accurate, real-time small object detection in UAV surveillance and aerial imagery.</p>","PeriodicalId":9234,"journal":{"name":"BMC Research Notes","volume":"18 1","pages":"200"},"PeriodicalIF":1.6000,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12044837/pdf/","citationCount":"0","resultStr":"{\"title\":\"MSRP-TODNet: a multi-scale reinforced region wise analyser for tiny object detection.\",\"authors\":\"Thulasi Bikku, K P N V Satya Sree, Srinivasarao Thota, Malligunta Kiran Kumar, P Shanmugasundaram\",\"doi\":\"10.1186/s13104-025-07263-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>Detecting small, faraway objects in real-time surveillance is challenging due to limited pixel representation, affecting classifier performance. Deep Learning (DL) techniques generate feature maps to enhance detection, but conventional methods suffer from high computational costs. To address this, we propose Multi-Scale Region-wise Pixel Analysis with GAN for Tiny Object Detection (MSRP-TODNet). The model is trained and tested on VisDrone VID 2019 and MS-COCO datasets. First, images undergo two-fold pre-processing using Improved Wiener Filter (IWF) for artifact removal and Adjusted Contrast Enhancement Method (ACEM) for blurring correction. The Multi-Agent Reinforcement Learning (MARL) algorithm splits the pre-processed image into four regions, analyzing each pixel to generate feature maps. These are processed by the Enhanced Feature Pyramid Network (EFPN), which merges them into a single feature map. Finally, a Generative Adversarial Network (GAN) detects objects with bounding boxes.</p><p><strong>Results: </strong>Experimental results on the DOTA dataset demonstrate that MSRP-TODNet outperforms existing state-of-the-art methods. Specifically, it achieves an mAP @0.5 of 84.2%, mAP @0.5:0.95 of 54.1%, and an F1-Score of 84.0%, surpassing improved TPH-YOLOv5, YOLOv7-Tiny, and DRDet by margins of 1.7%-6.1% in detection performance. These results demonstrate the framework's effectiveness for accurate, real-time small object detection in UAV surveillance and aerial imagery.</p>\",\"PeriodicalId\":9234,\"journal\":{\"name\":\"BMC Research Notes\",\"volume\":\"18 1\",\"pages\":\"200\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2025-04-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12044837/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Research Notes\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s13104-025-07263-7\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Research Notes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s13104-025-07263-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

目的:由于像素表示有限,实时监控中检测小而远的物体具有挑战性,影响了分类器的性能。深度学习(DL)技术生成特征映射以增强检测,但传统方法的计算成本高。为了解决这个问题,我们提出了用GAN进行多尺度区域明智像素分析的微小目标检测(MSRP-TODNet)。该模型在VisDrone VID 2019和MS-COCO数据集上进行了训练和测试。首先,使用改进的维纳滤波(IWF)去除伪影,使用调整对比度增强法(ACEM)校正模糊,对图像进行双重预处理。多智能体强化学习(MARL)算法将预处理后的图像分成四个区域,分析每个像素生成特征图。这些数据通过增强特征金字塔网络(Enhanced Feature Pyramid Network, EFPN)进行处理,并将它们合并成一个单一的特征图。最后,生成对抗网络(GAN)检测具有边界框的对象。结果:在DOTA数据集上的实验结果表明,MSRP-TODNet优于现有的最先进的方法。具体来说,它的mAP @0.5为84.2%,mAP @0.5:0.95为54.1%,F1-Score为84.0%,在检测性能上超过改进的TPH-YOLOv5、YOLOv7-Tiny和DRDet,幅度为1.7%-6.1%。这些结果证明了该框架在无人机监视和航空成像中精确、实时的小目标检测方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
MSRP-TODNet: a multi-scale reinforced region wise analyser for tiny object detection.

Objective: Detecting small, faraway objects in real-time surveillance is challenging due to limited pixel representation, affecting classifier performance. Deep Learning (DL) techniques generate feature maps to enhance detection, but conventional methods suffer from high computational costs. To address this, we propose Multi-Scale Region-wise Pixel Analysis with GAN for Tiny Object Detection (MSRP-TODNet). The model is trained and tested on VisDrone VID 2019 and MS-COCO datasets. First, images undergo two-fold pre-processing using Improved Wiener Filter (IWF) for artifact removal and Adjusted Contrast Enhancement Method (ACEM) for blurring correction. The Multi-Agent Reinforcement Learning (MARL) algorithm splits the pre-processed image into four regions, analyzing each pixel to generate feature maps. These are processed by the Enhanced Feature Pyramid Network (EFPN), which merges them into a single feature map. Finally, a Generative Adversarial Network (GAN) detects objects with bounding boxes.

Results: Experimental results on the DOTA dataset demonstrate that MSRP-TODNet outperforms existing state-of-the-art methods. Specifically, it achieves an mAP @0.5 of 84.2%, mAP @0.5:0.95 of 54.1%, and an F1-Score of 84.0%, surpassing improved TPH-YOLOv5, YOLOv7-Tiny, and DRDet by margins of 1.7%-6.1% in detection performance. These results demonstrate the framework's effectiveness for accurate, real-time small object detection in UAV surveillance and aerial imagery.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Research Notes
BMC Research Notes Biochemistry, Genetics and Molecular Biology-Biochemistry, Genetics and Molecular Biology (all)
CiteScore
3.60
自引率
0.00%
发文量
363
审稿时长
15 weeks
期刊介绍: BMC Research Notes publishes scientifically valid research outputs that cannot be considered as full research or methodology articles. We support the research community across all scientific and clinical disciplines by providing an open access forum for sharing data and useful information; this includes, but is not limited to, updates to previous work, additions to established methods, short publications, null results, research proposals and data management plans.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信