HMMSC-YOLO: A Comprehensively Improved Small Target Detection Algorithm

IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING
Chongyang Fan, Wenfang Li, Chang Lin
{"title":"HMMSC-YOLO: A Comprehensively Improved Small Target Detection Algorithm","authors":"Chongyang Fan,&nbsp;Wenfang Li,&nbsp;Chang Lin","doi":"10.1002/cpe.70288","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This study addressed the challenges of small target detection in aerial imaging applications, including limited pixel coverage, weak feature representation, and complex background interference, by proposing a collaborative optimisation algorithm named HMMSC-YOLO. Firstly, a CNN-Transformer heterogeneous feature interaction network was constructed to mitigate high-frequency information attenuation during hierarchical transmission of small targets. Secondly, a parameter-shared dilated convolutional chain structure was designed, employing a weight-reuse strategy across multi-branch heterogeneous receptive fields to enhance geometric feature sensitivity towards minuscule targets. A differentiable affine transformation-guided multi-kernel dynamic fusion mechanism was further developed, achieving high-precision geometric alignment of cross-scale features through learnable deformation fields, thereby overcoming the rigid fusion limitations of conventional feature pyramids. A dual-attention-driven feature recalibration architecture was introduced to improve target localisation robustness under complex background interference. Finally, a dual-path collaborative downsampling module was implemented to suppress feature confusion caused by traditional single-path downsampling. Experimental evaluations on the VisDrone2019 dataset demonstrated 1.4% and 1% improvements in mAP50 and mAP50:95 metrics respectively compared to baseline models, alongside 23.3% and 2.5% reductions in parameter quantity and computational costs. The algorithm exhibited superior localisation accuracy and occlusion resistance in dense small target scenarios, establishing an innovative technical framework for practical applications including aerial image analysis and low-light environmental monitoring.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 25-26","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70288","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

Abstract

This study addressed the challenges of small target detection in aerial imaging applications, including limited pixel coverage, weak feature representation, and complex background interference, by proposing a collaborative optimisation algorithm named HMMSC-YOLO. Firstly, a CNN-Transformer heterogeneous feature interaction network was constructed to mitigate high-frequency information attenuation during hierarchical transmission of small targets. Secondly, a parameter-shared dilated convolutional chain structure was designed, employing a weight-reuse strategy across multi-branch heterogeneous receptive fields to enhance geometric feature sensitivity towards minuscule targets. A differentiable affine transformation-guided multi-kernel dynamic fusion mechanism was further developed, achieving high-precision geometric alignment of cross-scale features through learnable deformation fields, thereby overcoming the rigid fusion limitations of conventional feature pyramids. A dual-attention-driven feature recalibration architecture was introduced to improve target localisation robustness under complex background interference. Finally, a dual-path collaborative downsampling module was implemented to suppress feature confusion caused by traditional single-path downsampling. Experimental evaluations on the VisDrone2019 dataset demonstrated 1.4% and 1% improvements in mAP50 and mAP50:95 metrics respectively compared to baseline models, alongside 23.3% and 2.5% reductions in parameter quantity and computational costs. The algorithm exhibited superior localisation accuracy and occlusion resistance in dense small target scenarios, establishing an innovative technical framework for practical applications including aerial image analysis and low-light environmental monitoring.

Abstract Image

一种综合改进的小目标检测算法HMMSC-YOLO
本研究通过提出一种名为HMMSC-YOLO的协同优化算法,解决了航空成像应用中小目标检测的挑战,包括有限的像素覆盖、弱特征表示和复杂的背景干扰。首先,构建CNN-Transformer异构特征交互网络,缓解小目标分层传输过程中的高频信息衰减;其次,设计了一种参数共享的扩张卷积链结构,采用跨多分支异构感受野的权重重用策略,提高了对微小目标的几何特征敏感性;进一步发展了一种可微仿射变换引导的多核动态融合机制,通过可学习的变形场实现了跨尺度特征的高精度几何对齐,从而克服了传统特征金字塔的刚性融合局限性。为了提高目标定位在复杂背景干扰下的鲁棒性,提出了一种双注意力驱动的特征重标定结构。最后,实现了双路径协同下采样模块,抑制了传统的单路径下采样导致的特征混淆。对VisDrone2019数据集的实验评估表明,与基线模型相比,mAP50和mAP50:95指标分别提高了1.4%和1%,参数数量和计算成本分别降低了23.3%和2.5%。该算法在密集小目标场景下具有优异的定位精度和抗遮挡能力,为航空图像分析和低光环境监测等实际应用建立了创新的技术框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Concurrency and Computation-Practice & Experience
Concurrency and Computation-Practice & Experience 工程技术-计算机:理论方法
CiteScore
5.00
自引率
10.00%
发文量
664
审稿时长
9.6 months
期刊介绍: Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of: Parallel and distributed computing; High-performance computing; Computational and data science; Artificial intelligence and machine learning; Big data applications, algorithms, and systems; Network science; Ontologies and semantics; Security and privacy; Cloud/edge/fog computing; Green computing; and Quantum computing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信