DS-YOLO: A dense small object detection algorithm based on inverted bottleneck and multi-scale fusion network

Hongyu Zhang , Guoliang Li , Dapeng Wan , Ziyue Wang , Jinshun Dong , Shoujun Lin , Lixia Deng , Haiying Liu
{"title":"DS-YOLO: A dense small object detection algorithm based on inverted bottleneck and multi-scale fusion network","authors":"Hongyu Zhang ,&nbsp;Guoliang Li ,&nbsp;Dapeng Wan ,&nbsp;Ziyue Wang ,&nbsp;Jinshun Dong ,&nbsp;Shoujun Lin ,&nbsp;Lixia Deng ,&nbsp;Haiying Liu","doi":"10.1016/j.birob.2024.100190","DOIUrl":null,"url":null,"abstract":"<div><div>In the field of security, intelligent surveillance tasks often involve a large number of dense and small objects, with severe occlusion between them, making detection particularly challenging. To address this significant challenge, Dense and Small YOLO (DS-YOLO), a dense small object detection algorithm based on YOLOv8s, is proposed in this paper. Firstly, to enhance the dense small objects’ feature extraction capability of backbone network, the paper proposes a lightweight backbone. The improved C2fUIB is employed to create a lightweight model and expand the receptive field, enabling the capture of richer contextual information and reducing the impact of occlusion on detection accuracy. Secondly, to enhance the feature fusion capability of model, a multi-scale feature fusion network, Light-weight Full Scale PAFPN (LFS-PAFPN), combined with the DO-C2f module, is introduced. The new module successfully reduces the miss rate of dense small objects while ensuring the accuracy of detecting large objects. Finally, to minimize feature loss of dense objects during network transmission, a dynamic upsampling module, DySample, is implemented. DS-YOLO was trained and tested on the CrowdHuman and VisDrone2019 datasets, which contain a large number of densely populated pedestrians, vehicles and other objects. Experimental evaluations demonstrated that DS-YOLO has advantages in dense small object detection tasks. Compared with YOLOv8s, the Recall and [email protected] are increased by 4.9% and 4.2% on CrowdHuman dataset, 4.6% and 5% on VisDrone2019, respectively. Simultaneously, DS-YOLO does not introduce a substantial amount of computing overhead, maintaining low hardware requirements.</div></div>","PeriodicalId":100184,"journal":{"name":"Biomimetic Intelligence and Robotics","volume":"4 4","pages":"Article 100190"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomimetic Intelligence and Robotics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667379724000482","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the field of security, intelligent surveillance tasks often involve a large number of dense and small objects, with severe occlusion between them, making detection particularly challenging. To address this significant challenge, Dense and Small YOLO (DS-YOLO), a dense small object detection algorithm based on YOLOv8s, is proposed in this paper. Firstly, to enhance the dense small objects’ feature extraction capability of backbone network, the paper proposes a lightweight backbone. The improved C2fUIB is employed to create a lightweight model and expand the receptive field, enabling the capture of richer contextual information and reducing the impact of occlusion on detection accuracy. Secondly, to enhance the feature fusion capability of model, a multi-scale feature fusion network, Light-weight Full Scale PAFPN (LFS-PAFPN), combined with the DO-C2f module, is introduced. The new module successfully reduces the miss rate of dense small objects while ensuring the accuracy of detecting large objects. Finally, to minimize feature loss of dense objects during network transmission, a dynamic upsampling module, DySample, is implemented. DS-YOLO was trained and tested on the CrowdHuman and VisDrone2019 datasets, which contain a large number of densely populated pedestrians, vehicles and other objects. Experimental evaluations demonstrated that DS-YOLO has advantages in dense small object detection tasks. Compared with YOLOv8s, the Recall and [email protected] are increased by 4.9% and 4.2% on CrowdHuman dataset, 4.6% and 5% on VisDrone2019, respectively. Simultaneously, DS-YOLO does not introduce a substantial amount of computing overhead, maintaining low hardware requirements.
DS-YOLO:基于倒置瓶颈和多尺度融合网络的密集小目标检测算法
在安防领域,智能监控任务往往涉及大量密集的小型物体,而且这些物体之间存在严重的遮挡,因此检测工作尤其具有挑战性。针对这一重大挑战,本文提出了一种基于 YOLOv8s 的密集小物体检测算法--密集小物体 YOLO(Dense and Small YOLO,DS-YOLO)。首先,为了增强骨干网的密集小目标特征提取能力,本文提出了一种轻量级骨干网。采用改进的 C2fUIB 创建轻量级模型并扩大感受野,从而能够捕获更丰富的上下文信息,降低遮挡对检测精度的影响。其次,为增强模型的特征融合能力,引入了多尺度特征融合网络--轻量级全尺度 PAFPN(LFS-PAFPN),并与 DO-C2f 模块相结合。新模块成功降低了高密度小物体的漏检率,同时保证了大物体的检测精度。最后,为了最大限度地减少密集物体在网络传输过程中的特征损失,还实施了动态上采样模块 DySample。DS-YOLO 在 CrowdHuman 和 VisDrone2019 数据集上进行了训练和测试,这两个数据集包含大量密集的行人、车辆和其他物体。实验评估表明,DS-YOLO 在密集小物体检测任务中具有优势。与 YOLOv8s 相比,DS-YOLO 在 CrowdHuman 数据集上的 Recall 和 [email protected] 分别提高了 4.9% 和 4.2%,在 VisDrone2019 上分别提高了 4.6% 和 5%。同时,DS-YOLO 没有引入大量计算开销,保持了较低的硬件要求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信