一种基于深度学习的个人防护装备检测算法。

IF 2.6 3区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
PLoS ONE Pub Date : 2025-05-29 eCollection Date: 2025-01-01 DOI:10.1371/journal.pone.0322115
Bo Tong, Guan Li, Xiangli Bu, Yang Wang, Xingchen Yu
{"title":"一种基于深度学习的个人防护装备检测算法。","authors":"Bo Tong, Guan Li, Xiangli Bu, Yang Wang, Xingchen Yu","doi":"10.1371/journal.pone.0322115","DOIUrl":null,"url":null,"abstract":"<p><p>Personal protective equipment (PPE) is critical for ensuring the safety of construction workers. However, site surveillance images from construction sites often feature multi-size and multi-scale targets, leading to low detection accuracy for PPE in existing models. To address this issue, this paper proposes an improved model based on YOLOv8n.By enriching feature diversity and enhancing the model's adaptability to geometric transformations, the detection accuracy is improved.A Multi-Scale Group Convolution Module (MSGP) was designed to extract multi-level features using different convolution kernels. A Multi-Scale Feature Diffusion Pyramid Network (MFDPN) was developed, which aggregates multi-scale features through the Multiscale Feature Focus (MFF) module and diffuses them across scales, providing each scale with detailed contextual information. A customized Task Alignment Module was introduced to integrate interactive features, optimizing both classification and localization tasks. The DCNV2(Deformable Convolutional Networks v2) module was incorporated to handle geometric scale transformations by generating spatial offsets and feature masks from interactive features, thereby improving localization accuracy and dynamically selecting weights to enhance classification precision.The improved model incorporates rich multi-level and multi-scale features, allowing it to better adapt to tasks involving geometric transformations and aligning with the image data distribution in construction scenarios. Additionally, structured pruning techniques were applied to the model at varying levels, further reducing computational and parameter loads. Experimental results indicate that at a pruning level of 1.5, mAP@0.5 and mAP@0.5:0.95 improved by 3.9% and 4.6%, respectively, while computational load decreased by 21% and parameter count dropped by 53%. The proposed MFD-YOLO(1.5) model achieves significant progress in detecting personal protective equipment on construction sites, with a substantial reduction in parameter count, making it suitable for deployment on resource-constrained edge devices.</p>","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 5","pages":"e0322115"},"PeriodicalIF":2.6000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12121810/pdf/","citationCount":"0","resultStr":"{\"title\":\"A deep learning-based algorithm for the detection of personal protective equipment.\",\"authors\":\"Bo Tong, Guan Li, Xiangli Bu, Yang Wang, Xingchen Yu\",\"doi\":\"10.1371/journal.pone.0322115\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Personal protective equipment (PPE) is critical for ensuring the safety of construction workers. However, site surveillance images from construction sites often feature multi-size and multi-scale targets, leading to low detection accuracy for PPE in existing models. To address this issue, this paper proposes an improved model based on YOLOv8n.By enriching feature diversity and enhancing the model's adaptability to geometric transformations, the detection accuracy is improved.A Multi-Scale Group Convolution Module (MSGP) was designed to extract multi-level features using different convolution kernels. A Multi-Scale Feature Diffusion Pyramid Network (MFDPN) was developed, which aggregates multi-scale features through the Multiscale Feature Focus (MFF) module and diffuses them across scales, providing each scale with detailed contextual information. A customized Task Alignment Module was introduced to integrate interactive features, optimizing both classification and localization tasks. The DCNV2(Deformable Convolutional Networks v2) module was incorporated to handle geometric scale transformations by generating spatial offsets and feature masks from interactive features, thereby improving localization accuracy and dynamically selecting weights to enhance classification precision.The improved model incorporates rich multi-level and multi-scale features, allowing it to better adapt to tasks involving geometric transformations and aligning with the image data distribution in construction scenarios. Additionally, structured pruning techniques were applied to the model at varying levels, further reducing computational and parameter loads. Experimental results indicate that at a pruning level of 1.5, mAP@0.5 and mAP@0.5:0.95 improved by 3.9% and 4.6%, respectively, while computational load decreased by 21% and parameter count dropped by 53%. The proposed MFD-YOLO(1.5) model achieves significant progress in detecting personal protective equipment on construction sites, with a substantial reduction in parameter count, making it suitable for deployment on resource-constrained edge devices.</p>\",\"PeriodicalId\":20189,\"journal\":{\"name\":\"PLoS ONE\",\"volume\":\"20 5\",\"pages\":\"e0322115\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12121810/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PLoS ONE\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1371/journal.pone.0322115\",\"RegionNum\":3,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0322115","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

个人防护装备(PPE)对于确保建筑工人的安全至关重要。然而,建筑工地监控图像往往具有多尺寸、多尺度的目标,导致现有模型对PPE的检测精度较低。针对这一问题,本文提出了一种基于YOLOv8n的改进模型。通过丰富特征多样性和增强模型对几何变换的适应性,提高了检测精度。设计了多尺度群卷积模块(MSGP),利用不同的卷积核提取多层次特征。提出了一种多尺度特征扩散金字塔网络(MFDPN),该网络通过多尺度特征焦点(MFF)模块聚合多尺度特征,并将其扩散到各个尺度,为每个尺度提供详细的上下文信息。引入了一个定制的任务对齐模块来集成交互功能,优化分类和定位任务。采用DCNV2(Deformable Convolutional Networks v2)模块,从交互特征中生成空间偏移和特征蒙版,处理几何尺度变换,从而提高定位精度,动态选择权值,提高分类精度。改进后的模型融合了丰富的多层次和多尺度特征,使其能够更好地适应涉及几何变换的任务,并与建筑场景中的图像数据分布对齐。此外,在不同层次上对模型应用结构化剪枝技术,进一步减少了计算和参数负荷。实验结果表明,当剪枝水平为1.5时,mAP@0.5:0.95和mAP@0.5:0.95分别提高了3.9%和4.6%,计算量减少了21%,参数数减少了53%。提出的MFD-YOLO(1.5)模型在检测建筑工地个人防护装备方面取得了重大进展,参数数量大幅减少,适合部署在资源受限的边缘设备上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A deep learning-based algorithm for the detection of personal protective equipment.

Personal protective equipment (PPE) is critical for ensuring the safety of construction workers. However, site surveillance images from construction sites often feature multi-size and multi-scale targets, leading to low detection accuracy for PPE in existing models. To address this issue, this paper proposes an improved model based on YOLOv8n.By enriching feature diversity and enhancing the model's adaptability to geometric transformations, the detection accuracy is improved.A Multi-Scale Group Convolution Module (MSGP) was designed to extract multi-level features using different convolution kernels. A Multi-Scale Feature Diffusion Pyramid Network (MFDPN) was developed, which aggregates multi-scale features through the Multiscale Feature Focus (MFF) module and diffuses them across scales, providing each scale with detailed contextual information. A customized Task Alignment Module was introduced to integrate interactive features, optimizing both classification and localization tasks. The DCNV2(Deformable Convolutional Networks v2) module was incorporated to handle geometric scale transformations by generating spatial offsets and feature masks from interactive features, thereby improving localization accuracy and dynamically selecting weights to enhance classification precision.The improved model incorporates rich multi-level and multi-scale features, allowing it to better adapt to tasks involving geometric transformations and aligning with the image data distribution in construction scenarios. Additionally, structured pruning techniques were applied to the model at varying levels, further reducing computational and parameter loads. Experimental results indicate that at a pruning level of 1.5, mAP@0.5 and mAP@0.5:0.95 improved by 3.9% and 4.6%, respectively, while computational load decreased by 21% and parameter count dropped by 53%. The proposed MFD-YOLO(1.5) model achieves significant progress in detecting personal protective equipment on construction sites, with a substantial reduction in parameter count, making it suitable for deployment on resource-constrained edge devices.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
PLoS ONE
PLoS ONE 生物-生物学
CiteScore
6.20
自引率
5.40%
发文量
14242
审稿时长
3.7 months
期刊介绍: PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides: * Open-access—freely accessible online, authors retain copyright * Fast publication times * Peer review by expert, practicing researchers * Post-publication tools to indicate quality and impact * Community-based dialogue on articles * Worldwide media coverage
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信