基于多级细粒度YOLOX算法的行人检测研究

Hong Wang, Yonggui Xie, Shasha Tian, Lu Zheng, Xiaojie Dong, Yueli Zhu
{"title":"基于多级细粒度YOLOX算法的行人检测研究","authors":"Hong Wang, Yonggui Xie, Shasha Tian, Lu Zheng, Xiaojie Dong, Yueli Zhu","doi":"10.1108/ijicc-05-2022-0161","DOIUrl":null,"url":null,"abstract":"PurposeThe purpose of the study is to address the problems of low accuracy and missed detection of occluded pedestrians and small target pedestrians when using the YOLOX general object detection algorithm for pedestrian detection. This study proposes a multi-level fine-grained YOLOX pedestrian detection algorithm.Design/methodology/approachFirst, to address the problem of the original YOLOX algorithm in obtaining a single perceptual field for the feature map before feature fusion, this study improves the PAFPN structure by adding the ResCoT module to increase the diversity of the perceptual field of the feature map and divides the pedestrian multi-scale features into finer granularity. Second, for the CSPLayer of the PAFPN, a weight gain-based normalization-based attention module (NAM) is proposed to make the model pay more attention to the context information when extracting pedestrian features and highlight the salient features of pedestrians. Finally, the authors experimentally determined the optimal values for the confidence loss function.FindingsThe experimental results show that, compared with the original YOLOX algorithm, the AP of the improved algorithm increased by 2.90%, the Recall increased by 3.57%, and F1 increased by 2% on the pedestrian dataset.Research limitations/implicationsThe multi-level fine-grained YOLOX pedestrian detection algorithm can effectively improve the detection of occluded pedestrians and small target pedestrians.Originality/valueThe authors introduce a multi-level fine-grained ResCoT module and a weight gain-based NAM attention module.","PeriodicalId":352072,"journal":{"name":"Int. J. Intell. Comput. Cybern.","volume":"171 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on pedestrian detection based on multi-level fine-grained YOLOX algorithm\",\"authors\":\"Hong Wang, Yonggui Xie, Shasha Tian, Lu Zheng, Xiaojie Dong, Yueli Zhu\",\"doi\":\"10.1108/ijicc-05-2022-0161\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"PurposeThe purpose of the study is to address the problems of low accuracy and missed detection of occluded pedestrians and small target pedestrians when using the YOLOX general object detection algorithm for pedestrian detection. This study proposes a multi-level fine-grained YOLOX pedestrian detection algorithm.Design/methodology/approachFirst, to address the problem of the original YOLOX algorithm in obtaining a single perceptual field for the feature map before feature fusion, this study improves the PAFPN structure by adding the ResCoT module to increase the diversity of the perceptual field of the feature map and divides the pedestrian multi-scale features into finer granularity. Second, for the CSPLayer of the PAFPN, a weight gain-based normalization-based attention module (NAM) is proposed to make the model pay more attention to the context information when extracting pedestrian features and highlight the salient features of pedestrians. Finally, the authors experimentally determined the optimal values for the confidence loss function.FindingsThe experimental results show that, compared with the original YOLOX algorithm, the AP of the improved algorithm increased by 2.90%, the Recall increased by 3.57%, and F1 increased by 2% on the pedestrian dataset.Research limitations/implicationsThe multi-level fine-grained YOLOX pedestrian detection algorithm can effectively improve the detection of occluded pedestrians and small target pedestrians.Originality/valueThe authors introduce a multi-level fine-grained ResCoT module and a weight gain-based NAM attention module.\",\"PeriodicalId\":352072,\"journal\":{\"name\":\"Int. J. Intell. Comput. Cybern.\",\"volume\":\"171 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Intell. Comput. Cybern.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1108/ijicc-05-2022-0161\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Intell. Comput. Cybern.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/ijicc-05-2022-0161","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本研究的目的是为了解决使用YOLOX通用目标检测算法进行行人检测时存在的对遮挡行人和小目标行人检测准确率低、漏检的问题。本研究提出了一种多级细粒度的YOLOX行人检测算法。首先,针对原有YOLOX算法在特征融合前获取特征图单一感知场的问题,本研究通过增加ResCoT模块对PAFPN结构进行改进,增加特征图感知场的多样性,并对行人多尺度特征进行更细粒度的划分。其次,针对PAFPN的CSPLayer,提出了基于权值增益的归一化关注模块(NAM),使模型在提取行人特征时更加关注上下文信息,突出行人的显著特征;最后,通过实验确定了置信损失函数的最优值。实验结果表明,与原始YOLOX算法相比,改进算法在行人数据集上的AP提高了2.90%,Recall提高了3.57%,F1提高了2%。研究局限性/意义多级细粒度YOLOX行人检测算法可以有效提高对遮挡行人和小目标行人的检测。作者介绍了一个多级细粒度的ResCoT模块和一个基于权重增加的NAM注意力模块。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Research on pedestrian detection based on multi-level fine-grained YOLOX algorithm
PurposeThe purpose of the study is to address the problems of low accuracy and missed detection of occluded pedestrians and small target pedestrians when using the YOLOX general object detection algorithm for pedestrian detection. This study proposes a multi-level fine-grained YOLOX pedestrian detection algorithm.Design/methodology/approachFirst, to address the problem of the original YOLOX algorithm in obtaining a single perceptual field for the feature map before feature fusion, this study improves the PAFPN structure by adding the ResCoT module to increase the diversity of the perceptual field of the feature map and divides the pedestrian multi-scale features into finer granularity. Second, for the CSPLayer of the PAFPN, a weight gain-based normalization-based attention module (NAM) is proposed to make the model pay more attention to the context information when extracting pedestrian features and highlight the salient features of pedestrians. Finally, the authors experimentally determined the optimal values for the confidence loss function.FindingsThe experimental results show that, compared with the original YOLOX algorithm, the AP of the improved algorithm increased by 2.90%, the Recall increased by 3.57%, and F1 increased by 2% on the pedestrian dataset.Research limitations/implicationsThe multi-level fine-grained YOLOX pedestrian detection algorithm can effectively improve the detection of occluded pedestrians and small target pedestrians.Originality/valueThe authors introduce a multi-level fine-grained ResCoT module and a weight gain-based NAM attention module.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信