基于YOLOv4的大规模检测算法及应用

Xiangbin Shi, Jinwen Peng
{"title":"基于YOLOv4的大规模检测算法及应用","authors":"Xiangbin Shi, Jinwen Peng","doi":"10.1109/CSE53436.2021.00026","DOIUrl":null,"url":null,"abstract":"In this paper, we addressed the problems of occlusion and crowding in large-scale object detection. First, large-scale detection is more complex and diverse than traditional object detection. The number of targets to be detected is larger and often clustered together. This will produce occlusion and dense detection problems, which brings a serious challenge to object detection. Secondly, current dominant object detection is rarely trained and inferred on large-scale labeled dataset, so it is unable to evaluate the performance of these detection models on large dataset. To solve the above problems, we propose L-YOLO large-scale object detection algorithm. We modified the structure of feature pyramid network, then the receptive field was increased by using four-scale detection. Next, we propose a new loss function designed specifically for large-scale scenarios, which keeps the prediction box that is not the target as far away from the target as possible. It prevents the fusion of adjacent boundary boxes in the inference process and improves the detection performance in the case of occlusion effectively. At last, we use a new non-maximum suppression rule to prevent suppression of the correct detection box during infer. We annotated new dataset for large-scale detection, retrained and evaluated our model. Experiments on our dataset show the superiority of our model. Compared to the original YOLOv4, our improved model increases 1.8% mAP.","PeriodicalId":6838,"journal":{"name":"2021 IEEE 24th International Conference on Computational Science and Engineering (CSE)","volume":"54 1","pages":"116-122"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Large-scale Detection Algorithm and Application Based on YOLOv4\",\"authors\":\"Xiangbin Shi, Jinwen Peng\",\"doi\":\"10.1109/CSE53436.2021.00026\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we addressed the problems of occlusion and crowding in large-scale object detection. First, large-scale detection is more complex and diverse than traditional object detection. The number of targets to be detected is larger and often clustered together. This will produce occlusion and dense detection problems, which brings a serious challenge to object detection. Secondly, current dominant object detection is rarely trained and inferred on large-scale labeled dataset, so it is unable to evaluate the performance of these detection models on large dataset. To solve the above problems, we propose L-YOLO large-scale object detection algorithm. We modified the structure of feature pyramid network, then the receptive field was increased by using four-scale detection. Next, we propose a new loss function designed specifically for large-scale scenarios, which keeps the prediction box that is not the target as far away from the target as possible. It prevents the fusion of adjacent boundary boxes in the inference process and improves the detection performance in the case of occlusion effectively. At last, we use a new non-maximum suppression rule to prevent suppression of the correct detection box during infer. We annotated new dataset for large-scale detection, retrained and evaluated our model. Experiments on our dataset show the superiority of our model. Compared to the original YOLOv4, our improved model increases 1.8% mAP.\",\"PeriodicalId\":6838,\"journal\":{\"name\":\"2021 IEEE 24th International Conference on Computational Science and Engineering (CSE)\",\"volume\":\"54 1\",\"pages\":\"116-122\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 24th International Conference on Computational Science and Engineering (CSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSE53436.2021.00026\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 24th International Conference on Computational Science and Engineering (CSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSE53436.2021.00026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在本文中,我们解决了大规模目标检测中的遮挡和拥挤问题。首先,与传统的目标检测相比,大规模检测更加复杂和多样。需要检测的目标数量较大,并且经常聚集在一起。这会产生遮挡和密集检测问题,给目标检测带来严峻的挑战。其次,目前的主导目标检测很少在大规模标记数据集上进行训练和推断,因此无法评估这些检测模型在大型数据集上的性能。为了解决上述问题,我们提出了L-YOLO大规模目标检测算法。首先对特征金字塔网络结构进行修正,然后采用四尺度检测增加接收野。接下来,我们提出了一种专门为大规模场景设计的新的损失函数,使非目标的预测框尽可能远离目标。在推理过程中防止了相邻边界框的融合,有效提高了遮挡情况下的检测性能。最后,我们采用了一种新的非最大抑制规则来防止正确检测框在推断过程中被抑制。我们为大规模检测标注了新的数据集,重新训练和评估了我们的模型。在我们的数据集上的实验表明了我们模型的优越性。与原来的YOLOv4相比,我们改进的模型增加了1.8%的mAP。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Large-scale Detection Algorithm and Application Based on YOLOv4
In this paper, we addressed the problems of occlusion and crowding in large-scale object detection. First, large-scale detection is more complex and diverse than traditional object detection. The number of targets to be detected is larger and often clustered together. This will produce occlusion and dense detection problems, which brings a serious challenge to object detection. Secondly, current dominant object detection is rarely trained and inferred on large-scale labeled dataset, so it is unable to evaluate the performance of these detection models on large dataset. To solve the above problems, we propose L-YOLO large-scale object detection algorithm. We modified the structure of feature pyramid network, then the receptive field was increased by using four-scale detection. Next, we propose a new loss function designed specifically for large-scale scenarios, which keeps the prediction box that is not the target as far away from the target as possible. It prevents the fusion of adjacent boundary boxes in the inference process and improves the detection performance in the case of occlusion effectively. At last, we use a new non-maximum suppression rule to prevent suppression of the correct detection box during infer. We annotated new dataset for large-scale detection, retrained and evaluated our model. Experiments on our dataset show the superiority of our model. Compared to the original YOLOv4, our improved model increases 1.8% mAP.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信