Wang Mo, Yan Ke, Qingao Huo, Ruyi Cao, Guochang Song, Wendong Zhang
{"title":"ADN: ATSS-based Deep Network for Pedestrian Detection on High-Resolution Images","authors":"Wang Mo, Yan Ke, Qingao Huo, Ruyi Cao, Guochang Song, Wendong Zhang","doi":"10.1109/NaNA56854.2022.00084","DOIUrl":null,"url":null,"abstract":"This paper investigates the high-resolution pedestrian detection network AND(ATSS-based Deep Network for Pedestrian Detection on High-Resolution Images) to solve the problem that high-resolution images are difficult to be processed directly as well as pedestrian occlusion and pose variations. First, it obtains a deeper backbone network by stacking residual modules to extract multi-level features, improve the extraction ability of occluded target features and avoid network degradation. Subsequently, the deformable convolution is introduced to optimize the backbone network and expand the local receptive field, thus further optimizing the detection ability of deformable targets and occluded targets. On the PANDA dataset, the AP, AP50, and AP75 of ADN are 3.5, 2.7, and 4.2, which are higher than that for the baseline respectively. Compared with other state-of-the-art methods, experiments show that ADN effectively enhances the accuracy of pedestrian detection in high-resolution Imgaes, and it outperforms existing object detection algorithms.","PeriodicalId":113743,"journal":{"name":"2022 International Conference on Networking and Network Applications (NaNA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Networking and Network Applications (NaNA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NaNA56854.2022.00084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper investigates the high-resolution pedestrian detection network AND(ATSS-based Deep Network for Pedestrian Detection on High-Resolution Images) to solve the problem that high-resolution images are difficult to be processed directly as well as pedestrian occlusion and pose variations. First, it obtains a deeper backbone network by stacking residual modules to extract multi-level features, improve the extraction ability of occluded target features and avoid network degradation. Subsequently, the deformable convolution is introduced to optimize the backbone network and expand the local receptive field, thus further optimizing the detection ability of deformable targets and occluded targets. On the PANDA dataset, the AP, AP50, and AP75 of ADN are 3.5, 2.7, and 4.2, which are higher than that for the baseline respectively. Compared with other state-of-the-art methods, experiments show that ADN effectively enhances the accuracy of pedestrian detection in high-resolution Imgaes, and it outperforms existing object detection algorithms.
为了解决高分辨率图像难以直接处理以及行人遮挡和位姿变化等问题,本文研究了基于atss的行人检测网络AND(Deep network for pedestrian detection on high-resolution Images)。首先,通过叠加残差模块提取多层次特征,得到更深层次的骨干网络,提高被遮挡目标特征的提取能力,避免网络退化;随后,引入可变形卷积优化骨干网络,扩展局部感受野,进一步优化可变形目标和闭塞目标的检测能力。在PANDA数据集上,ADN的AP值为3.5,AP50值为2.7,AP75值为4.2,均高于基线值。实验表明,与其他先进的方法相比,ADN有效地提高了高分辨率图像中行人检测的准确性,优于现有的目标检测算法。