{"title":"改进的Yolov4网络结构行人检测算法","authors":"Xiujun Zhu, Yujie Bai, Yijian Pei","doi":"10.1145/3487075.3487088","DOIUrl":null,"url":null,"abstract":"When the YOLOV4 network detects pedestrians alone, the small target pedestrians will be missed, resulting in the reduction of P (Precision) and AP (Average Precision) values. This paper improves the YOLOV4 network structure. In order to improve the feature extraction capability of the network for small targets, a shallower feature layer is added to the original three output feature layers of the YOLOV4 backbone network to build PANet (Path Aggregation Network) together. And two SPP (Spatial Pyramid Pooling) structures are added to expand the receptive field. The channel attention mechanism module is added and some convolutional layers of the original network are deleted. Finally, transfer learning is used to make the detection effect better. The P value of the pedestrian on the PASCAL VOC data set increased from 84.43% to 91.37%, and the AP value increased from 74.78% to 87.39%, and the P value on the commonly used pedestrian detection data set INRIA (INRIA Person Dataset) increased from 93.20% increased to 98.02%, AP value increased from 91.08% to 94.02%. Experimental results show that the network has a better effect on pedestrian detection, and the accuracy and average precision are improved.","PeriodicalId":354966,"journal":{"name":"Proceedings of the 5th International Conference on Computer Science and Application Engineering","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved Pedestrian Detection Algorithm of Yolov4 Network Structure\",\"authors\":\"Xiujun Zhu, Yujie Bai, Yijian Pei\",\"doi\":\"10.1145/3487075.3487088\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"When the YOLOV4 network detects pedestrians alone, the small target pedestrians will be missed, resulting in the reduction of P (Precision) and AP (Average Precision) values. This paper improves the YOLOV4 network structure. In order to improve the feature extraction capability of the network for small targets, a shallower feature layer is added to the original three output feature layers of the YOLOV4 backbone network to build PANet (Path Aggregation Network) together. And two SPP (Spatial Pyramid Pooling) structures are added to expand the receptive field. The channel attention mechanism module is added and some convolutional layers of the original network are deleted. Finally, transfer learning is used to make the detection effect better. The P value of the pedestrian on the PASCAL VOC data set increased from 84.43% to 91.37%, and the AP value increased from 74.78% to 87.39%, and the P value on the commonly used pedestrian detection data set INRIA (INRIA Person Dataset) increased from 93.20% increased to 98.02%, AP value increased from 91.08% to 94.02%. Experimental results show that the network has a better effect on pedestrian detection, and the accuracy and average precision are improved.\",\"PeriodicalId\":354966,\"journal\":{\"name\":\"Proceedings of the 5th International Conference on Computer Science and Application Engineering\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th International Conference on Computer Science and Application Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3487075.3487088\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Computer Science and Application Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3487075.3487088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved Pedestrian Detection Algorithm of Yolov4 Network Structure
When the YOLOV4 network detects pedestrians alone, the small target pedestrians will be missed, resulting in the reduction of P (Precision) and AP (Average Precision) values. This paper improves the YOLOV4 network structure. In order to improve the feature extraction capability of the network for small targets, a shallower feature layer is added to the original three output feature layers of the YOLOV4 backbone network to build PANet (Path Aggregation Network) together. And two SPP (Spatial Pyramid Pooling) structures are added to expand the receptive field. The channel attention mechanism module is added and some convolutional layers of the original network are deleted. Finally, transfer learning is used to make the detection effect better. The P value of the pedestrian on the PASCAL VOC data set increased from 84.43% to 91.37%, and the AP value increased from 74.78% to 87.39%, and the P value on the commonly used pedestrian detection data set INRIA (INRIA Person Dataset) increased from 93.20% increased to 98.02%, AP value increased from 91.08% to 94.02%. Experimental results show that the network has a better effect on pedestrian detection, and the accuracy and average precision are improved.