{"title":"基于人级联网络的遮挡行人鲁棒检测","authors":"Zhewei Xu, Xiufeng Fu, Da Feng, Wei Li, Yang Liu","doi":"10.1109/cniot55862.2022.00045","DOIUrl":null,"url":null,"abstract":"Occlusion is a key challenge in real-world on-road pedestrian detection task. Due to constrained viewpoint geometry, a pedestrian is very likely to be obstructed by other pedestrians and/or other objects such as cars and bicycles. For Advanced Driver Assistance System (ADAS), heavily occluded pedestrians are as important as reasonable pedestrians because they may burst out from crowds or roadside obstacles. In this paper, a human-cascaded network is proposed for robust detection of heavily occluded pedestrians. Specifically, a sharp-response proposal network (SRPN) is designed to refine the feature responses in a narrow area to handle crowded pedestrian detection, followed by outputting the head and full body proposals for diverse occlusion situations. After RoI-pooling, a visible-guided attention (VGA) module is developed to leverage the head and visible area information. The VGA module also suppresses the feature noise of occluded area to enhance the feature representation learning of the backbone network. Finally, a head-cascade RCNN (HRCNN) network is proposed to predict the pedestrian bounding box from the head proposal. The proposed approach is validated through a widely used pedestrian detection dataset: CityPersons. Experimental results show that our approach achieves promising detection performance (log-average miss rate, MR) improvement of 11.4% on heavy occlusion subset, compared to the baseline detector.","PeriodicalId":251734,"journal":{"name":"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Human-Cascaded network for Robust Detection of Occluded Pedestrian\",\"authors\":\"Zhewei Xu, Xiufeng Fu, Da Feng, Wei Li, Yang Liu\",\"doi\":\"10.1109/cniot55862.2022.00045\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Occlusion is a key challenge in real-world on-road pedestrian detection task. Due to constrained viewpoint geometry, a pedestrian is very likely to be obstructed by other pedestrians and/or other objects such as cars and bicycles. For Advanced Driver Assistance System (ADAS), heavily occluded pedestrians are as important as reasonable pedestrians because they may burst out from crowds or roadside obstacles. In this paper, a human-cascaded network is proposed for robust detection of heavily occluded pedestrians. Specifically, a sharp-response proposal network (SRPN) is designed to refine the feature responses in a narrow area to handle crowded pedestrian detection, followed by outputting the head and full body proposals for diverse occlusion situations. After RoI-pooling, a visible-guided attention (VGA) module is developed to leverage the head and visible area information. The VGA module also suppresses the feature noise of occluded area to enhance the feature representation learning of the backbone network. Finally, a head-cascade RCNN (HRCNN) network is proposed to predict the pedestrian bounding box from the head proposal. The proposed approach is validated through a widely used pedestrian detection dataset: CityPersons. Experimental results show that our approach achieves promising detection performance (log-average miss rate, MR) improvement of 11.4% on heavy occlusion subset, compared to the baseline detector.\",\"PeriodicalId\":251734,\"journal\":{\"name\":\"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/cniot55862.2022.00045\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cniot55862.2022.00045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Human-Cascaded network for Robust Detection of Occluded Pedestrian
Occlusion is a key challenge in real-world on-road pedestrian detection task. Due to constrained viewpoint geometry, a pedestrian is very likely to be obstructed by other pedestrians and/or other objects such as cars and bicycles. For Advanced Driver Assistance System (ADAS), heavily occluded pedestrians are as important as reasonable pedestrians because they may burst out from crowds or roadside obstacles. In this paper, a human-cascaded network is proposed for robust detection of heavily occluded pedestrians. Specifically, a sharp-response proposal network (SRPN) is designed to refine the feature responses in a narrow area to handle crowded pedestrian detection, followed by outputting the head and full body proposals for diverse occlusion situations. After RoI-pooling, a visible-guided attention (VGA) module is developed to leverage the head and visible area information. The VGA module also suppresses the feature noise of occluded area to enhance the feature representation learning of the backbone network. Finally, a head-cascade RCNN (HRCNN) network is proposed to predict the pedestrian bounding box from the head proposal. The proposed approach is validated through a widely used pedestrian detection dataset: CityPersons. Experimental results show that our approach achieves promising detection performance (log-average miss rate, MR) improvement of 11.4% on heavy occlusion subset, compared to the baseline detector.