{"title":"一种改进的快速RCNN行人检测方法","authors":"S. Panigrahi, U. Raju","doi":"10.1109/CAPS52117.2021.9730492","DOIUrl":null,"url":null,"abstract":"Pedestrian detection plays a pivotal role in applications such as robotics, automated driving, assistive living, and surveillance. The problem of pedestrian detection, although approached by many computer vision researchers is far from solved. The scale, pose, occlusion, illumination, and many such factors affect the performance of the methods. In this work, a modification of the most commonly used deep convolutional neural network model ResNet18 is proposed. The modified CNN structure forms the base of the Faster RCNN model utilized to predict the locations of pedestrians in the image. The proposed method has been improved in terms of the feature map extraction of the image. To evaluate the proposed method, two benchmark datasets INRIA Pedestrian and PASCAL VOC 2012 are considered. The performance metrics used for evaluation are Detection Error Trade-off and Precision-Recall Curve. A statistical analysis is also conducted. The proposed method is compared against state-of-the-art detection methods.","PeriodicalId":445427,"journal":{"name":"2021 International Conference on Control, Automation, Power and Signal Processing (CAPS)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An improved Faster RCNN for Pedestrian Detection\",\"authors\":\"S. Panigrahi, U. Raju\",\"doi\":\"10.1109/CAPS52117.2021.9730492\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Pedestrian detection plays a pivotal role in applications such as robotics, automated driving, assistive living, and surveillance. The problem of pedestrian detection, although approached by many computer vision researchers is far from solved. The scale, pose, occlusion, illumination, and many such factors affect the performance of the methods. In this work, a modification of the most commonly used deep convolutional neural network model ResNet18 is proposed. The modified CNN structure forms the base of the Faster RCNN model utilized to predict the locations of pedestrians in the image. The proposed method has been improved in terms of the feature map extraction of the image. To evaluate the proposed method, two benchmark datasets INRIA Pedestrian and PASCAL VOC 2012 are considered. The performance metrics used for evaluation are Detection Error Trade-off and Precision-Recall Curve. A statistical analysis is also conducted. The proposed method is compared against state-of-the-art detection methods.\",\"PeriodicalId\":445427,\"journal\":{\"name\":\"2021 International Conference on Control, Automation, Power and Signal Processing (CAPS)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Control, Automation, Power and Signal Processing (CAPS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CAPS52117.2021.9730492\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Control, Automation, Power and Signal Processing (CAPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAPS52117.2021.9730492","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Pedestrian detection plays a pivotal role in applications such as robotics, automated driving, assistive living, and surveillance. The problem of pedestrian detection, although approached by many computer vision researchers is far from solved. The scale, pose, occlusion, illumination, and many such factors affect the performance of the methods. In this work, a modification of the most commonly used deep convolutional neural network model ResNet18 is proposed. The modified CNN structure forms the base of the Faster RCNN model utilized to predict the locations of pedestrians in the image. The proposed method has been improved in terms of the feature map extraction of the image. To evaluate the proposed method, two benchmark datasets INRIA Pedestrian and PASCAL VOC 2012 are considered. The performance metrics used for evaluation are Detection Error Trade-off and Precision-Recall Curve. A statistical analysis is also conducted. The proposed method is compared against state-of-the-art detection methods.