{"title":"Detecting and Counting People's Faces in Images Using Convolutional Neural Networks","authors":"Yehea al Atrash, Motaz Saad, I. H. Alshami","doi":"10.1109/PICICT53635.2021.00031","DOIUrl":null,"url":null,"abstract":"Computer Vision (CV) has so many applications such as but not limited to object recognition, which is a collection of computer vision tasks that involves identifying objects in images. One of CV applications is People counting, and it is useful for automatically counting the number of persons in a class, or a ceremony, or an event. People counting is based on face detection is a challenging task and still an open problem in computer vision. This research investigates two object detection models for detecting and counting people's faces. The first model is based on Faster-RCNN and the second one is based on SSD. These models are deep neural networks that are trained on object detection tasks. In this work, we train Faster-RCNN and SSD models on Wider-Face dataset, which is composed of faces in a variety of conditions relating to occlusion, illumination, expression, pose and scale. The evaluation result on the test part of the wider face dataset is 0.5 of accuracy for Faster-RCNN and SSD, also the Mean Relative Error for the Faster-RCNN is 0.3 and the SSD is 0.4. The Mean Absolute Error for the Faster-RCNN is 7.5 and the SSD is 8.6.","PeriodicalId":308869,"journal":{"name":"2021 Palestinian International Conference on Information and Communication Technology (PICICT)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Palestinian International Conference on Information and Communication Technology (PICICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PICICT53635.2021.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Computer Vision (CV) has so many applications such as but not limited to object recognition, which is a collection of computer vision tasks that involves identifying objects in images. One of CV applications is People counting, and it is useful for automatically counting the number of persons in a class, or a ceremony, or an event. People counting is based on face detection is a challenging task and still an open problem in computer vision. This research investigates two object detection models for detecting and counting people's faces. The first model is based on Faster-RCNN and the second one is based on SSD. These models are deep neural networks that are trained on object detection tasks. In this work, we train Faster-RCNN and SSD models on Wider-Face dataset, which is composed of faces in a variety of conditions relating to occlusion, illumination, expression, pose and scale. The evaluation result on the test part of the wider face dataset is 0.5 of accuracy for Faster-RCNN and SSD, also the Mean Relative Error for the Faster-RCNN is 0.3 and the SSD is 0.4. The Mean Absolute Error for the Faster-RCNN is 7.5 and the SSD is 8.6.