基于区域卷积神经网络的蒙面检测

2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS) Pub Date : 2020-11-26 DOI:10.1109/ICIIS51140.2020.9342737

Jenil Gathani, Krish Shah

{"title":"基于区域卷积神经网络的蒙面检测","authors":"Jenil Gathani, Krish Shah","doi":"10.1109/ICIIS51140.2020.9342737","DOIUrl":null,"url":null,"abstract":"Precisely detecting masked and non-masked faces are increasingly important since wearing a face mask is an effective measure to prevent the spread of the COVID-19 pandemic. Previous literature focused mainly on occluded face detection or facial expression, which were unsuitable for the application of detecting human faces wearing masks. Hence, to overcome the issue, this work proposes a Convolutional Neural Network-based model that uses region proposals to detect masked and nonmasked faces. The depth of the Convolutional Neural Network was increased by using residual skip-connections. The model was implemented using TensorFlow Object Detection API and was pre-trained over the COCO dataset. A secondary outcome of the paper was to collect a dataset of high resolution masked and nonmasked faces for training deep learning frameworks, due to the lack of datasets available for this task. The proposed model was compared with the SSD Inception V2 model [1] and the SSD MobileNet V2 model [2] in the context of detection accuracy (mAP). The experiments highlight that the proposed framework achieves a detection accuracy (total mAP) of 85.82%, on the collected dataset. The results are significantly better in detecting non-masked faces with a detection accuracy (mAP) of 98.61% while it is 68.72% accurate in detecting masked faces. Based on this model’s performance on standard parameters, a detailed study is outlined along with the conclusion and future plan of action.","PeriodicalId":352858,"journal":{"name":"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Detecting Masked Faces using Region-based Convolutional Neural Network\",\"authors\":\"Jenil Gathani, Krish Shah\",\"doi\":\"10.1109/ICIIS51140.2020.9342737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Precisely detecting masked and non-masked faces are increasingly important since wearing a face mask is an effective measure to prevent the spread of the COVID-19 pandemic. Previous literature focused mainly on occluded face detection or facial expression, which were unsuitable for the application of detecting human faces wearing masks. Hence, to overcome the issue, this work proposes a Convolutional Neural Network-based model that uses region proposals to detect masked and nonmasked faces. The depth of the Convolutional Neural Network was increased by using residual skip-connections. The model was implemented using TensorFlow Object Detection API and was pre-trained over the COCO dataset. A secondary outcome of the paper was to collect a dataset of high resolution masked and nonmasked faces for training deep learning frameworks, due to the lack of datasets available for this task. The proposed model was compared with the SSD Inception V2 model [1] and the SSD MobileNet V2 model [2] in the context of detection accuracy (mAP). The experiments highlight that the proposed framework achieves a detection accuracy (total mAP) of 85.82%, on the collected dataset. The results are significantly better in detecting non-masked faces with a detection accuracy (mAP) of 98.61% while it is 68.72% accurate in detecting masked faces. Based on this model’s performance on standard parameters, a detailed study is outlined along with the conclusion and future plan of action.\",\"PeriodicalId\":352858,\"journal\":{\"name\":\"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)\",\"volume\":\"49 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIIS51140.2020.9342737\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIS51140.2020.9342737","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

佩戴口罩是防止新冠肺炎疫情传播的有效措施，因此准确识别口罩和非口罩变得越来越重要。以往的文献主要集中在遮挡人脸检测或面部表情方面，不适合用于检测戴口罩的人脸。因此，为了克服这个问题，这项工作提出了一个基于卷积神经网络的模型，该模型使用区域建议来检测被屏蔽和非被屏蔽的人脸。利用残差跳跃连接增加了卷积神经网络的深度。该模型使用TensorFlow对象检测API实现，并在COCO数据集上进行预训练。由于缺乏可用于该任务的数据集，本文的第二个结果是收集用于训练深度学习框架的高分辨率遮罩和非遮罩人脸数据集。将提出的模型与SSD Inception V2模型[1]和SSD MobileNet V2模型[2]在检测精度(mAP)方面进行比较。实验结果表明，该框架在采集的数据集上的检测准确率(total mAP)达到85.82%。结果表明，该方法对非蒙面人脸的检测准确率为98.61%，对蒙面人脸的检测准确率为68.72%。根据该模型在标准参数下的性能，对其进行了详细的研究，并给出了结论和未来的行动计划。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Detecting Masked Faces using Region-based Convolutional Neural Network

Precisely detecting masked and non-masked faces are increasingly important since wearing a face mask is an effective measure to prevent the spread of the COVID-19 pandemic. Previous literature focused mainly on occluded face detection or facial expression, which were unsuitable for the application of detecting human faces wearing masks. Hence, to overcome the issue, this work proposes a Convolutional Neural Network-based model that uses region proposals to detect masked and nonmasked faces. The depth of the Convolutional Neural Network was increased by using residual skip-connections. The model was implemented using TensorFlow Object Detection API and was pre-trained over the COCO dataset. A secondary outcome of the paper was to collect a dataset of high resolution masked and nonmasked faces for training deep learning frameworks, due to the lack of datasets available for this task. The proposed model was compared with the SSD Inception V2 model [1] and the SSD MobileNet V2 model [2] in the context of detection accuracy (mAP). The experiments highlight that the proposed framework achieves a detection accuracy (total mAP) of 85.82%, on the collected dataset. The results are significantly better in detecting non-masked faces with a detection accuracy (mAP) of 98.61% while it is 68.72% accurate in detecting masked faces. Based on this model’s performance on standard parameters, a detailed study is outlined along with the conclusion and future plan of action.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)

自引率

0.00%

发文量