基于语义分割的人脸检测

2019 4th International Conference on Computing, Communications and Security (ICCCS) Pub Date : 2019-10-01 DOI:10.1109/CCCS.2019.8888092

T. Meenpal, Ashutosh Balakrishnan, Amit Verma

{"title":"基于语义分割的人脸检测","authors":"T. Meenpal, Ashutosh Balakrishnan, Amit Verma","doi":"10.1109/CCCS.2019.8888092","DOIUrl":null,"url":null,"abstract":"Face Detection has evolved as a very popular problem in Image processing and Computer Vision. Many new algorithms are being devised using convolutional architectures to make the algorithm as accurate as possible. These convolutional architectures have made it possible to extract even the pixel details. We aim to design a binary face classifier which can detect any face present in the frame irrespective of its alignment. We present a method to generate accurate face segmentation masks from any arbitrary size input image. Beginning from the RGB image of any size, the method uses Predefined Training Weights of VGG – 16 Architecture for feature extraction. Training is performed through Fully Convolutional Networks to semantically segment out the faces present in that image. Gradient Descent is used for training while Binomial Cross Entropy is used as a loss function. Further the output image from the FCN is processed to remove the unwanted noise and avoid the false predictions if any and make bounding box around the faces. Furthermore, proposed model has also shown great results in recognizing non-frontal faces. Along with this it is also able to detect multiple facial masks in a single frame. Experiments were performed on Multi Parsing Human Dataset obtaining mean pixel level accuracy of 93.884 % for the segmented face masks.","PeriodicalId":152148,"journal":{"name":"2019 4th International Conference on Computing, Communications and Security (ICCCS)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"90","resultStr":"{\"title\":\"Facial Mask Detection using Semantic Segmentation\",\"authors\":\"T. Meenpal, Ashutosh Balakrishnan, Amit Verma\",\"doi\":\"10.1109/CCCS.2019.8888092\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Face Detection has evolved as a very popular problem in Image processing and Computer Vision. Many new algorithms are being devised using convolutional architectures to make the algorithm as accurate as possible. These convolutional architectures have made it possible to extract even the pixel details. We aim to design a binary face classifier which can detect any face present in the frame irrespective of its alignment. We present a method to generate accurate face segmentation masks from any arbitrary size input image. Beginning from the RGB image of any size, the method uses Predefined Training Weights of VGG – 16 Architecture for feature extraction. Training is performed through Fully Convolutional Networks to semantically segment out the faces present in that image. Gradient Descent is used for training while Binomial Cross Entropy is used as a loss function. Further the output image from the FCN is processed to remove the unwanted noise and avoid the false predictions if any and make bounding box around the faces. Furthermore, proposed model has also shown great results in recognizing non-frontal faces. Along with this it is also able to detect multiple facial masks in a single frame. Experiments were performed on Multi Parsing Human Dataset obtaining mean pixel level accuracy of 93.884 % for the segmented face masks.\",\"PeriodicalId\":152148,\"journal\":{\"name\":\"2019 4th International Conference on Computing, Communications and Security (ICCCS)\",\"volume\":\"73 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"90\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 4th International Conference on Computing, Communications and Security (ICCCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCCS.2019.8888092\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 4th International Conference on Computing, Communications and Security (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCCS.2019.8888092","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 90

摘要

人脸检测已经发展成为图像处理和计算机视觉领域的一个非常热门的问题。许多新的算法正在使用卷积架构来设计，以使算法尽可能准确。这些卷积架构使得提取像素细节成为可能。我们的目标是设计一个二值人脸分类器，它可以检测出帧中存在的任何人脸，而不考虑其对齐方式。提出了一种从任意大小的输入图像中生成准确的人脸分割蒙版的方法。该方法从任意大小的RGB图像入手，采用VGG - 16体系结构的预定义训练权值进行特征提取。训练是通过全卷积网络进行的，以在语义上分割出该图像中存在的面部。使用梯度下降进行训练，使用二项交叉熵作为损失函数。进一步对FCN输出图像进行处理，去除不必要的噪声，避免错误的预测(如果有的话)，并在人脸周围制作边界框。此外，该模型在识别非正面人脸方面也取得了很好的效果。除此之外，它还能够在单个帧中检测多个面部面具。在多解析人类数据集上进行实验，得到分割后的人脸平均像素级准确率为93.884%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Facial Mask Detection using Semantic Segmentation

Face Detection has evolved as a very popular problem in Image processing and Computer Vision. Many new algorithms are being devised using convolutional architectures to make the algorithm as accurate as possible. These convolutional architectures have made it possible to extract even the pixel details. We aim to design a binary face classifier which can detect any face present in the frame irrespective of its alignment. We present a method to generate accurate face segmentation masks from any arbitrary size input image. Beginning from the RGB image of any size, the method uses Predefined Training Weights of VGG – 16 Architecture for feature extraction. Training is performed through Fully Convolutional Networks to semantically segment out the faces present in that image. Gradient Descent is used for training while Binomial Cross Entropy is used as a loss function. Further the output image from the FCN is processed to remove the unwanted noise and avoid the false predictions if any and make bounding box around the faces. Furthermore, proposed model has also shown great results in recognizing non-frontal faces. Along with this it is also able to detect multiple facial masks in a single frame. Experiments were performed on Multi Parsing Human Dataset obtaining mean pixel level accuracy of 93.884 % for the segmented face masks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 4th International Conference on Computing, Communications and Security (ICCCS)

自引率

0.00%

发文量