A Deep Learning Approach for Face Detection using YOLO

2018 IEEE Punecon Pub Date : 2018-11-01 DOI:10.1109/PUNECON.2018.8745376

Dweepna Garg, Parth Goel, Sharnil Pandya, Amit Ganatra, K. Kotecha

{"title":"A Deep Learning Approach for Face Detection using YOLO","authors":"Dweepna Garg, Parth Goel, Sharnil Pandya, Amit Ganatra, K. Kotecha","doi":"10.1109/PUNECON.2018.8745376","DOIUrl":null,"url":null,"abstract":"Deep learning is nowadays a buzzword and is considered a new era of machine learning which trains the computers in finding the pattern from a massive amount of data. It mainly describes the learning at multiple levels of representation which helps to make sense on the data consisting of text, sound and images. Many organizations are using a type of deep learning known as a convolutional neural network to deal with the objects in a video sequence. Deep Convolution Neural Networks (CNNs) have proved to be impressive in terms of performance for detecting the objects, classification of images and semantic segmentation. Object detection is defined as a combination of classification and localization. Face detection is one of the most challenging problems of pattern recognition. Various face related applications like face verification, facial recognition, clustering of face etc. are a part of face detection. Effective training needs to be carried out for detection and recognition. The accuracy in face detection using the traditional approach did not yield a good result. This paper focuses on improving the accuracy of detecting the face using the model of deep learning. YOLO (You only look once), a popular deep learning library is used to implement the proposed work. The paper compares the accuracy of detecting the face in an efficient manner with respect to the traditional approach. The proposed model uses the convolutional neural network as an approach of deep learning for detecting faces from videos. The FDDB dataset is used for training and testing of our model. A model is fine-tuned on various performance parameters and the best suitable values are taken into consideration. It is also compared the execution of training time and the performance of the model on two different GPUs.","PeriodicalId":166677,"journal":{"name":"2018 IEEE Punecon","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"57","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Punecon","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PUNECON.2018.8745376","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 57

Abstract

Deep learning is nowadays a buzzword and is considered a new era of machine learning which trains the computers in finding the pattern from a massive amount of data. It mainly describes the learning at multiple levels of representation which helps to make sense on the data consisting of text, sound and images. Many organizations are using a type of deep learning known as a convolutional neural network to deal with the objects in a video sequence. Deep Convolution Neural Networks (CNNs) have proved to be impressive in terms of performance for detecting the objects, classification of images and semantic segmentation. Object detection is defined as a combination of classification and localization. Face detection is one of the most challenging problems of pattern recognition. Various face related applications like face verification, facial recognition, clustering of face etc. are a part of face detection. Effective training needs to be carried out for detection and recognition. The accuracy in face detection using the traditional approach did not yield a good result. This paper focuses on improving the accuracy of detecting the face using the model of deep learning. YOLO (You only look once), a popular deep learning library is used to implement the proposed work. The paper compares the accuracy of detecting the face in an efficient manner with respect to the traditional approach. The proposed model uses the convolutional neural network as an approach of deep learning for detecting faces from videos. The FDDB dataset is used for training and testing of our model. A model is fine-tuned on various performance parameters and the best suitable values are taken into consideration. It is also compared the execution of training time and the performance of the model on two different GPUs.

查看原文本刊更多论文

一种基于YOLO的人脸检测深度学习方法

深度学习现在是一个流行词，被认为是机器学习的新时代，它训练计算机从大量数据中找到模式。它主要描述了多层表示的学习，这有助于对由文本、声音和图像组成的数据进行理解。许多组织正在使用一种称为卷积神经网络的深度学习来处理视频序列中的对象。深度卷积神经网络(cnn)在物体检测、图像分类和语义分割方面的表现令人印象深刻。目标检测被定义为分类和定位的结合。人脸检测是模式识别中最具挑战性的问题之一。各种与人脸相关的应用，如人脸验证、人脸识别、人脸聚类等，都是人脸检测的一部分。需要进行有效的培训以进行检测和识别。传统的人脸检测方法准确率不高。本文主要研究利用深度学习模型提高人脸检测的准确性。YOLO(你只看一次)，一个流行的深度学习库被用来实现提议的工作。本文比较了高效人脸检测方法与传统人脸检测方法的准确率。该模型使用卷积神经网络作为深度学习方法从视频中检测人脸。FDDB数据集用于训练和测试我们的模型。根据各种性能参数对模型进行微调，并考虑最合适的值。并比较了该模型在两种不同gpu上的训练执行时间和性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE Punecon

自引率

0.00%

发文量