基于MWIR和可见光波段的深度人脸检测模型设计

2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) Pub Date : 2019-08-01 DOI:10.1145/3341161.3343528

Suha Reddy Mokalla, T. Bourlai

{"title":"基于MWIR和可见光波段的深度人脸检测模型设计","authors":"Suha Reddy Mokalla, T. Bourlai","doi":"10.1145/3341161.3343528","DOIUrl":null,"url":null,"abstract":"In this work, we propose an optimal solution for face detection when operating in the thermal and visible bands. Our aim is to train, fine tune, optimize and validate preexisting object detection models using thermal and visible data separately. Thus, we perform an empirical study to determine the most efficient band specific DeepFace detection model in terms of detection performance. The original object detection models that were selected for our study are the Faster R-CNN (Region based Convolutional Neural Network), SSD (Single-shot Multi-Box Detector) and R-FCN (Region-based Fully Convolutional Network). Also, the dual-band dataset used for this work is composed of two challenging MWIR and visible band face datasets, where the faces were captured under variable conditions, i.e. indoors, outdoors, different standoff distances (5 and 10 meters) and poses. Experimental results show that the proposed detection model yields the highest accuracy independent of the band and scenario used. Specifically, we show that a modified and tuned Faster R-CNN architecture with ResNet 101 is the most promising model when compared to all the other models tested. The proposed model yields accuracy of 99.2% and 98.4% when tested on thermal and visible face data respectively. Finally, while the proposed model is relatively slower than its competitors, our further experiments show that the speed of this network can be increased by reducing the number of proposals in RPN (Region Proposal Network), and thus, the computational complexity challenge is significantly minimized.","PeriodicalId":403360,"journal":{"name":"2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"On Designing MWIR and Visible Band based DeepFace Detection Models\",\"authors\":\"Suha Reddy Mokalla, T. Bourlai\",\"doi\":\"10.1145/3341161.3343528\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, we propose an optimal solution for face detection when operating in the thermal and visible bands. Our aim is to train, fine tune, optimize and validate preexisting object detection models using thermal and visible data separately. Thus, we perform an empirical study to determine the most efficient band specific DeepFace detection model in terms of detection performance. The original object detection models that were selected for our study are the Faster R-CNN (Region based Convolutional Neural Network), SSD (Single-shot Multi-Box Detector) and R-FCN (Region-based Fully Convolutional Network). Also, the dual-band dataset used for this work is composed of two challenging MWIR and visible band face datasets, where the faces were captured under variable conditions, i.e. indoors, outdoors, different standoff distances (5 and 10 meters) and poses. Experimental results show that the proposed detection model yields the highest accuracy independent of the band and scenario used. Specifically, we show that a modified and tuned Faster R-CNN architecture with ResNet 101 is the most promising model when compared to all the other models tested. The proposed model yields accuracy of 99.2% and 98.4% when tested on thermal and visible face data respectively. Finally, while the proposed model is relatively slower than its competitors, our further experiments show that the speed of this network can be increased by reducing the number of proposals in RPN (Region Proposal Network), and thus, the computational complexity challenge is significantly minimized.\",\"PeriodicalId\":403360,\"journal\":{\"name\":\"2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3341161.3343528\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3341161.3343528","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

在这项工作中，我们提出了一个在热波段和可见光波段进行人脸检测的最佳解决方案。我们的目标是分别使用热数据和可见数据来训练、微调、优化和验证预先存在的目标检测模型。因此，我们进行了一项实证研究，以确定在检测性能方面最有效的波段特定DeepFace检测模型。我们研究中选择的原始目标检测模型是Faster R-CNN(基于区域的卷积神经网络)、SSD(单镜头多盒检测器)和R-FCN(基于区域的全卷积网络)。此外，用于这项工作的双频数据集由两个具有挑战性的MWIR和可见光波段人脸数据集组成，其中人脸是在不同条件下捕获的，即室内，室外，不同的距离(5米和10米)和姿势。实验结果表明，所提出的检测模型无论在何种波段和场景下都具有最高的检测精度。具体来说，我们表明，与所有其他测试模型相比，使用ResNet 101修改和调整的Faster R-CNN架构是最有前途的模型。该模型在热数据和可见人脸数据上的准确率分别为99.2%和98.4%。最后，虽然所提出的模型相对较慢，但我们进一步的实验表明，该网络的速度可以通过减少RPN (Region Proposal network)中的提案数量来提高，从而显著降低计算复杂度挑战。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

On Designing MWIR and Visible Band based DeepFace Detection Models

In this work, we propose an optimal solution for face detection when operating in the thermal and visible bands. Our aim is to train, fine tune, optimize and validate preexisting object detection models using thermal and visible data separately. Thus, we perform an empirical study to determine the most efficient band specific DeepFace detection model in terms of detection performance. The original object detection models that were selected for our study are the Faster R-CNN (Region based Convolutional Neural Network), SSD (Single-shot Multi-Box Detector) and R-FCN (Region-based Fully Convolutional Network). Also, the dual-band dataset used for this work is composed of two challenging MWIR and visible band face datasets, where the faces were captured under variable conditions, i.e. indoors, outdoors, different standoff distances (5 and 10 meters) and poses. Experimental results show that the proposed detection model yields the highest accuracy independent of the band and scenario used. Specifically, we show that a modified and tuned Faster R-CNN architecture with ResNet 101 is the most promising model when compared to all the other models tested. The proposed model yields accuracy of 99.2% and 98.4% when tested on thermal and visible face data respectively. Finally, while the proposed model is relatively slower than its competitors, our further experiments show that the speed of this network can be increased by reducing the number of proposals in RPN (Region Proposal Network), and thus, the computational complexity challenge is significantly minimized.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)

自引率

0.00%

发文量