深度和热图像在人脸检测-图像模式之间的详细比较

Proceedings of the 2022 5th International Conference on Machine Vision and Applications Pub Date : 2022-02-18 DOI:10.1145/3523111.3523114

Wiktor Mucha, M. Kampel

{"title":"深度和热图像在人脸检测-图像模式之间的详细比较","authors":"Wiktor Mucha, M. Kampel","doi":"10.1145/3523111.3523114","DOIUrl":null,"url":null,"abstract":"Face detection is a well-known issue in image processing, and numerous studies are present in this field. A prominent part of the work is devoted to RGB images, leaving depth and thermal data with less interest. However, in some conditions like low-light areas where face detection is needed, non-RGB sensors might perform better. Also, mounting an additional RGB camera could be challenging or not possible, considering privacy concerns. In this work, current deep learning methodologies are employed to train depth and thermal detection models. The training is done using combined publicly available data that is processed by us for this purpose in order to create necessary annotations for a learning process. The resulting models are validated on a new trimodal dataset collected for this experiments purpose. It contains images captured with RGB, depth, and thermal sensors. Various scenes with single and multiple faces appearances can be found. The results show that non-RGB solutions can be applied in practice with highly robust accuracy and their efficiency is close to RGB detectors. However, their performance depends on the environment and that circumstances are described later in this article.","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Depth and Thermal Images in Face Detection - A Detailed Comparison Between Image Modalities\",\"authors\":\"Wiktor Mucha, M. Kampel\",\"doi\":\"10.1145/3523111.3523114\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Face detection is a well-known issue in image processing, and numerous studies are present in this field. A prominent part of the work is devoted to RGB images, leaving depth and thermal data with less interest. However, in some conditions like low-light areas where face detection is needed, non-RGB sensors might perform better. Also, mounting an additional RGB camera could be challenging or not possible, considering privacy concerns. In this work, current deep learning methodologies are employed to train depth and thermal detection models. The training is done using combined publicly available data that is processed by us for this purpose in order to create necessary annotations for a learning process. The resulting models are validated on a new trimodal dataset collected for this experiments purpose. It contains images captured with RGB, depth, and thermal sensors. Various scenes with single and multiple faces appearances can be found. The results show that non-RGB solutions can be applied in practice with highly robust accuracy and their efficiency is close to RGB detectors. However, their performance depends on the environment and that circumstances are described later in this article.\",\"PeriodicalId\":185161,\"journal\":{\"name\":\"Proceedings of the 2022 5th International Conference on Machine Vision and Applications\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 5th International Conference on Machine Vision and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3523111.3523114\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3523111.3523114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

人脸检测是图像处理中一个众所周知的问题，目前在这一领域进行了大量的研究。工作的一个突出部分是致力于RGB图像，留下深度和热数据较少的兴趣。然而，在某些情况下，如需要人脸检测的低光区域，非rgb传感器可能表现更好。此外，考虑到隐私问题，安装一个额外的RGB摄像头可能是具有挑战性的，或者是不可能的。在这项工作中，采用当前的深度学习方法来训练深度和热检测模型。训练是使用我们为此目的处理的公开可用数据来完成的，以便为学习过程创建必要的注释。所得模型在为此实验目的收集的新三模数据集上进行了验证。它包含用RGB、深度和热传感器捕获的图像。可以找到各种单一和多重面孔出现的场景。结果表明，非RGB解决方案在实际应用中具有很高的鲁棒精度，其效率接近RGB检测器。但是，它们的性能取决于环境，这些环境将在本文后面介绍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Depth and Thermal Images in Face Detection - A Detailed Comparison Between Image Modalities

Face detection is a well-known issue in image processing, and numerous studies are present in this field. A prominent part of the work is devoted to RGB images, leaving depth and thermal data with less interest. However, in some conditions like low-light areas where face detection is needed, non-RGB sensors might perform better. Also, mounting an additional RGB camera could be challenging or not possible, considering privacy concerns. In this work, current deep learning methodologies are employed to train depth and thermal detection models. The training is done using combined publicly available data that is processed by us for this purpose in order to create necessary annotations for a learning process. The resulting models are validated on a new trimodal dataset collected for this experiments purpose. It contains images captured with RGB, depth, and thermal sensors. Various scenes with single and multiple faces appearances can be found. The results show that non-RGB solutions can be applied in practice with highly robust accuracy and their efficiency is close to RGB detectors. However, their performance depends on the environment and that circumstances are described later in this article.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2022 5th International Conference on Machine Vision and Applications

自引率

0.00%

发文量