考虑隐私的深度学习辅助单像素手势识别

IEICE Trans. Electron. Pub Date : 2022-01-01 DOI:10.1587/transele.2021dii0002

Naoya Mukojima, Masaki Yasugi, Y. Mizutani, T. Yasui, Hirotsugu Yamamoto

{"title":"考虑隐私的深度学习辅助单像素手势识别","authors":"Naoya Mukojima, Masaki Yasugi, Y. Mizutani, T. Yasui, Hirotsugu Yamamoto","doi":"10.1587/transele.2021dii0002","DOIUrl":null,"url":null,"abstract":"SUMMARY We have utilized single-pixel imaging and deep-learning to solve the privacy-preserving problem in gesture recognition for interactive display. Silhouette images of hand gestures were acquired by use of a display panel as an illumination. Reconstructions of gesture images have been performed by numerical experiments on single-pixel imaging by changing the number of illumination mask patterns. For the training and the image restoration with deep learning, we prepared reconstructed data with 250 and 500 illuminations as datasets. For each of the 250 and 500 illuminations, we prepared 9000 datasets in which original images and reconstructed data were paired. Of these data, 8500 data were used for training a neural network (6800 data for training and 1700 data for validation), and 500 data were used to evaluate the accuracy of image restoration. Our neural network, based on U-net, was able to restore images close to the original images even from reconstructed data with greatly reduced number of illuminations, which is 1/40 of the single-pixel imaging without deep learning. Compared restoration accuracy between cases using shadowgraph (black on white background) and negative-positive reversed images (white on black background) as silhouette image, the accuracy of the restored image was lower for negative-positive-reversed images when the number of illuminations was small. Moreover, we found that the restoration accuracy decreased in the order of rock, scissor, and paper. Shadowgraph is suitable for gesture silhouette, and it is necessary to prepare training data and construct neural networks, to avoid the restoration accuracy between gestures when further reducing the number of illuminations.","PeriodicalId":13259,"journal":{"name":"IEICE Trans. Electron.","volume":"735 1","pages":"79-85"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Deep-Learning-Assisted Single-Pixel Imaging for Gesture Recognition in Consideration of Privacy\",\"authors\":\"Naoya Mukojima, Masaki Yasugi, Y. Mizutani, T. Yasui, Hirotsugu Yamamoto\",\"doi\":\"10.1587/transele.2021dii0002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"SUMMARY We have utilized single-pixel imaging and deep-learning to solve the privacy-preserving problem in gesture recognition for interactive display. Silhouette images of hand gestures were acquired by use of a display panel as an illumination. Reconstructions of gesture images have been performed by numerical experiments on single-pixel imaging by changing the number of illumination mask patterns. For the training and the image restoration with deep learning, we prepared reconstructed data with 250 and 500 illuminations as datasets. For each of the 250 and 500 illuminations, we prepared 9000 datasets in which original images and reconstructed data were paired. Of these data, 8500 data were used for training a neural network (6800 data for training and 1700 data for validation), and 500 data were used to evaluate the accuracy of image restoration. Our neural network, based on U-net, was able to restore images close to the original images even from reconstructed data with greatly reduced number of illuminations, which is 1/40 of the single-pixel imaging without deep learning. Compared restoration accuracy between cases using shadowgraph (black on white background) and negative-positive reversed images (white on black background) as silhouette image, the accuracy of the restored image was lower for negative-positive-reversed images when the number of illuminations was small. Moreover, we found that the restoration accuracy decreased in the order of rock, scissor, and paper. Shadowgraph is suitable for gesture silhouette, and it is necessary to prepare training data and construct neural networks, to avoid the restoration accuracy between gestures when further reducing the number of illuminations.\",\"PeriodicalId\":13259,\"journal\":{\"name\":\"IEICE Trans. Electron.\",\"volume\":\"735 1\",\"pages\":\"79-85\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEICE Trans. Electron.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1587/transele.2021dii0002\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEICE Trans. Electron.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1587/transele.2021dii0002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

我们利用单像素成像和深度学习来解决交互式显示手势识别中的隐私保护问题。手势的轮廓图像是通过使用显示面板作为照明来获得的。通过改变照明掩模模式的数目，在单像素成像上进行了手势图像的重建实验。为了训练和深度学习图像恢复，我们准备了250和500照度的重建数据作为数据集。对于250和500照明中的每一个，我们准备了9000个数据集，其中原始图像和重建数据配对。其中8500个数据用于训练神经网络(6800个数据用于训练，1700个数据用于验证)，500个数据用于评估图像恢复的准确性。我们基于U-net的神经网络，即使在光照数量大大减少的情况下，也能从重建的数据中恢复接近原始图像的图像，这是没有深度学习的单像素成像的1/40。对比阴影图(白色背景上的黑色)和正负反转图(黑色背景上的白色)作为剪影图像的恢复精度，光照数较少时正负反转图像的恢复精度较低。此外，我们发现恢复精度依次为岩石、剪刀和纸。阴影图适合于手势剪影，需要准备训练数据和构建神经网络，以避免在进一步减少光照数量时手势之间的恢复精度降低。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Deep-Learning-Assisted Single-Pixel Imaging for Gesture Recognition in Consideration of Privacy

SUMMARY We have utilized single-pixel imaging and deep-learning to solve the privacy-preserving problem in gesture recognition for interactive display. Silhouette images of hand gestures were acquired by use of a display panel as an illumination. Reconstructions of gesture images have been performed by numerical experiments on single-pixel imaging by changing the number of illumination mask patterns. For the training and the image restoration with deep learning, we prepared reconstructed data with 250 and 500 illuminations as datasets. For each of the 250 and 500 illuminations, we prepared 9000 datasets in which original images and reconstructed data were paired. Of these data, 8500 data were used for training a neural network (6800 data for training and 1700 data for validation), and 500 data were used to evaluate the accuracy of image restoration. Our neural network, based on U-net, was able to restore images close to the original images even from reconstructed data with greatly reduced number of illuminations, which is 1/40 of the single-pixel imaging without deep learning. Compared restoration accuracy between cases using shadowgraph (black on white background) and negative-positive reversed images (white on black background) as silhouette image, the accuracy of the restored image was lower for negative-positive-reversed images when the number of illuminations was small. Moreover, we found that the restoration accuracy decreased in the order of rock, scissor, and paper. Shadowgraph is suitable for gesture silhouette, and it is necessary to prepare training data and construct neural networks, to avoid the restoration accuracy between gestures when further reducing the number of illuminations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEICE Trans. Electron.

自引率

0.00%

发文量