Active non-line-of-sight human pose estimation based on deep learning

International Conference on Optical Instruments and Technology Pub Date : 2022-07-22 DOI:10.1117/12.2610975

Qianqian Xu, Liquan Dong, Lingqin Kong, Yuejin Zhao, Ming Liu

{"title":"Active non-line-of-sight human pose estimation based on deep learning","authors":"Qianqian Xu, Liquan Dong, Lingqin Kong, Yuejin Zhao, Ming Liu","doi":"10.1117/12.2610975","DOIUrl":null,"url":null,"abstract":"Non-Line-of-Sight technology is to image objects that are hidden from the camera's view. It has a wide range of application prospects in robotic vision, national defense, remote sensing, medical imaging, and unmanned driving. Active Non-Lineof- Sight imaging mainly relies on time-resolved optical impulse responses. The Non-Line-of-Sight imaging system emits ultra-short light pulses to illuminate the diffuse reflection wall, and uses ultra-fast time-resolved single-photon detectors to collect multiple reflected photon information, thereby obtaining information in the hidden scene. Finally, various reconstruction algorithms are used to reconstruct the hidden scene. However, most of the existing reconstruction algorithms have the problems of slow reconstruction speed and fuzzy reconstruction results, especially in the aspect of human pose estimation. In this article, we describe a method of active Non-Line-of-Sight human pose estimation based on deep learning. In order to solve the problem of lack of deep learning data, we simulate large amounts of pseudo-transient images for the network, including various complex actions: walking, jumping, turning, bending back and forth, rotating, using the confocal Non-Line-of-Sight imaging model. And then we train the simulated transient images using light cones Transformation and U-net coding and decoding network structure. Finally, we examine the performance of our method on synthetic and experimental datasets. The prediction results show that our method can not only estimate the pose of real measured non-view human pose data, but also significantly improve the quality of reconstruction.","PeriodicalId":201899,"journal":{"name":"International Conference on Optical Instruments and Technology","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Optical Instruments and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2610975","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Non-Line-of-Sight technology is to image objects that are hidden from the camera's view. It has a wide range of application prospects in robotic vision, national defense, remote sensing, medical imaging, and unmanned driving. Active Non-Lineof- Sight imaging mainly relies on time-resolved optical impulse responses. The Non-Line-of-Sight imaging system emits ultra-short light pulses to illuminate the diffuse reflection wall, and uses ultra-fast time-resolved single-photon detectors to collect multiple reflected photon information, thereby obtaining information in the hidden scene. Finally, various reconstruction algorithms are used to reconstruct the hidden scene. However, most of the existing reconstruction algorithms have the problems of slow reconstruction speed and fuzzy reconstruction results, especially in the aspect of human pose estimation. In this article, we describe a method of active Non-Line-of-Sight human pose estimation based on deep learning. In order to solve the problem of lack of deep learning data, we simulate large amounts of pseudo-transient images for the network, including various complex actions: walking, jumping, turning, bending back and forth, rotating, using the confocal Non-Line-of-Sight imaging model. And then we train the simulated transient images using light cones Transformation and U-net coding and decoding network structure. Finally, we examine the performance of our method on synthetic and experimental datasets. The prediction results show that our method can not only estimate the pose of real measured non-view human pose data, but also significantly improve the quality of reconstruction.

查看原文本刊更多论文

基于深度学习的主动非视线人体姿态估计

非视线技术是对隐藏在相机视野之外的物体进行成像。在机器人视觉、国防、遥感、医学成像、无人驾驶等领域具有广泛的应用前景。主动非瞄准线成像主要依赖于时间分辨光脉冲响应。非视距成像系统发射超短光脉冲照射漫反射壁，利用超快时间分辨单光子探测器采集多个反射光子信息，从而获取隐藏场景中的信息。最后，利用各种重构算法对隐藏场景进行重构。然而，现有的重建算法大多存在重建速度慢、重建结果模糊等问题，尤其是在人体姿态估计方面。在本文中，我们描述了一种基于深度学习的主动非视线人体姿态估计方法。为了解决深度学习数据缺乏的问题，我们为网络模拟了大量的伪瞬态图像，包括各种复杂动作:行走、跳跃、转弯、前后弯曲、旋转，使用共焦非视距成像模型。然后利用光锥变换和U-net编解码网络结构对模拟瞬态图像进行训练。最后，我们检验了我们的方法在合成数据集和实验数据集上的性能。预测结果表明，该方法不仅可以对真实测量的非视点人体姿态数据进行姿态估计，而且显著提高了重建质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference on Optical Instruments and Technology

自引率

0.00%

发文量