3D Body and Background Reconstruction in a Large-scale Indoor Scene using Multiple Depth Cameras

2019 12th Asia Pacific Workshop on Mixed and Augmented Reality (APMAR) Pub Date : 2019-05-07 DOI:10.1109/APMAR.2019.8709280

Daisuke Kobayashi, D. Thomas, Hideaki Uchiyama, R. Taniguchi

{"title":"3D Body and Background Reconstruction in a Large-scale Indoor Scene using Multiple Depth Cameras","authors":"Daisuke Kobayashi, D. Thomas, Hideaki Uchiyama, R. Taniguchi","doi":"10.1109/APMAR.2019.8709280","DOIUrl":null,"url":null,"abstract":"3D reconstruction of indoor scenes that contain non-rigidly moving human body using depth cameras is a task of extraordinary difficulty. Despite intensive efforts from the researchers in the 3D vision community, existing methods are still limited to reconstruct small scale scenes. This is because of the difficulty to track the camera motion when a target person moves in a totally different direction. Due to the narrow field of view (FoV) of consumer-grade red-green-blue-depth (RGB-D) cameras, a target person (generally put at about 2–3 meters from the camera) covers most of the FoV of the camera. Therefore, there are not enough features from the static background to track the motion of the camera. In this paper, we propose a system which reconstructs a moving human body and the background of an indoor scene using multiple depth cameras. Our system is composed of three Kinects that are approximately set in the same line and facing the same direction so that their FoV do not overlap (to avoid interference). Owing to this setup, we capture images of a person moving in a large scale indoor scene. The three Kinect cameras are calibrated with a robust method that uses three large non parallel planes. A moving person is detected by using human skeleton information, and is reconstructed separately from the static background. By separating the human body and the background, static 3D reconstruction can be adopted for the static background area while a method specialized for the human body area can be used to reconstruct the 3D model of the moving person. The experimental result shows the performance of proposed system for human body in a large-scale indoor scene.","PeriodicalId":156273,"journal":{"name":"2019 12th Asia Pacific Workshop on Mixed and Augmented Reality (APMAR)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 12th Asia Pacific Workshop on Mixed and Augmented Reality (APMAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APMAR.2019.8709280","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

3D reconstruction of indoor scenes that contain non-rigidly moving human body using depth cameras is a task of extraordinary difficulty. Despite intensive efforts from the researchers in the 3D vision community, existing methods are still limited to reconstruct small scale scenes. This is because of the difficulty to track the camera motion when a target person moves in a totally different direction. Due to the narrow field of view (FoV) of consumer-grade red-green-blue-depth (RGB-D) cameras, a target person (generally put at about 2–3 meters from the camera) covers most of the FoV of the camera. Therefore, there are not enough features from the static background to track the motion of the camera. In this paper, we propose a system which reconstructs a moving human body and the background of an indoor scene using multiple depth cameras. Our system is composed of three Kinects that are approximately set in the same line and facing the same direction so that their FoV do not overlap (to avoid interference). Owing to this setup, we capture images of a person moving in a large scale indoor scene. The three Kinect cameras are calibrated with a robust method that uses three large non parallel planes. A moving person is detected by using human skeleton information, and is reconstructed separately from the static background. By separating the human body and the background, static 3D reconstruction can be adopted for the static background area while a method specialized for the human body area can be used to reconstruct the 3D model of the moving person. The experimental result shows the performance of proposed system for human body in a large-scale indoor scene.

查看原文本刊更多论文

基于多深度相机的大规模室内场景三维体与背景重建

利用深度相机对包含非刚性运动人体的室内场景进行三维重建是一项非常困难的任务。尽管3D视觉界的研究人员付出了巨大的努力，但现有的方法仍然局限于小尺度场景的重建。这是因为当目标人物移动到完全不同的方向时，很难跟踪摄像机的运动。由于消费级红绿蓝深(RGB-D)相机的视场(FoV)狭窄，目标人物(通常放置在距相机约2-3米的位置)占据了相机的大部分视场。因此，静态背景中没有足够的特征来跟踪摄像机的运动。在本文中，我们提出了一种使用多个深度相机重建运动人体和室内场景背景的系统。我们的系统由三个kinect组成，它们大致设置在同一条线上，面向同一方向，因此它们的FoV不会重叠(以避免干扰)。由于这种设置，我们可以捕捉到一个人在大规模室内场景中移动的图像。这三个Kinect摄像头采用了一种稳健的方法进行校准，该方法使用了三个大的非平行平面。利用人体骨架信息检测运动的人，并与静态背景分离重建运动的人。将人体与背景分离，对静态背景区域采用静态三维重建，对运动人的三维模型采用针对人体区域的方法进行重建。实验结果表明了该系统在大规模室内场景下对人体的检测效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 12th Asia Pacific Workshop on Mixed and Augmented Reality (APMAR)

自引率

0.00%

发文量