Automatic Reconstruction of Building-Scale Indoor 3D Environment with a Deep-Reinforcement-Learning-Based Mobile Robot

International Journal of Robotics and Automation Technology Pub Date : 2021-11-29 DOI:10.31875/2409-9694.2019.06.2

Menglong Yang, K. Nagao

{"title":"Automatic Reconstruction of Building-Scale Indoor 3D Environment with a Deep-Reinforcement-Learning-Based Mobile Robot","authors":"Menglong Yang, K. Nagao","doi":"10.31875/2409-9694.2019.06.2","DOIUrl":null,"url":null,"abstract":"The aim of this paper is to digitize the environments in which humans live, at low cost, and reconstruct highly accurate three-dimensional environments that are based on those in the real world. This three-dimensional content can be used such as for virtual reality environments and three-dimensional maps for automatic driving systems. In general, however, a three-dimensional environment must be carefully reconstructed by manually moving the sensors used to first scan the real environment on which the three-dimensional one is based. This is done so that every corner of an entire area can be measured, but time and costs increase as the area expands. Therefore, a system that creates three-dimensional content that is based on real-world large-scale buildings at low cost is proposed. This involves automatically scanning the indoors with a mobile robot that uses low-cost sensors and generating 3D point clouds. When the robot reaches an appropriate measurement position, it collects the three-dimensional data of shapes observable from that position by using a 3D sensor and 360-degree panoramic camera. The problem of determining an appropriate measurement position is called the “next best view problem,” and it is difficult to solve in a complicated indoor environment. To deal with this problem, a deep reinforcement learning method is employed. It combines reinforcement learning, with which an autonomous agent learns strategies for selecting behavior, and deep learning done using a neural network. As a result, 3D point cloud data can be generated with better quality than the conventional rule-based approach.","PeriodicalId":234563,"journal":{"name":"International Journal of Robotics and Automation Technology","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Robotics and Automation Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31875/2409-9694.2019.06.2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

The aim of this paper is to digitize the environments in which humans live, at low cost, and reconstruct highly accurate three-dimensional environments that are based on those in the real world. This three-dimensional content can be used such as for virtual reality environments and three-dimensional maps for automatic driving systems. In general, however, a three-dimensional environment must be carefully reconstructed by manually moving the sensors used to first scan the real environment on which the three-dimensional one is based. This is done so that every corner of an entire area can be measured, but time and costs increase as the area expands. Therefore, a system that creates three-dimensional content that is based on real-world large-scale buildings at low cost is proposed. This involves automatically scanning the indoors with a mobile robot that uses low-cost sensors and generating 3D point clouds. When the robot reaches an appropriate measurement position, it collects the three-dimensional data of shapes observable from that position by using a 3D sensor and 360-degree panoramic camera. The problem of determining an appropriate measurement position is called the “next best view problem,” and it is difficult to solve in a complicated indoor environment. To deal with this problem, a deep reinforcement learning method is employed. It combines reinforcement learning, with which an autonomous agent learns strategies for selecting behavior, and deep learning done using a neural network. As a result, 3D point cloud data can be generated with better quality than the conventional rule-based approach.

查看原文本刊更多论文

基于深度强化学习的移动机器人自动重建建筑尺度室内三维环境

本文的目的是将人类生活的环境数字化，以低成本重建基于现实世界的高精度三维环境。这种三维内容可以用于虚拟现实环境和自动驾驶系统的三维地图等。然而，一般来说，三维环境必须通过手动移动用于首先扫描三维环境所基于的真实环境的传感器来仔细重建。这样做是为了测量整个区域的每个角落，但时间和成本随着面积的扩大而增加。因此，提出了一种基于现实世界大型建筑以低成本创建三维内容的系统。这包括用一个使用低成本传感器的移动机器人自动扫描室内，并产生3D点云。当机器人到达适当的测量位置时，它通过3D传感器和360度全景相机收集该位置可观察到的形状的三维数据。确定合适的测量位置的问题被称为“次优视角问题”，在复杂的室内环境中很难解决。为了解决这个问题，采用了一种深度强化学习方法。它结合了强化学习和使用神经网络完成的深度学习。强化学习是指自主代理学习选择行为的策略。因此，与传统的基于规则的方法相比，3D点云数据的生成质量更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Robotics and Automation Technology

自引率

0.00%

发文量