BDLoc: Global Localization from 2.5D Building Map

2021 IEEE International Symposium on Mixed and Augmented Reality (ISMAR) Pub Date : 2021-10-01 DOI:10.1109/ismar52148.2021.00022

Hai Li, Tianxing Fan, Hongjia Zhai, Zhaopeng Cui, H. Bao, Guofeng Zhang

{"title":"BDLoc: Global Localization from 2.5D Building Map","authors":"Hai Li, Tianxing Fan, Hongjia Zhai, Zhaopeng Cui, H. Bao, Guofeng Zhang","doi":"10.1109/ismar52148.2021.00022","DOIUrl":null,"url":null,"abstract":"Robust and accurate global 6DoF localization is essential for many applications, i.e., augmented reality and autonomous driving. Most existing 6DoF visual localization approaches need to build a dense texture model in advance, which is computationally extensive and almost infeasible in the global range. In this work, we propose BDLoc, a hierarchical global localization framework via the 2.5D building map, which is able to estimate the accurate pose of the query street-view image without using detailed dense 3D model and texture information. Specifically speaking, we first extract the 3D building information from the street-view image and surrounding 2.5D building map, and then solve a coarse relative pose by local to global registration. In order to improve the feature extraction, we propose a novel SPG-Net which is able to capture both local and global features. Finally, an iterative semantic alignment is applied to obtain a finner result with the differentiable rendering and the cross-view semantic constraint. Except for a coarse longitude and latitude from GPS, BDLoc doesn’t need any additional information like altitude and orientation that are necessary for many previous works. We also create a large dataset to explore the performance of the 2.5D map-based localization task. Extensive experiments demonstrate the superior performance of our method.","PeriodicalId":395413,"journal":{"name":"2021 IEEE International Symposium on Mixed and Augmented Reality (ISMAR)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Symposium on Mixed and Augmented Reality (ISMAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ismar52148.2021.00022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Robust and accurate global 6DoF localization is essential for many applications, i.e., augmented reality and autonomous driving. Most existing 6DoF visual localization approaches need to build a dense texture model in advance, which is computationally extensive and almost infeasible in the global range. In this work, we propose BDLoc, a hierarchical global localization framework via the 2.5D building map, which is able to estimate the accurate pose of the query street-view image without using detailed dense 3D model and texture information. Specifically speaking, we first extract the 3D building information from the street-view image and surrounding 2.5D building map, and then solve a coarse relative pose by local to global registration. In order to improve the feature extraction, we propose a novel SPG-Net which is able to capture both local and global features. Finally, an iterative semantic alignment is applied to obtain a finner result with the differentiable rendering and the cross-view semantic constraint. Except for a coarse longitude and latitude from GPS, BDLoc doesn’t need any additional information like altitude and orientation that are necessary for many previous works. We also create a large dataset to explore the performance of the 2.5D map-based localization task. Extensive experiments demonstrate the superior performance of our method.

查看原文本刊更多论文

BDLoc:基于2.5D建筑地图的全球定位

强大而准确的全球6DoF定位对于许多应用至关重要，例如增强现实和自动驾驶。现有的六自由度视觉定位方法大多需要预先建立密集的纹理模型，计算量大，在全局范围内几乎不可行。在这项工作中，我们提出了一种基于2.5D建筑地图的分层全局定位框架BDLoc，该框架能够在不使用详细的密集3D模型和纹理信息的情况下估计查询街景图像的准确姿态。具体来说，我们首先从街景图像和周围的2.5D建筑地图中提取3D建筑信息，然后通过局部到全局的配准来求解粗糙的相对位姿。为了改进特征提取，我们提出了一种能够同时捕获局部和全局特征的新型SPG-Net。最后，利用可微渲染和跨视图语义约束进行迭代语义对齐，得到更精细的结果。除了GPS提供的粗略经纬度外，BDLoc不需要任何额外的信息，如高度和方向，这是许多以前的工作所必需的。我们还创建了一个大型数据集来探索基于2.5D地图的定位任务的性能。大量的实验证明了该方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE International Symposium on Mixed and Augmented Reality (ISMAR)

自引率

0.00%

发文量