Single Image 3D Vehicle Pose Estimation for Augmented Reality
Yawen Lu, Sophia Kourian, C. Salvaggio, Chenliang Xu, G. Lu
2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP), November 2019. DOI: 10.1109/GlobalSIP45357.2019.8969201
This paper introduces a novel method for 3D vehicle pose estimation, a critical component of augmented reality. The proposed method recovers the location of a specific object from a single image by combining reliable pretrained semantic segmentation with improved single-image depth estimation. Our method then estimates pose by generating new 2D images from the projections of rotated point clouds, from which the rotation of the object can be predicted. Based on the recovered orientation and localization, augmented objects can be shifted, rotated, and scaled correctly, so virtual vehicles can be accurately placed in the scene in place of real vehicles. The effectiveness of our method is verified by comparison with other recent pose estimation methods on the challenging KITTI 3D benchmark, and further experiments on the Cityscapes dataset demonstrate its robustness. Without requiring ground-truth 3D vehicle pose labels for training, our model produces competitive and robust performance in 3D vehicle pose estimation.
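The pipeline described in the abstract (segment the vehicle, estimate its depth, lift it to a point cloud, then predict rotation by projecting rotated point clouds back to 2D) can be illustrated with a minimal analysis-by-synthesis sketch. This is not the authors' implementation: the function names, the grid search over yaw, the silhouette-IoU score, and the assumption of a canonical vehicle template point cloud with known pinhole intrinsics are all illustrative choices made for this example.

```python
# Minimal sketch (not the authors' code) of the rotate-project-compare idea:
# lift the segmented vehicle to 3D using the estimated depth, then search for
# the yaw angle whose rotated, reprojected point cloud best matches the
# observed segmentation mask.
import numpy as np

def backproject(depth, mask, fx, fy, cx, cy):
    """Lift masked depth pixels into a camera-frame point cloud of shape (N, 3)."""
    v, u = np.nonzero(mask)
    z = depth[v, u]
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=1)

def recover_location(depth, mask, fx, fy, cx, cy):
    """Estimate the object's 3D location as the centroid of its backprojected points."""
    return backproject(depth, mask, fx, fy, cx, cy).mean(axis=0)

def project_silhouette(points, fx, fy, cx, cy, shape):
    """Project camera-frame 3D points (N, 3) to a binary occupancy image."""
    sil = np.zeros(shape, dtype=bool)
    z = points[:, 2]
    valid = z > 1e-3
    u = np.round(points[valid, 0] * fx / z[valid] + cx).astype(int)
    v = np.round(points[valid, 1] * fy / z[valid] + cy).astype(int)
    keep = (u >= 0) & (u < shape[1]) & (v >= 0) & (v < shape[0])
    sil[v[keep], u[keep]] = True
    return sil

def estimate_yaw(template_pts, translation, obs_mask, fx, fy, cx, cy, num_angles=72):
    """Grid-search the yaw whose rendered silhouette best overlaps the observed mask."""
    best_yaw, best_iou = 0.0, -1.0
    for yaw in np.linspace(0.0, 2.0 * np.pi, num_angles, endpoint=False):
        c, s = np.cos(yaw), np.sin(yaw)
        R = np.array([[c, 0.0, s],
                      [0.0, 1.0, 0.0],
                      [-s, 0.0, c]])           # rotation about the camera Y (up) axis
        pts = template_pts @ R.T + translation  # place the rotated template at the recovered location
        sil = project_silhouette(pts, fx, fy, cx, cy, obs_mask.shape)
        inter = np.logical_and(sil, obs_mask).sum()
        union = np.logical_or(sil, obs_mask).sum()
        iou = inter / union if union else 0.0
        if iou > best_iou:
            best_yaw, best_iou = yaw, iou
    return best_yaw, best_iou
```

In this sketch, the translation comes from the centroid of the backprojected masked depth pixels, standing in for the paper's segmentation-plus-depth localization step; the paper's own rotation prediction may use the rendered 2D projections differently (for example, as input to a learned predictor) rather than a direct silhouette-IoU search.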