End-to-end Learning Improves Static Object Geo-localization from Video

2021 IEEE Winter Conference on Applications of Computer Vision (WACV) Pub Date : 2021-01-01 DOI:10.1109/WACV48630.2021.00211

Mohamed Chaabane, L. Gueguen, A. Trabelsi, J. Beveridge, Stephen O'Hara

引用次数: 3

Abstract

Accurately estimating the position of static objects, such as traffic lights, from the moving camera of a self-driving car is a challenging problem. In this work, we present a system that improves the localization of static objects by jointly-optimizing the components of the system via learning. Our system is comprised of networks that perform: 1) 5DoF object pose estimation from a single image, 2) association of objects between pairs of frames, and 3) multi-object tracking to produce the final geo-localization of the static objects within the scene. We evaluate our approach using a publicly-available data set, focusing on traffic lights due to data availability. For each component, we compare against contemporary alternatives and show significantly-improved performance. We also show that the end-to-end system performance is further improved via joint-training of the constituent models. Code is available at: https://github.com/MedChaabane/Static_Objects_Geolocalization.

查看原文本刊更多论文

端到端学习改进视频静态对象的地理定位

从自动驾驶汽车的移动摄像头中准确估计交通灯等静态物体的位置是一个具有挑战性的问题。在这项工作中，我们提出了一个系统，通过学习联合优化系统的组件来提高静态物体的定位。我们的系统由以下网络组成:1)从单张图像中估计物体的5DoF姿态，2)在对帧之间关联物体，以及3)多物体跟踪以产生场景中静态物体的最终地理定位。我们使用公开可用的数据集来评估我们的方法，由于数据的可用性，我们将重点放在交通灯上。对于每个组件，我们与当代替代品进行比较，并显示出显着改进的性能。我们还表明，通过对组成模型的联合训练，端到端系统性能得到了进一步提高。代码可从https://github.com/MedChaabane/Static_Objects_Geolocalization获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE Winter Conference on Applications of Computer Vision (WACV)

自引率

0.00%

发文量