{"title":"基于EKF方法的长期稳定单目三维多目标跟踪","authors":"A. Reich, Hans-Joachim Wünsche","doi":"10.23919/fusion49465.2021.9626850","DOIUrl":null,"url":null,"abstract":"Most monocular object tracking algorithms work in the 2D domain of the image. However, object trajectories, which are very simple in a fixed 3D world space, result in complex motions on the image plane, especially when the camera is moving. Therefore, in absence of any 3D representation, aforementioned approaches are only able to perform the measurement-to-track association based on rough similarity of 2D bounding box parameters. Recent advances in monocular 3D object detection allow to extract additional parameters like the pose and spatial extent of a 3D bounding box. In this paper, we present a multi-object tracking approach composed of an Extended Kalman filter estimating the 3D state by using these detections for track initialization. In subsequent time steps 2D bounding boxes are used to avoid filtering temporally correlated 3D measurements. This ensures properly estimated state uncertainties. We show that this 3D representation is very valuable as we achieve state-of-the-art results on the KITTI dataset with an association solely based on 2D bounding box comparison. We use state uncertainties transformed into the measurement space while completely ignoring appearance features.","PeriodicalId":226850,"journal":{"name":"2021 IEEE 24th International Conference on Information Fusion (FUSION)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Monocular 3D Multi-Object Tracking with an EKF Approach for Long-Term Stable Tracks\",\"authors\":\"A. Reich, Hans-Joachim Wünsche\",\"doi\":\"10.23919/fusion49465.2021.9626850\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Most monocular object tracking algorithms work in the 2D domain of the image. However, object trajectories, which are very simple in a fixed 3D world space, result in complex motions on the image plane, especially when the camera is moving. Therefore, in absence of any 3D representation, aforementioned approaches are only able to perform the measurement-to-track association based on rough similarity of 2D bounding box parameters. Recent advances in monocular 3D object detection allow to extract additional parameters like the pose and spatial extent of a 3D bounding box. In this paper, we present a multi-object tracking approach composed of an Extended Kalman filter estimating the 3D state by using these detections for track initialization. In subsequent time steps 2D bounding boxes are used to avoid filtering temporally correlated 3D measurements. This ensures properly estimated state uncertainties. We show that this 3D representation is very valuable as we achieve state-of-the-art results on the KITTI dataset with an association solely based on 2D bounding box comparison. We use state uncertainties transformed into the measurement space while completely ignoring appearance features.\",\"PeriodicalId\":226850,\"journal\":{\"name\":\"2021 IEEE 24th International Conference on Information Fusion (FUSION)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 24th International Conference on Information Fusion (FUSION)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/fusion49465.2021.9626850\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 24th International Conference on Information Fusion (FUSION)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/fusion49465.2021.9626850","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Monocular 3D Multi-Object Tracking with an EKF Approach for Long-Term Stable Tracks
Most monocular object tracking algorithms work in the 2D domain of the image. However, object trajectories, which are very simple in a fixed 3D world space, result in complex motions on the image plane, especially when the camera is moving. Therefore, in absence of any 3D representation, aforementioned approaches are only able to perform the measurement-to-track association based on rough similarity of 2D bounding box parameters. Recent advances in monocular 3D object detection allow to extract additional parameters like the pose and spatial extent of a 3D bounding box. In this paper, we present a multi-object tracking approach composed of an Extended Kalman filter estimating the 3D state by using these detections for track initialization. In subsequent time steps 2D bounding boxes are used to avoid filtering temporally correlated 3D measurements. This ensures properly estimated state uncertainties. We show that this 3D representation is very valuable as we achieve state-of-the-art results on the KITTI dataset with an association solely based on 2D bounding box comparison. We use state uncertainties transformed into the measurement space while completely ignoring appearance features.