{"title":"基于RANSAC离群值去除的自监督单目深度估计尺度恢复","authors":"Zhuoyue Wu, G. Zhuo, Feng Xue","doi":"10.1109/CVCI51460.2020.9338538","DOIUrl":null,"url":null,"abstract":"Recently, self-supervised method has become an increasingly significant branch of depth estimation task, especially in the realm of autonomous driving applications. However, current per-pixel depth maps yielded from RGB images still suffer from uncertain scale factor generated by the nature of monocular image sequences, which further leads to the insufficiency in practical use. In this work, we first analyze such scale uncertainty both theoretically and practically. Then we perform scale recovery utilizing geometric constraint to estimate accurate scale factor, RANSAC(Random sample consensus) outlier removal is introduced into pipeline to obtain accurate ground point extraction. Adequate experiments on KITTI dataset(dataset generated by an autonomous driving platform built up by KIT and TRINA comprising stereo and optical flow image pairs as well as laser data, distributed to train set and test set on account of deep learning), show that, using only camera height prior, our proposed method, though not relying on additional sensors, is able to achieve accurate scale recovery and outperform existing scale recovery methods.","PeriodicalId":119721,"journal":{"name":"2020 4th CAA International Conference on Vehicular Control and Intelligence (CVCI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Self-Supervised Monocular Depth Estimation Scale Recovery using RANSAC Outlier Removal\",\"authors\":\"Zhuoyue Wu, G. Zhuo, Feng Xue\",\"doi\":\"10.1109/CVCI51460.2020.9338538\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, self-supervised method has become an increasingly significant branch of depth estimation task, especially in the realm of autonomous driving applications. However, current per-pixel depth maps yielded from RGB images still suffer from uncertain scale factor generated by the nature of monocular image sequences, which further leads to the insufficiency in practical use. In this work, we first analyze such scale uncertainty both theoretically and practically. Then we perform scale recovery utilizing geometric constraint to estimate accurate scale factor, RANSAC(Random sample consensus) outlier removal is introduced into pipeline to obtain accurate ground point extraction. Adequate experiments on KITTI dataset(dataset generated by an autonomous driving platform built up by KIT and TRINA comprising stereo and optical flow image pairs as well as laser data, distributed to train set and test set on account of deep learning), show that, using only camera height prior, our proposed method, though not relying on additional sensors, is able to achieve accurate scale recovery and outperform existing scale recovery methods.\",\"PeriodicalId\":119721,\"journal\":{\"name\":\"2020 4th CAA International Conference on Vehicular Control and Intelligence (CVCI)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 4th CAA International Conference on Vehicular Control and Intelligence (CVCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CVCI51460.2020.9338538\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 4th CAA International Conference on Vehicular Control and Intelligence (CVCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVCI51460.2020.9338538","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Self-Supervised Monocular Depth Estimation Scale Recovery using RANSAC Outlier Removal
Recently, self-supervised method has become an increasingly significant branch of depth estimation task, especially in the realm of autonomous driving applications. However, current per-pixel depth maps yielded from RGB images still suffer from uncertain scale factor generated by the nature of monocular image sequences, which further leads to the insufficiency in practical use. In this work, we first analyze such scale uncertainty both theoretically and practically. Then we perform scale recovery utilizing geometric constraint to estimate accurate scale factor, RANSAC(Random sample consensus) outlier removal is introduced into pipeline to obtain accurate ground point extraction. Adequate experiments on KITTI dataset(dataset generated by an autonomous driving platform built up by KIT and TRINA comprising stereo and optical flow image pairs as well as laser data, distributed to train set and test set on account of deep learning), show that, using only camera height prior, our proposed method, though not relying on additional sensors, is able to achieve accurate scale recovery and outperform existing scale recovery methods.