{"title":"Boosting Depth Estimation for Self-Driving in a Self-Supervised Framework via Improved Pose Network","authors":"Yazan Dayoub;Andrey V. Savchenko;Ilya Makarov","doi":"10.1109/OJCS.2024.3505876","DOIUrl":null,"url":null,"abstract":"Depth estimation is a critical component of self-driving vehicles, enabling accurate scene understanding, obstacle detection, and precise localization. Improving the performance of depth estimation networks without increasing computational cost is highly advantageous for autonomous driving systems. In this article, we propose to enhance depth estimation by improving the pose network in a self-supervised framework. Unlike conventional pose networks, our approach preserves more detailed spatial information by integrating multi-scale features and normalized coordinates. This improved spatial awareness allows for more accurate depth predictions. Comprehensive evaluations on the KITTI and Make3D datasets show that our method yields a 2-7% improvement in the absolute relative error (abs_rel) metric. Furthermore, on the KITTI odometry dataset, our approach demonstrates competitive performance, with relative translational error (\n<inline-formula><tex-math>$t_{rel}$</tex-math></inline-formula>\n) of \n<inline-formula><tex-math>$6.11$</tex-math></inline-formula>\n and \n<inline-formula><tex-math>$7.21$</tex-math></inline-formula>\n, and relative rotational error (\n<inline-formula><tex-math>$r_{rel}$</tex-math></inline-formula>\n) of \n<inline-formula><tex-math>$1.12$</tex-math></inline-formula>\n and \n<inline-formula><tex-math>$2.05$</tex-math></inline-formula>\n for sequences 9 and 10, respectively.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"6 ","pages":"109-118"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10767273","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of the Computer Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10767273/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Depth estimation is a critical component of self-driving vehicles, enabling accurate scene understanding, obstacle detection, and precise localization. Improving the performance of depth estimation networks without increasing computational cost is highly advantageous for autonomous driving systems. In this article, we propose to enhance depth estimation by improving the pose network in a self-supervised framework. Unlike conventional pose networks, our approach preserves more detailed spatial information by integrating multi-scale features and normalized coordinates. This improved spatial awareness allows for more accurate depth predictions. Comprehensive evaluations on the KITTI and Make3D datasets show that our method yields a 2-7% improvement in the absolute relative error (abs_rel) metric. Furthermore, on the KITTI odometry dataset, our approach demonstrates competitive performance, with relative translational error (
$t_{rel}$
) of
$6.11$
and
$7.21$
, and relative rotational error (
$r_{rel}$
) of
$1.12$
and
$2.05$
for sequences 9 and 10, respectively.