Depth Estimation of Monocular Images using Transfer Learning based Unet Model

2022 IEEE Bombay Section Signature Conference (IBSSC) Pub Date : 2022-12-08 DOI:10.1109/IBSSC56953.2022.10037503

Suchitra A. Patil, Chandrakant J. Gaikwad

{"title":"Depth Estimation of Monocular Images using Transfer Learning based Unet Model","authors":"Suchitra A. Patil, Chandrakant J. Gaikwad","doi":"10.1109/IBSSC56953.2022.10037503","DOIUrl":null,"url":null,"abstract":"The problem of depth estimation using monocular images is very challenging contrasted to methods for estimating depth that involve several pictures, like stereo depth perception. Previous studies in this field have usually focused on utilizing geometrical priors or relied on other data collection techniques. Various machine learning methods, notably deep convolutional neural networks (CNN) integrated with artificial intelligence (AI) approaches, have recently produced new marks for a variety of visual applications. In this paper, a convolution neural network is used for estimating high-resolution depth image. We have used a pretrained model DenseNet-169 which is trained on ImageNet. The encoder-decoder model used is a simple Unet model, used in biomedical image analysis. The proposed model is more accurate and efficient with reduced complexity in terms of the fewer parameters used for training. This model is also noteworthy when comparing a state of art and qualitatively it performs well and captures better edges and corners of the depth map, which is the most important factor in the depth estimation.","PeriodicalId":426897,"journal":{"name":"2022 IEEE Bombay Section Signature Conference (IBSSC)","volume":"220 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Bombay Section Signature Conference (IBSSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBSSC56953.2022.10037503","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The problem of depth estimation using monocular images is very challenging contrasted to methods for estimating depth that involve several pictures, like stereo depth perception. Previous studies in this field have usually focused on utilizing geometrical priors or relied on other data collection techniques. Various machine learning methods, notably deep convolutional neural networks (CNN) integrated with artificial intelligence (AI) approaches, have recently produced new marks for a variety of visual applications. In this paper, a convolution neural network is used for estimating high-resolution depth image. We have used a pretrained model DenseNet-169 which is trained on ImageNet. The encoder-decoder model used is a simple Unet model, used in biomedical image analysis. The proposed model is more accurate and efficient with reduced complexity in terms of the fewer parameters used for training. This model is also noteworthy when comparing a state of art and qualitatively it performs well and captures better edges and corners of the depth map, which is the most important factor in the depth estimation.

查看原文本刊更多论文

基于迁移学习Unet模型的单眼图像深度估计

使用单眼图像的深度估计问题与涉及多幅图像的深度估计方法(如立体深度感知)相比非常具有挑战性。该领域以前的研究通常侧重于利用几何先验或依赖于其他数据收集技术。各种机器学习方法，特别是与人工智能(AI)方法相结合的深度卷积神经网络(CNN)，最近为各种视觉应用创造了新的标志。本文将卷积神经网络用于高分辨率深度图像的估计。我们使用了在ImageNet上训练的预训练模型DenseNet-169。所使用的编解码器模型是一个简单的Unet模型，用于生物医学图像分析。该模型在训练参数较少的情况下，具有更高的准确性和效率，降低了复杂度。该模型在比较当前技术和定性时也值得注意，它表现良好，并捕获了深度图中更好的边缘和角落，这是深度估计中最重要的因素。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE Bombay Section Signature Conference (IBSSC)

自引率

0.00%

发文量