{"title":"基于迁移学习Unet模型的单眼图像深度估计","authors":"Suchitra A. Patil, Chandrakant J. Gaikwad","doi":"10.1109/IBSSC56953.2022.10037503","DOIUrl":null,"url":null,"abstract":"The problem of depth estimation using monocular images is very challenging contrasted to methods for estimating depth that involve several pictures, like stereo depth perception. Previous studies in this field have usually focused on utilizing geometrical priors or relied on other data collection techniques. Various machine learning methods, notably deep convolutional neural networks (CNN) integrated with artificial intelligence (AI) approaches, have recently produced new marks for a variety of visual applications. In this paper, a convolution neural network is used for estimating high-resolution depth image. We have used a pretrained model DenseNet-169 which is trained on ImageNet. The encoder-decoder model used is a simple Unet model, used in biomedical image analysis. The proposed model is more accurate and efficient with reduced complexity in terms of the fewer parameters used for training. This model is also noteworthy when comparing a state of art and qualitatively it performs well and captures better edges and corners of the depth map, which is the most important factor in the depth estimation.","PeriodicalId":426897,"journal":{"name":"2022 IEEE Bombay Section Signature Conference (IBSSC)","volume":"220 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Depth Estimation of Monocular Images using Transfer Learning based Unet Model\",\"authors\":\"Suchitra A. Patil, Chandrakant J. Gaikwad\",\"doi\":\"10.1109/IBSSC56953.2022.10037503\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The problem of depth estimation using monocular images is very challenging contrasted to methods for estimating depth that involve several pictures, like stereo depth perception. Previous studies in this field have usually focused on utilizing geometrical priors or relied on other data collection techniques. Various machine learning methods, notably deep convolutional neural networks (CNN) integrated with artificial intelligence (AI) approaches, have recently produced new marks for a variety of visual applications. In this paper, a convolution neural network is used for estimating high-resolution depth image. We have used a pretrained model DenseNet-169 which is trained on ImageNet. The encoder-decoder model used is a simple Unet model, used in biomedical image analysis. The proposed model is more accurate and efficient with reduced complexity in terms of the fewer parameters used for training. This model is also noteworthy when comparing a state of art and qualitatively it performs well and captures better edges and corners of the depth map, which is the most important factor in the depth estimation.\",\"PeriodicalId\":426897,\"journal\":{\"name\":\"2022 IEEE Bombay Section Signature Conference (IBSSC)\",\"volume\":\"220 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE Bombay Section Signature Conference (IBSSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IBSSC56953.2022.10037503\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Bombay Section Signature Conference (IBSSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBSSC56953.2022.10037503","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Depth Estimation of Monocular Images using Transfer Learning based Unet Model
The problem of depth estimation using monocular images is very challenging contrasted to methods for estimating depth that involve several pictures, like stereo depth perception. Previous studies in this field have usually focused on utilizing geometrical priors or relied on other data collection techniques. Various machine learning methods, notably deep convolutional neural networks (CNN) integrated with artificial intelligence (AI) approaches, have recently produced new marks for a variety of visual applications. In this paper, a convolution neural network is used for estimating high-resolution depth image. We have used a pretrained model DenseNet-169 which is trained on ImageNet. The encoder-decoder model used is a simple Unet model, used in biomedical image analysis. The proposed model is more accurate and efficient with reduced complexity in terms of the fewer parameters used for training. This model is also noteworthy when comparing a state of art and qualitatively it performs well and captures better edges and corners of the depth map, which is the most important factor in the depth estimation.