Mingkui Zheng;Lin Luo;Haifeng Zheng;Zhangfan Ye;Zhe Su
{"title":"一种用于自监督单目深度估计的双编码器-解码器网络","authors":"Mingkui Zheng;Lin Luo;Haifeng Zheng;Zhangfan Ye;Zhe Su","doi":"10.1109/JSEN.2023.3296497","DOIUrl":null,"url":null,"abstract":"Depth estimation from a single image is a fundamental problem in the field of computer vision. With the great success of deep learning techniques, various self-supervised monocular depth estimation methods using encoder–decoder architectures have emerged. However, most previous approaches regress the depth map directly using a single encoder–decoder structure, which may not obtain sufficient features in the image and results in a depth map with low accuracy and blurred details. To improve the accuracy of self-supervised monocular depth estimation, we propose a simple but very effective scheme for depth estimation using a dual encoder–decoder structure network. Specifically, we introduce a novel global feature extraction network (GFN) to extract global features from images. GFN includes PoolAttentionFormer and ResBlock, which work together to extract and fuse hierarchical global features into the depth estimation network (DEN). To further improve the accuracy, we design two feature fusion mechanisms, including global feature fusion and multiscale fusion. The experimental results of various dual encoder–decoder combination schemes tested on the KITTI dataset show that our proposed one is effective in improving the accuracy of self-supervised monocular depth estimation, which reached 89.6% (\n<inline-formula> <tex-math>$\\delta < {1.25}$ </tex-math></inline-formula>\n).","PeriodicalId":4,"journal":{"name":"ACS Applied Energy Materials","volume":null,"pages":null},"PeriodicalIF":5.4000,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Dual Encoder–Decoder Network for Self-Supervised Monocular Depth Estimation\",\"authors\":\"Mingkui Zheng;Lin Luo;Haifeng Zheng;Zhangfan Ye;Zhe Su\",\"doi\":\"10.1109/JSEN.2023.3296497\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Depth estimation from a single image is a fundamental problem in the field of computer vision. With the great success of deep learning techniques, various self-supervised monocular depth estimation methods using encoder–decoder architectures have emerged. However, most previous approaches regress the depth map directly using a single encoder–decoder structure, which may not obtain sufficient features in the image and results in a depth map with low accuracy and blurred details. To improve the accuracy of self-supervised monocular depth estimation, we propose a simple but very effective scheme for depth estimation using a dual encoder–decoder structure network. Specifically, we introduce a novel global feature extraction network (GFN) to extract global features from images. GFN includes PoolAttentionFormer and ResBlock, which work together to extract and fuse hierarchical global features into the depth estimation network (DEN). To further improve the accuracy, we design two feature fusion mechanisms, including global feature fusion and multiscale fusion. The experimental results of various dual encoder–decoder combination schemes tested on the KITTI dataset show that our proposed one is effective in improving the accuracy of self-supervised monocular depth estimation, which reached 89.6% (\\n<inline-formula> <tex-math>$\\\\delta < {1.25}$ </tex-math></inline-formula>\\n).\",\"PeriodicalId\":4,\"journal\":{\"name\":\"ACS Applied Energy Materials\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.4000,\"publicationDate\":\"2023-07-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Energy Materials\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10190311/\",\"RegionNum\":3,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Energy Materials","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10190311/","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
A Dual Encoder–Decoder Network for Self-Supervised Monocular Depth Estimation
Depth estimation from a single image is a fundamental problem in the field of computer vision. With the great success of deep learning techniques, various self-supervised monocular depth estimation methods using encoder–decoder architectures have emerged. However, most previous approaches regress the depth map directly using a single encoder–decoder structure, which may not obtain sufficient features in the image and results in a depth map with low accuracy and blurred details. To improve the accuracy of self-supervised monocular depth estimation, we propose a simple but very effective scheme for depth estimation using a dual encoder–decoder structure network. Specifically, we introduce a novel global feature extraction network (GFN) to extract global features from images. GFN includes PoolAttentionFormer and ResBlock, which work together to extract and fuse hierarchical global features into the depth estimation network (DEN). To further improve the accuracy, we design two feature fusion mechanisms, including global feature fusion and multiscale fusion. The experimental results of various dual encoder–decoder combination schemes tested on the KITTI dataset show that our proposed one is effective in improving the accuracy of self-supervised monocular depth estimation, which reached 89.6% (
$\delta < {1.25}$
).
期刊介绍:
ACS Applied Energy Materials is an interdisciplinary journal publishing original research covering all aspects of materials, engineering, chemistry, physics and biology relevant to energy conversion and storage. The journal is devoted to reports of new and original experimental and theoretical research of an applied nature that integrate knowledge in the areas of materials, engineering, physics, bioscience, and chemistry into important energy applications.