Mingkui Zheng;Lin Luo;Haifeng Zheng;Zhangfan Ye;Zhe Su
{"title":"A Dual Encoder–Decoder Network for Self-Supervised Monocular Depth Estimation","authors":"Mingkui Zheng;Lin Luo;Haifeng Zheng;Zhangfan Ye;Zhe Su","doi":"10.1109/JSEN.2023.3296497","DOIUrl":null,"url":null,"abstract":"Depth estimation from a single image is a fundamental problem in the field of computer vision. With the great success of deep learning techniques, various self-supervised monocular depth estimation methods using encoder–decoder architectures have emerged. However, most previous approaches regress the depth map directly using a single encoder–decoder structure, which may not obtain sufficient features in the image and results in a depth map with low accuracy and blurred details. To improve the accuracy of self-supervised monocular depth estimation, we propose a simple but very effective scheme for depth estimation using a dual encoder–decoder structure network. Specifically, we introduce a novel global feature extraction network (GFN) to extract global features from images. GFN includes PoolAttentionFormer and ResBlock, which work together to extract and fuse hierarchical global features into the depth estimation network (DEN). To further improve the accuracy, we design two feature fusion mechanisms, including global feature fusion and multiscale fusion. The experimental results of various dual encoder–decoder combination schemes tested on the KITTI dataset show that our proposed one is effective in improving the accuracy of self-supervised monocular depth estimation, which reached 89.6% (\n<inline-formula> <tex-math>$\\delta < {1.25}$ </tex-math></inline-formula>\n).","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"23 17","pages":"19747-19756"},"PeriodicalIF":4.3000,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10190311/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Depth estimation from a single image is a fundamental problem in the field of computer vision. With the great success of deep learning techniques, various self-supervised monocular depth estimation methods using encoder–decoder architectures have emerged. However, most previous approaches regress the depth map directly using a single encoder–decoder structure, which may not obtain sufficient features in the image and results in a depth map with low accuracy and blurred details. To improve the accuracy of self-supervised monocular depth estimation, we propose a simple but very effective scheme for depth estimation using a dual encoder–decoder structure network. Specifically, we introduce a novel global feature extraction network (GFN) to extract global features from images. GFN includes PoolAttentionFormer and ResBlock, which work together to extract and fuse hierarchical global features into the depth estimation network (DEN). To further improve the accuracy, we design two feature fusion mechanisms, including global feature fusion and multiscale fusion. The experimental results of various dual encoder–decoder combination schemes tested on the KITTI dataset show that our proposed one is effective in improving the accuracy of self-supervised monocular depth estimation, which reached 89.6% (
$\delta < {1.25}$
).
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice