Multi-view stereo with recurrent neural networks for spatio-temporal consistent depth maps

2023 International Conference on Electronics, Information, and Communication (ICEIC) Pub Date : 2023-02-05 DOI:10.1109/ICEIC57457.2023.10049937

Hosung Son, Suk-ju Kang

引用次数: 0

Abstract

Depth estimation methods based on deep learning have been studied to improve depth estimation accuracy. However, obtaining inter-frame consistency in depth maps in video depth estimation remains a challenge. Therefore, we proposed an application methodology for spatio-temporal consistency enhancement in video depth estimation based on convolutional neural networks (CNNs) and recurrent neural networks (RNNs). In other words, the convolutional long-short term memory (ConvLSTM) module was added to the decoder of depth estimation network to enable the use of the information from the previous frames. Additionally, the one-stage learning process was implemented to ensure ease of training. In conclusion, we experimentally show that the proposed method can achieve not only improved accuracy also consistency between depth map frames.

查看原文本刊更多论文

基于递归神经网络的时空一致深度图多视点立体

为了提高深度估计的精度，研究了基于深度学习的深度估计方法。然而，在视频深度估计中，如何获得深度图的帧间一致性仍然是一个难题。因此，我们提出了一种基于卷积神经网络(cnn)和递归神经网络(rnn)的视频深度估计时空一致性增强应用方法。换句话说，在深度估计网络的解码器中加入卷积长短期记忆(ConvLSTM)模块，使前一帧的信息能够被使用。此外，还实施了一阶段学习过程，以确保培训的便利性。实验结果表明，该方法不仅提高了深度图的精度，而且提高了深度图帧间的一致性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 International Conference on Electronics, Information, and Communication (ICEIC)

自引率

0.00%

发文量