3D-C2FT: Coarse-to-fine Transformer for Multi-view 3D Reconstruction

Computer vision - ACCV ... : ... Asian Conference on Computer Vision : proceedings. Asian Conference on Computer Vision Pub Date : 2022-05-29 DOI:10.48550/arXiv.2205.14575

L. Tiong, Dick Sigmund, A. Teoh

引用次数: 3

Abstract

Recently, the transformer model has been successfully employed for the multi-view 3D reconstruction problem. However, challenges remain on designing an attention mechanism to explore the multiview features and exploit their relations for reinforcing the encoding-decoding modules. This paper proposes a new model, namely 3D coarse-to-fine transformer (3D-C2FT), by introducing a novel coarse-to-fine(C2F) attention mechanism for encoding multi-view features and rectifying defective 3D objects. C2F attention mechanism enables the model to learn multi-view information flow and synthesize 3D surface correction in a coarse to fine-grained manner. The proposed model is evaluated by ShapeNet and Multi-view Real-life datasets. Experimental results show that 3D-C2FT achieves notable results and outperforms several competing models on these datasets.

查看原文本刊更多论文

3D- c2ft:用于多视图三维重建的粗到精变压器

近年来，变压器模型已成功地用于多视图三维重建问题。然而，如何设计一种关注机制来探索多视图特征，并利用它们之间的关系来加强编解码模块，仍然是一个挑战。本文通过引入一种新的粗到精(C2F)注意机制，对多视图特征进行编码，并对有缺陷的三维物体进行校正，提出了一种新的模型3D粗到精变压器(3D- c2ft)。C2F注意机制使模型能够学习多视图信息流，并以粗粒度到细粒度的方式综合三维曲面校正。该模型通过ShapeNet和多视图真实数据集进行了评估。实验结果表明，3D-C2FT在这些数据集上取得了显著的效果，并且优于几种竞争模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer vision - ACCV ... : ... Asian Conference on Computer Vision : proceedings. Asian Conference on Computer Vision

自引率

0.00%

发文量