A Self-Attention based Network for Low Resolution Multi-View Stereo

2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE), Pub Date: 2023-01-06, DOI: 10.1109/ICCECE58074.2023.10135325

Weijuan Li, R. Jia


Citations: 0

Abstract


A Self-Attention based Network for Low Resolution Multi-View Stereo

We present SA-MVSNet, a novel two-stage multi-view stereo network equipped with a self-attention mechanism that improves the quality of 3D reconstruction from low-resolution images. The first stage predicts lower-resolution depth maps, which serve as prior information for the second stage. To make fuller use of the image information, a pyramid scheme fuses feature maps at different resolutions. In addition, an improved self-attention module in the first stage raises reconstruction accuracy by learning long-range dependencies in the feature maps. Experiments on the DTU dataset show promising results on both the completeness and accuracy metrics of the reconstructed 3D scenes.
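The abstract does not say how the "improved" self-attention module is built, so the following is only a minimal sketch of plain scaled dot-product self-attention over a feature map, with a residual connection. It is not the authors' module. The function name `self_attention_2d`, the projection matrices `w_q`, `w_k`, `w_v`, and all shapes are assumptions made for illustration. It shows how every spatial position can draw on every other position, which is how long-range dependencies are usually captured.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention_2d(feat, w_q, w_k, w_v):
    """Generic self-attention over a (C, H, W) feature map (hypothetical helper).

    w_q, w_k: (C, D) projections; w_v: (C, C) projection.
    Returns the attended map (C, H, W) and the attention matrix (H*W, H*W).
    """
    c, h, w = feat.shape
    tokens = feat.reshape(c, h * w).T            # one row per pixel: (H*W, C)
    q = tokens @ w_q                             # queries (H*W, D)
    k = tokens @ w_k                             # keys    (H*W, D)
    v = tokens @ w_v                             # values  (H*W, C)
    # Each pixel scores its similarity to every other pixel.
    attn = softmax(q @ k.T / np.sqrt(q.shape[1]), axis=-1)
    out = tokens + attn @ v                      # residual connection
    return out.T.reshape(c, h, w), attn

# Toy low-resolution feature map: 8 channels, 6 x 5 pixels (made-up sizes).
rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 6, 5))
w_q = rng.standard_normal((8, 4)) * 0.1
w_k = rng.standard_normal((8, 4)) * 0.1
w_v = rng.standard_normal((8, 8)) * 0.1

out, attn = self_attention_2d(feat, w_q, w_k, w_v)
print(out.shape, attn.shape)
```

The attention matrix has one row per pixel, so its memory cost grows with the square of the pixel count. That may be one reason to apply such a module at the lower-resolution first stage, though the abstract does not state the authors' motivation.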
