Ziyuan Wang , Jianzhong Cao , Gaopeng Zhang , Minhao Zhang , Boxue Zhang , Weining Chen , Xin Ma , Feng Wang
{"title":"Frequency-enhanced representation and cost aggregation for multi-view stereo","authors":"Ziyuan Wang , Jianzhong Cao , Gaopeng Zhang , Minhao Zhang , Boxue Zhang , Weining Chen , Xin Ma , Feng Wang","doi":"10.1016/j.sigpro.2025.110280","DOIUrl":null,"url":null,"abstract":"<div><div>Cascade-based multi-view stereo (MVS) methods demonstrate exceptional flexibility and efficiency in 3D reconstruction tasks. However, existing methods predominantly focus on pixel-wise correlations in the spatial domain while overlooking the critical role of frequency-domain information essential for modeling challenging scenarios, leading to suboptimal 3D geometric reconstruction. Furthermore, downsampling during multi-scale feature extraction may lead to the loss of critical spatial details, undermining the fidelity of depth estimation in visually degraded scenes. In this paper, we propose FA-MVS, a framework that explicitly incorporates frequency information into multi-scale depth estimation to enhance frequency awareness. Specifically, we propose a frequency-enhanced feature extractor, where frequency representations fused with spatial depth priors are progressively refined to bolster robustness against frequency-sensitive variations. Meanwhile, we propose a frequency-aware cost aggregation module that integrates frequency cues into the cost volume, enabling precise capture of fine details in boundaries and occluded regions. Extensive experiments conducted on the DTU, Tanks and Temples, as well as the challenging ETH3D datasets, demonstrate that our method achieves competitive performance compared to existing advanced approaches while exhibiting strong generalization capability.</div></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"239 ","pages":"Article 110280"},"PeriodicalIF":3.6000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168425003949","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Cascade-based multi-view stereo (MVS) methods demonstrate exceptional flexibility and efficiency in 3D reconstruction tasks. However, existing methods predominantly focus on pixel-wise correlations in the spatial domain while overlooking the critical role of frequency-domain information essential for modeling challenging scenarios, leading to suboptimal 3D geometric reconstruction. Furthermore, downsampling during multi-scale feature extraction may lead to the loss of critical spatial details, undermining the fidelity of depth estimation in visually degraded scenes. In this paper, we propose FA-MVS, a framework that explicitly incorporates frequency information into multi-scale depth estimation to enhance frequency awareness. Specifically, we propose a frequency-enhanced feature extractor, where frequency representations fused with spatial depth priors are progressively refined to bolster robustness against frequency-sensitive variations. Meanwhile, we propose a frequency-aware cost aggregation module that integrates frequency cues into the cost volume, enabling precise capture of fine details in boundaries and occluded regions. Extensive experiments conducted on the DTU, Tanks and Temples, as well as the challenging ETH3D datasets, demonstrate that our method achieves competitive performance compared to existing advanced approaches while exhibiting strong generalization capability.
期刊介绍:
Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing.
Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.