Yufeng Yin , Xiaoyan Liu , Qing Fan , Zichao Zhang
{"title":"A non-local adaptive hypothesis propagation for multi-view stereo","authors":"Yufeng Yin , Xiaoyan Liu , Qing Fan , Zichao Zhang","doi":"10.1016/j.imavis.2025.105704","DOIUrl":null,"url":null,"abstract":"<div><div>Hypothesis propagation is a central component of PatchMatch-based multi-view stereo and significantly impacts the reconstruction performance. However, current propagation methods rely on photometric consistency to guide hypothesis propagation within a local area. When the centroid is located in a low-textured area with reflective or refractive properties, high chromatic aberration may cause the multi-view matching to fall into a local optimum that fails to provide reliable hypotheses, leading to reconstruction errors. To address this problem, we propose a non-local adaptive hypothesis propagation scheme. First, we evenly distribute sampling points in eight directions on the checkerboard to quickly determine reliable initial hypotheses. Then, starting from the initial hypotheses generated in the eight directions of the checkerboard, the hypotheses are adaptively propagated to non-checkerboard areas based on matching cost, reducing interference from unreliable photometric consistency and improving reconstruction performance in challenging areas. The test results on large-scale benchmarks show that the proposed scheme has significant advantages in reconstructing challenging areas. It can significantly improve the completeness of point clouds from current state-of-the-art methods and outperform existing propagation schemes.</div></div>","PeriodicalId":50374,"journal":{"name":"Image and Vision Computing","volume":"162 ","pages":"Article 105704"},"PeriodicalIF":4.2000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Image and Vision Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0262885625002926","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Hypothesis propagation is a central component of PatchMatch-based multi-view stereo and significantly impacts the reconstruction performance. However, current propagation methods rely on photometric consistency to guide hypothesis propagation within a local area. When the centroid is located in a low-textured area with reflective or refractive properties, high chromatic aberration may cause the multi-view matching to fall into a local optimum that fails to provide reliable hypotheses, leading to reconstruction errors. To address this problem, we propose a non-local adaptive hypothesis propagation scheme. First, we evenly distribute sampling points in eight directions on the checkerboard to quickly determine reliable initial hypotheses. Then, starting from the initial hypotheses generated in the eight directions of the checkerboard, the hypotheses are adaptively propagated to non-checkerboard areas based on matching cost, reducing interference from unreliable photometric consistency and improving reconstruction performance in challenging areas. The test results on large-scale benchmarks show that the proposed scheme has significant advantages in reconstructing challenging areas. It can significantly improve the completeness of point clouds from current state-of-the-art methods and outperform existing propagation schemes.
期刊介绍:
Image and Vision Computing has as a primary aim the provision of an effective medium of interchange for the results of high quality theoretical and applied research fundamental to all aspects of image interpretation and computer vision. The journal publishes work that proposes new image interpretation and computer vision methodology or addresses the application of such methods to real world scenes. It seeks to strengthen a deeper understanding in the discipline by encouraging the quantitative comparison and performance evaluation of the proposed methodology. The coverage includes: image interpretation, scene modelling, object recognition and tracking, shape analysis, monitoring and surveillance, active vision and robotic systems, SLAM, biologically-inspired computer vision, motion analysis, stereo vision, document image understanding, character and handwritten text recognition, face and gesture recognition, biometrics, vision-based human-computer interaction, human activity and behavior understanding, data fusion from multiple sensor inputs, image databases.