Mohannad Al-Jaafari , Firas Abedi , You Yang , Qiong Liu
{"title":"Corner selection and dual network blender for efficient view synthesis in outdoor scenes","authors":"Mohannad Al-Jaafari , Firas Abedi , You Yang , Qiong Liu","doi":"10.1016/j.patcog.2025.111668","DOIUrl":null,"url":null,"abstract":"<div><div>Novel view synthesis (NVS) from freely distributed viewpoints from large-scale outdoor scenes offers a compelling user experience in several applications, including first-person hyper-lapse videos and virtual reality. However, high-quality NVS requires dense input images, which can be affected by color discrepancies caused by extreme brightness changes, varying viewing angles, incorrect estimation of camera parameters, and mobile objects. This paper introduces Competent View Synthesis (CVS), a cost-effective approach to generating high-quality NVS from large-scale outdoor scenes to address these challenges. CVS employs a three-stage pipeline, including a Corners Selection Algorithm (CSA) to reduce the number of required input images, a Tinkering mechanism to fill in missing pixel data, and a dual-network blending (DNB) model to fuse colors and calculate attention coefficients for feature refinement. The experimental results demonstrate the effectiveness of the proposed CVS in generating realistic viewpoints from a limited number of input viewpoints. Furthermore, the comparative evaluations against two baselines using metrics such as PSNR, SSIM, and LPIPS reveal significant performance improvements of 4.6%, 0.18%, and 30.13%, respectively, over the first baseline, while the proposed method significantly outperforms the second baseline in term of perceptual metrics. By optimizing the synthesis process for complex outdoor scenes, CVS enhances the quality of generated images and improves computational efficiency.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"166 ","pages":"Article 111668"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325003280","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Novel view synthesis (NVS) from freely distributed viewpoints from large-scale outdoor scenes offers a compelling user experience in several applications, including first-person hyper-lapse videos and virtual reality. However, high-quality NVS requires dense input images, which can be affected by color discrepancies caused by extreme brightness changes, varying viewing angles, incorrect estimation of camera parameters, and mobile objects. This paper introduces Competent View Synthesis (CVS), a cost-effective approach to generating high-quality NVS from large-scale outdoor scenes to address these challenges. CVS employs a three-stage pipeline, including a Corners Selection Algorithm (CSA) to reduce the number of required input images, a Tinkering mechanism to fill in missing pixel data, and a dual-network blending (DNB) model to fuse colors and calculate attention coefficients for feature refinement. The experimental results demonstrate the effectiveness of the proposed CVS in generating realistic viewpoints from a limited number of input viewpoints. Furthermore, the comparative evaluations against two baselines using metrics such as PSNR, SSIM, and LPIPS reveal significant performance improvements of 4.6%, 0.18%, and 30.13%, respectively, over the first baseline, while the proposed method significantly outperforms the second baseline in term of perceptual metrics. By optimizing the synthesis process for complex outdoor scenes, CVS enhances the quality of generated images and improves computational efficiency.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.