{"title":"Leveraging more information for blind stereo super-resolution via large-window cross-attention","authors":"Weifeng Cao, Xiaoyan Lei, Yong Jiang, Zongfei Bai, Xiaoliang Qian","doi":"10.1016/j.asoc.2024.112492","DOIUrl":null,"url":null,"abstract":"<div><div>Stereo image super-resolution aims to reconstruct high-resolution images by effectively utilizing cross-view complementary information from stereo image pairs. A prevalent method in Stereo image super-resolution is the stereo cross-attention module, which allows the model to focus on and integrate relevant features from both the left and right views. Despite its advantages, our analysis using a diagnostic tool called local attribution map (LAM) reveals that current methods exhibit limitations in effectively leveraging this complementary information. To address this issue, we propose the Double Stereo Cross-Attention Module (DSCAM), which utilizes an Overlapping Stereo Cross-Attention (OSCA) mechanism that enhances the integration of cross-view complementary information by using overlapping windows, followed by an additional multiplication step to refine and emphasize the combined features. Additionally, we develop a stereo image degradation model that ensures the consistency of degradation between stereo pairs, accurately simulating the real-world degradation process of stereo images. Extensive experiments have demonstrated that our method achieves visually pleasing results, making it the first to address the problem of stereo image super-resolution in real-world scenarios. The source code is available at <span><span>https://github.com/nathan66666/LCASSR.git</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"168 ","pages":"Article 112492"},"PeriodicalIF":7.2000,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494624012663","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Stereo image super-resolution aims to reconstruct high-resolution images by effectively utilizing cross-view complementary information from stereo image pairs. A prevalent method in Stereo image super-resolution is the stereo cross-attention module, which allows the model to focus on and integrate relevant features from both the left and right views. Despite its advantages, our analysis using a diagnostic tool called local attribution map (LAM) reveals that current methods exhibit limitations in effectively leveraging this complementary information. To address this issue, we propose the Double Stereo Cross-Attention Module (DSCAM), which utilizes an Overlapping Stereo Cross-Attention (OSCA) mechanism that enhances the integration of cross-view complementary information by using overlapping windows, followed by an additional multiplication step to refine and emphasize the combined features. Additionally, we develop a stereo image degradation model that ensures the consistency of degradation between stereo pairs, accurately simulating the real-world degradation process of stereo images. Extensive experiments have demonstrated that our method achieves visually pleasing results, making it the first to address the problem of stereo image super-resolution in real-world scenarios. The source code is available at https://github.com/nathan66666/LCASSR.git.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.