{"title":"SqSFill:用于高保真图像绘制的联合空间和光谱学习","authors":"Zihao Zhang , Feifan Cai , Qin Zhou , Youdong Ding","doi":"10.1016/j.neucom.2025.130414","DOIUrl":null,"url":null,"abstract":"<div><div>Image inpainting has made significant progress due to recent advances in deep learning. However, most generative inpainting networks face challenges such as producing blurry results that lack high-frequency details or introducing inconsistent structures. To address these issues, we propose a novel transformer-based approach, SqSFill, which exploits rich information in both spatial and spectral domains. Specifically, SqSFill incorporates Rectified Frequency Feature Extractor (RecFFE) in the early layers of the network to capture fine-grained details by leveraging frequency information, guided by frequency loss. Moreover, we design a Scout Attention Block with linear complexity to replace vanilla self-attention, thereby effectively capturing long-range dependencies with lower computational cost. By integrating the RecFFE and Scout Attention Block, SqSFill is able to generate both coherent structures and sharp textures. Extensive experiments demonstrate the proposed SqSFill achieves superior results, outperforming previous state-of-the-art approaches with fewer parameters.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"645 ","pages":"Article 130414"},"PeriodicalIF":5.5000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SqSFill : Joint spatial and spectral learning for high-fidelity image inpainting\",\"authors\":\"Zihao Zhang , Feifan Cai , Qin Zhou , Youdong Ding\",\"doi\":\"10.1016/j.neucom.2025.130414\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Image inpainting has made significant progress due to recent advances in deep learning. However, most generative inpainting networks face challenges such as producing blurry results that lack high-frequency details or introducing inconsistent structures. To address these issues, we propose a novel transformer-based approach, SqSFill, which exploits rich information in both spatial and spectral domains. Specifically, SqSFill incorporates Rectified Frequency Feature Extractor (RecFFE) in the early layers of the network to capture fine-grained details by leveraging frequency information, guided by frequency loss. Moreover, we design a Scout Attention Block with linear complexity to replace vanilla self-attention, thereby effectively capturing long-range dependencies with lower computational cost. By integrating the RecFFE and Scout Attention Block, SqSFill is able to generate both coherent structures and sharp textures. Extensive experiments demonstrate the proposed SqSFill achieves superior results, outperforming previous state-of-the-art approaches with fewer parameters.</div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"645 \",\"pages\":\"Article 130414\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231225010860\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225010860","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
SqSFill : Joint spatial and spectral learning for high-fidelity image inpainting
Image inpainting has made significant progress due to recent advances in deep learning. However, most generative inpainting networks face challenges such as producing blurry results that lack high-frequency details or introducing inconsistent structures. To address these issues, we propose a novel transformer-based approach, SqSFill, which exploits rich information in both spatial and spectral domains. Specifically, SqSFill incorporates Rectified Frequency Feature Extractor (RecFFE) in the early layers of the network to capture fine-grained details by leveraging frequency information, guided by frequency loss. Moreover, we design a Scout Attention Block with linear complexity to replace vanilla self-attention, thereby effectively capturing long-range dependencies with lower computational cost. By integrating the RecFFE and Scout Attention Block, SqSFill is able to generate both coherent structures and sharp textures. Extensive experiments demonstrate the proposed SqSFill achieves superior results, outperforming previous state-of-the-art approaches with fewer parameters.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.