{"title":"Image Inpainting Using Channel Attention and Hierarchical Residual Networks","authors":"Hao Yang, Yingzhen Yu","doi":"10.3724/sp.j.1089.2021.18514","DOIUrl":null,"url":null,"abstract":"Existing deep-learning-based inpainting methods may have some shortcomings in perceiving and presenting image information at multi-scales. For this problem, we proposed an image inpainting model based on multi-scale channel attention and a hierarchical residual backbone network. Firstly, we adopted a U-Net architecture as the generator backbone of our inpainting model to encode and decode the damaged image. Secondly, we built multi-scale hierarchical residual structures in the encoder and decoder respectively, which can improve the ability of the model to extract and express occluded image features. Finally, we designed a dilated multi-scale channel-attention block and inserted it into the skip-connection of the generator. This block can improve the utilization efficiency of low-level features in the encoder. Experimental results show that our model outperforms other classical inpainting approaches in the face, street-view inpainting tasks, both qualitatively and quantitatively.","PeriodicalId":52442,"journal":{"name":"计算机辅助设计与图形学学报","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"计算机辅助设计与图形学学报","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.3724/sp.j.1089.2021.18514","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 4
Abstract
Existing deep-learning-based inpainting methods may have some shortcomings in perceiving and presenting image information at multi-scales. For this problem, we proposed an image inpainting model based on multi-scale channel attention and a hierarchical residual backbone network. Firstly, we adopted a U-Net architecture as the generator backbone of our inpainting model to encode and decode the damaged image. Secondly, we built multi-scale hierarchical residual structures in the encoder and decoder respectively, which can improve the ability of the model to extract and express occluded image features. Finally, we designed a dilated multi-scale channel-attention block and inserted it into the skip-connection of the generator. This block can improve the utilization efficiency of low-level features in the encoder. Experimental results show that our model outperforms other classical inpainting approaches in the face, street-view inpainting tasks, both qualitatively and quantitatively.