基于通道注意力和层次残差网络的图像修复

Q3 Computer Science
Hao Yang, Yingzhen Yu
{"title":"基于通道注意力和层次残差网络的图像修复","authors":"Hao Yang, Yingzhen Yu","doi":"10.3724/sp.j.1089.2021.18514","DOIUrl":null,"url":null,"abstract":"Existing deep-learning-based inpainting methods may have some shortcomings in perceiving and presenting image information at multi-scales. For this problem, we proposed an image inpainting model based on multi-scale channel attention and a hierarchical residual backbone network. Firstly, we adopted a U-Net architecture as the generator backbone of our inpainting model to encode and decode the damaged image. Secondly, we built multi-scale hierarchical residual structures in the encoder and decoder respectively, which can improve the ability of the model to extract and express occluded image features. Finally, we designed a dilated multi-scale channel-attention block and inserted it into the skip-connection of the generator. This block can improve the utilization efficiency of low-level features in the encoder. Experimental results show that our model outperforms other classical inpainting approaches in the face, street-view inpainting tasks, both qualitatively and quantitatively.","PeriodicalId":52442,"journal":{"name":"计算机辅助设计与图形学学报","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Image Inpainting Using Channel Attention and Hierarchical Residual Networks\",\"authors\":\"Hao Yang, Yingzhen Yu\",\"doi\":\"10.3724/sp.j.1089.2021.18514\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Existing deep-learning-based inpainting methods may have some shortcomings in perceiving and presenting image information at multi-scales. For this problem, we proposed an image inpainting model based on multi-scale channel attention and a hierarchical residual backbone network. Firstly, we adopted a U-Net architecture as the generator backbone of our inpainting model to encode and decode the damaged image. Secondly, we built multi-scale hierarchical residual structures in the encoder and decoder respectively, which can improve the ability of the model to extract and express occluded image features. Finally, we designed a dilated multi-scale channel-attention block and inserted it into the skip-connection of the generator. This block can improve the utilization efficiency of low-level features in the encoder. Experimental results show that our model outperforms other classical inpainting approaches in the face, street-view inpainting tasks, both qualitatively and quantitatively.\",\"PeriodicalId\":52442,\"journal\":{\"name\":\"计算机辅助设计与图形学学报\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"计算机辅助设计与图形学学报\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://doi.org/10.3724/sp.j.1089.2021.18514\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"计算机辅助设计与图形学学报","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.3724/sp.j.1089.2021.18514","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 4

摘要

现有的基于深度学习的图像绘制方法在感知和呈现多尺度图像信息方面存在不足。针对这一问题,我们提出了一种基于多尺度通道关注和分层残差骨干网的图像补图模型。首先,我们采用U-Net架构作为修复模型的生成主干,对受损图像进行编码和解码。其次,分别在编码器和解码器中构建多尺度分层残差结构,提高模型提取和表达遮挡图像特征的能力;最后,我们设计了一个扩展的多尺度通道注意块,并将其插入到发生器的跳接中。该块可以提高编码器中底层特征的利用效率。实验结果表明,该模型在人脸、街景图像绘制任务中的定性和定量上都优于其他经典图像绘制方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Image Inpainting Using Channel Attention and Hierarchical Residual Networks
Existing deep-learning-based inpainting methods may have some shortcomings in perceiving and presenting image information at multi-scales. For this problem, we proposed an image inpainting model based on multi-scale channel attention and a hierarchical residual backbone network. Firstly, we adopted a U-Net architecture as the generator backbone of our inpainting model to encode and decode the damaged image. Secondly, we built multi-scale hierarchical residual structures in the encoder and decoder respectively, which can improve the ability of the model to extract and express occluded image features. Finally, we designed a dilated multi-scale channel-attention block and inserted it into the skip-connection of the generator. This block can improve the utilization efficiency of low-level features in the encoder. Experimental results show that our model outperforms other classical inpainting approaches in the face, street-view inpainting tasks, both qualitatively and quantitatively.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
计算机辅助设计与图形学学报
计算机辅助设计与图形学学报 Computer Science-Computer Graphics and Computer-Aided Design
CiteScore
1.20
自引率
0.00%
发文量
6833
期刊介绍:
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信