Structure-Aware Image Expansion with Global Attention

SIGGRAPH Asia 2019 Technical Briefs Pub Date : 2019-11-17 DOI:10.1145/3355088.3365161

Dewen Guo, J. Feng, Bingfeng Zhou

{"title":"Structure-Aware Image Expansion with Global Attention","authors":"Dewen Guo, J. Feng, Bingfeng Zhou","doi":"10.1145/3355088.3365161","DOIUrl":null,"url":null,"abstract":"We present a novel structure-aware strategy for image expansion which aims to complete an image from a small patch. Different from image inpainting, the majority of the pixels are absent here. Hence, there are higher requirements for global structure-aware prediction to produce visually plausible results. Thus, treating the expansion tasks as inpainting from the outside is ill-posed. Therefore, we propose a learning-based method combining structure-aware and visual attention strategies to make better prediction. Our architecture consists of two stages. Since visual attention cannot be taken full advantage of when the global structure is absent, we first use the ImageNet-pre-trained VGG-19 to make the structure-aware prediction on the pre-training stage. Then, we implement a non-local attention layer on the coarsely-completed results on the refining stage. Our network can well predict the global structures and semantic details from small input image patches, and generate full images with structural consistency. We apply our method on a human face dataset, which containing rich semantic and structural details. The results show its stability and effectiveness.","PeriodicalId":435930,"journal":{"name":"SIGGRAPH Asia 2019 Technical Briefs","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SIGGRAPH Asia 2019 Technical Briefs","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3355088.3365161","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

We present a novel structure-aware strategy for image expansion which aims to complete an image from a small patch. Different from image inpainting, the majority of the pixels are absent here. Hence, there are higher requirements for global structure-aware prediction to produce visually plausible results. Thus, treating the expansion tasks as inpainting from the outside is ill-posed. Therefore, we propose a learning-based method combining structure-aware and visual attention strategies to make better prediction. Our architecture consists of two stages. Since visual attention cannot be taken full advantage of when the global structure is absent, we first use the ImageNet-pre-trained VGG-19 to make the structure-aware prediction on the pre-training stage. Then, we implement a non-local attention layer on the coarsely-completed results on the refining stage. Our network can well predict the global structures and semantic details from small input image patches, and generate full images with structural consistency. We apply our method on a human face dataset, which containing rich semantic and structural details. The results show its stability and effectiveness.

查看原文本刊更多论文

结构感知图像扩展与全球关注

我们提出了一种新的结构感知图像扩展策略，旨在从一个小块完成图像。与图像上漆不同的是，大多数像素在这里是缺席的。因此，对全局结构感知预测提出了更高的要求，以产生视觉上可信的结果。因此，将扩展任务视为从外部进行粉刷是不恰当的。因此，我们提出了一种结合结构感知和视觉注意策略的基于学习的方法来进行更好的预测。我们的架构由两个阶段组成。由于全局结构缺失时，视觉注意力无法被充分利用，我们首先使用imagenet预训练的VGG-19在预训练阶段进行结构感知预测。然后，在精炼阶段，我们在粗完成的结果上实现非局部关注层。我们的网络可以很好地预测小块输入图像的整体结构和语义细节，并生成结构一致的完整图像。我们将该方法应用于包含丰富语义和结构细节的人脸数据集。实验结果表明了该方法的稳定性和有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

SIGGRAPH Asia 2019 Technical Briefs

自引率

0.00%

发文量