语义准确的超分辨率生成对抗网络

Comput. Vis. Image Underst. Pub Date : 2022-05-01 DOI:10.48550/arXiv.2205.08659

Tristan Frizza, D. Dansereau, Nagita Mehr Seresht, M. Bewley

{"title":"语义准确的超分辨率生成对抗网络","authors":"Tristan Frizza, D. Dansereau, Nagita Mehr Seresht, M. Bewley","doi":"10.48550/arXiv.2205.08659","DOIUrl":null,"url":null,"abstract":"This work addresses the problems of semantic segmentation and image super-resolution by jointly considering the performance of both in training a Generative Adversarial Network (GAN). We propose a novel architecture and domain-speciﬁc feature loss, allowing super-resolution to operate as a pre-processing step to increase the performance of downstream computer vision tasks, speciﬁcally semantic segmentation. We demonstrate this approach using Nearmap’s aerial imagery dataset which covers hundreds of urban areas at 5-7 cm per pixel resolution. We show the proposed approach improves perceived image quality as well as quantitative segmentation accuracy across all prediction classes, yielding an average accuracy improvement of 11.8% and 108% at 4 × and 32 × super-resolution, compared with state-of-the art single-network methods. This work demonstrates that jointly considering image-based and task-speciﬁc losses can improve the performance of both, and advances the state-of-the-art in semantic-aware super-resolution of aerial imagery. 1: A comparison of of three potential generator model architec- tures for 4 × super-resolution. We chose RRDN for all subsequent ex-periments due to its superior overall performance on pixel-wise loss","PeriodicalId":10549,"journal":{"name":"Comput. Vis. Image Underst.","volume":"31 1","pages":"103464"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Semantically Accurate Super-Resolution Generative Adversarial Networks\",\"authors\":\"Tristan Frizza, D. Dansereau, Nagita Mehr Seresht, M. Bewley\",\"doi\":\"10.48550/arXiv.2205.08659\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work addresses the problems of semantic segmentation and image super-resolution by jointly considering the performance of both in training a Generative Adversarial Network (GAN). We propose a novel architecture and domain-speciﬁc feature loss, allowing super-resolution to operate as a pre-processing step to increase the performance of downstream computer vision tasks, speciﬁcally semantic segmentation. We demonstrate this approach using Nearmap’s aerial imagery dataset which covers hundreds of urban areas at 5-7 cm per pixel resolution. We show the proposed approach improves perceived image quality as well as quantitative segmentation accuracy across all prediction classes, yielding an average accuracy improvement of 11.8% and 108% at 4 × and 32 × super-resolution, compared with state-of-the art single-network methods. This work demonstrates that jointly considering image-based and task-speciﬁc losses can improve the performance of both, and advances the state-of-the-art in semantic-aware super-resolution of aerial imagery. 1: A comparison of of three potential generator model architec- tures for 4 × super-resolution. We chose RRDN for all subsequent ex-periments due to its superior overall performance on pixel-wise loss\",\"PeriodicalId\":10549,\"journal\":{\"name\":\"Comput. Vis. Image Underst.\",\"volume\":\"31 1\",\"pages\":\"103464\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Comput. Vis. Image Underst.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2205.08659\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Comput. Vis. Image Underst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2205.08659","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

这项工作通过联合考虑两者在训练生成对抗网络(GAN)中的性能来解决语义分割和图像超分辨率的问题。我们提出了一种新的架构和特定领域的特征损失，允许超分辨率作为预处理步骤来提高下游计算机视觉任务的性能，特别是语义分割。我们使用Nearmap的航空图像数据集来演示这种方法，该数据集以每像素5-7厘米的分辨率覆盖了数百个城市地区。我们表明，所提出的方法提高了感知图像质量以及所有预测类别的定量分割精度，与最先进的单网络方法相比，在4 ×和32 ×超分辨率下的平均精度提高了11.8%和108%。这项工作表明，联合考虑基于图像和特定任务的损失可以提高两者的性能，并推进了航空图像语义感知超分辨率的最新技术。1 . 4 ×超分辨率三种潜在发电机模型体系结构的比较。我们选择RRDN进行所有后续实验，因为它在像素级损失方面的整体性能优越

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Semantically Accurate Super-Resolution Generative Adversarial Networks

This work addresses the problems of semantic segmentation and image super-resolution by jointly considering the performance of both in training a Generative Adversarial Network (GAN). We propose a novel architecture and domain-speciﬁc feature loss, allowing super-resolution to operate as a pre-processing step to increase the performance of downstream computer vision tasks, speciﬁcally semantic segmentation. We demonstrate this approach using Nearmap’s aerial imagery dataset which covers hundreds of urban areas at 5-7 cm per pixel resolution. We show the proposed approach improves perceived image quality as well as quantitative segmentation accuracy across all prediction classes, yielding an average accuracy improvement of 11.8% and 108% at 4 × and 32 × super-resolution, compared with state-of-the art single-network methods. This work demonstrates that jointly considering image-based and task-speciﬁc losses can improve the performance of both, and advances the state-of-the-art in semantic-aware super-resolution of aerial imagery. 1: A comparison of of three potential generator model architec- tures for 4 × super-resolution. We chose RRDN for all subsequent ex-periments due to its superior overall performance on pixel-wise loss

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Comput. Vis. Image Underst.

自引率

0.00%

发文量