使用深度迁移学习的图像着色

2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS) Pub Date : 2020-09-01 DOI:10.1109/CFIS49607.2020.9238737

Leila Kiani, Masoudnia Saeed, H. Nezamabadi-pour

{"title":"使用深度迁移学习的图像着色","authors":"Leila Kiani, Masoudnia Saeed, H. Nezamabadi-pour","doi":"10.1109/CFIS49607.2020.9238737","DOIUrl":null,"url":null,"abstract":"Over the past decade, the automatic image coloring has been of particular interest in applications such as repairing damaged or old images. One of the problems with the auto-coloring is the ability to predict multiple color results for gray image pixels. In the presence of noise, the problem becomes more complicated. Recently, some researchers employ conventional neural networks (CNN) to the problem of image colorization. Usually, the output of the last layer of CNN is used as a feature representation. However, the information contained in this layer may be too coarse spatially to allow exact localization. Conversely, earlier layers may be precise in localization but will not capture semantics. In this article, we use a concept called hypercolumns to achieve the best in both cases and develop a fully automatic image coloring system. Our approach exploits recent advances in deep neural networks and uses the semantic representation to provide an accurate color prediction. Since many elements of the scene naturally appear by the color distribution, we train our model in such a way to predict the color texture of each pixel. The DIV2K dataset has been used for training, and the obtained results are compared with other methods based on PSNR, which are promising.","PeriodicalId":128323,"journal":{"name":"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Image Colorization Using a Deep Transfer Learning\",\"authors\":\"Leila Kiani, Masoudnia Saeed, H. Nezamabadi-pour\",\"doi\":\"10.1109/CFIS49607.2020.9238737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over the past decade, the automatic image coloring has been of particular interest in applications such as repairing damaged or old images. One of the problems with the auto-coloring is the ability to predict multiple color results for gray image pixels. In the presence of noise, the problem becomes more complicated. Recently, some researchers employ conventional neural networks (CNN) to the problem of image colorization. Usually, the output of the last layer of CNN is used as a feature representation. However, the information contained in this layer may be too coarse spatially to allow exact localization. Conversely, earlier layers may be precise in localization but will not capture semantics. In this article, we use a concept called hypercolumns to achieve the best in both cases and develop a fully automatic image coloring system. Our approach exploits recent advances in deep neural networks and uses the semantic representation to provide an accurate color prediction. Since many elements of the scene naturally appear by the color distribution, we train our model in such a way to predict the color texture of each pixel. The DIV2K dataset has been used for training, and the obtained results are compared with other methods based on PSNR, which are promising.\",\"PeriodicalId\":128323,\"journal\":{\"name\":\"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CFIS49607.2020.9238737\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CFIS49607.2020.9238737","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

在过去的十年中，自动图像着色在修复损坏或旧图像等应用中得到了特别的关注。自动上色的问题之一是预测灰色图像像素的多种颜色结果的能力。在有噪音的情况下，问题变得更加复杂。近年来，一些研究者将传统神经网络(CNN)应用于图像着色问题。通常使用CNN最后一层的输出作为特征表示。然而，该层中包含的信息在空间上可能过于粗糙，无法进行精确定位。相反，早期的层可能在定位上是精确的，但不会捕获语义。在本文中，我们使用一个称为hypercolumns的概念来实现这两种情况的最佳效果，并开发一个全自动图像着色系统。我们的方法利用了深度神经网络的最新进展，并使用语义表示来提供准确的颜色预测。由于场景的许多元素自然地通过颜色分布出现，我们以这种方式训练我们的模型来预测每个像素的颜色纹理。利用DIV2K数据集进行训练，并与其他基于PSNR的方法进行了比较，结果表明该方法具有较好的应用前景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Image Colorization Using a Deep Transfer Learning

Over the past decade, the automatic image coloring has been of particular interest in applications such as repairing damaged or old images. One of the problems with the auto-coloring is the ability to predict multiple color results for gray image pixels. In the presence of noise, the problem becomes more complicated. Recently, some researchers employ conventional neural networks (CNN) to the problem of image colorization. Usually, the output of the last layer of CNN is used as a feature representation. However, the information contained in this layer may be too coarse spatially to allow exact localization. Conversely, earlier layers may be precise in localization but will not capture semantics. In this article, we use a concept called hypercolumns to achieve the best in both cases and develop a fully automatic image coloring system. Our approach exploits recent advances in deep neural networks and uses the semantic representation to provide an accurate color prediction. Since many elements of the scene naturally appear by the color distribution, we train our model in such a way to predict the color texture of each pixel. The DIV2K dataset has been used for training, and the obtained results are compared with other methods based on PSNR, which are promising.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 8th Iranian Joint Congress on Fuzzy and intelligent Systems (CFIS)

自引率

0.00%

发文量