Image Colorization

GraphiCon'2019 Proceedings. Volume 2 Pub Date : 2019-11-11 DOI:10.30987/graphicon-2019-2-207-210

Дарья Михалина, Daria Mikhalina, Александр Кузьменко, A. Kuz'menko, Константин Дергачев, K. Dergachev, Виталий Шкаберин, Vitaliy Shkaberin

{"title":"Image Colorization","authors":"Дарья Михалина, Daria Mikhalina, Александр Кузьменко, A. Kuz'menko, Константин Дергачев, K. Dergachev, Виталий Шкаберин, Vitaliy Shkaberin","doi":"10.30987/graphicon-2019-2-207-210","DOIUrl":null,"url":null,"abstract":"The article discusses one of the latest ways to colorize a black and white image using deep learning methods. For colorization, a convolutional neural network with a large number of layers (Deep convolutional) is used, the architecture of which includes a ResNet model. This model was pre-trained on images of the ImageNet dataset. A neural network receives a black and white image and returns a colorized color. Since, due to the characteristics of ResNet, an input multiple of 255 is received, a program was written that, using frames, enlarges the image for the required size. During the operation of the neural network, the CIE Lab color model is used, which allows to separate the black and white component of the image from the color. For training the neural network, the Place 365 dataset was used, containing 365 different classes, such as animals, landscape elements, people, and so on. The training was carried out on the Nvidia GTX 1080 video card. The result was a trained neural network capable of colorizing images of any size and format. As example we had a speed of 0.08 seconds and an image of 256 by 256 pixels in size. In connection with the concept of the dataset used for training, the resulting model is focused on the recognition of natural landscapes and urban areas.","PeriodicalId":409819,"journal":{"name":"GraphiCon'2019 Proceedings. Volume 2","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GraphiCon'2019 Proceedings. Volume 2","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30987/graphicon-2019-2-207-210","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The article discusses one of the latest ways to colorize a black and white image using deep learning methods. For colorization, a convolutional neural network with a large number of layers (Deep convolutional) is used, the architecture of which includes a ResNet model. This model was pre-trained on images of the ImageNet dataset. A neural network receives a black and white image and returns a colorized color. Since, due to the characteristics of ResNet, an input multiple of 255 is received, a program was written that, using frames, enlarges the image for the required size. During the operation of the neural network, the CIE Lab color model is used, which allows to separate the black and white component of the image from the color. For training the neural network, the Place 365 dataset was used, containing 365 different classes, such as animals, landscape elements, people, and so on. The training was carried out on the Nvidia GTX 1080 video card. The result was a trained neural network capable of colorizing images of any size and format. As example we had a speed of 0.08 seconds and an image of 256 by 256 pixels in size. In connection with the concept of the dataset used for training, the resulting model is focused on the recognition of natural landscapes and urban areas.

查看原文本刊更多论文

图像彩色化

本文讨论了使用深度学习方法为黑白图像上色的最新方法之一。对于着色，使用了具有大量层的卷积神经网络(Deep convolutional)，其架构包括一个ResNet模型。该模型是在ImageNet数据集的图像上进行预训练的。神经网络接收到一个黑白图像，并返回一个彩色的颜色。由于ResNet的特性，接收到的输入倍数是255，因此编写了一个程序，使用帧将图像放大到所需的大小。在神经网络的操作过程中，使用了CIE Lab颜色模型，该模型允许将图像的黑白成分与颜色分开。为了训练神经网络，使用了Place 365数据集，其中包含365个不同的类，如动物、景观元素、人物等。培训在Nvidia GTX 1080显卡上进行。结果是一个训练有素的神经网络，能够为任何大小和格式的图像着色。例如，我们的速度为0.08秒，图像大小为256 * 256像素。结合用于训练的数据集的概念，生成的模型专注于自然景观和城市地区的识别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

GraphiCon'2019 Proceedings. Volume 2

自引率

0.00%

发文量