Image Colorization

Дарья Михалина, Daria Mikhalina, Александр Кузьменко, A. Kuz'menko, Константин Дергачев, K. Dergachev, Виталий Шкаберин, Vitaliy Shkaberin
{"title":"Image Colorization","authors":"Дарья Михалина, Daria Mikhalina, Александр Кузьменко, A. Kuz'menko, Константин Дергачев, K. Dergachev, Виталий Шкаберин, Vitaliy Shkaberin","doi":"10.30987/graphicon-2019-2-207-210","DOIUrl":null,"url":null,"abstract":"The article discusses one of the latest ways to colorize a black and white image using deep learning methods. For colorization, a convolutional neural network with a large number of layers (Deep convolutional) is used, the architecture of which includes a ResNet model. This model was pre-trained on images of the ImageNet dataset. A neural network receives a black and white image and returns a colorized color. Since, due to the characteristics of ResNet, an input multiple of 255 is received, a program was written that, using frames, enlarges the image for the required size. During the operation of the neural network, the CIE Lab color model is used, which allows to separate the black and white component of the image from the color. For training the neural network, the Place 365 dataset was used, containing 365 different classes, such as animals, landscape elements, people, and so on. The training was carried out on the Nvidia GTX 1080 video card. The result was a trained neural network capable of colorizing images of any size and format. As example we had a speed of 0.08 seconds and an image of 256 by 256 pixels in size. In connection with the concept of the dataset used for training, the resulting model is focused on the recognition of natural landscapes and urban areas.","PeriodicalId":409819,"journal":{"name":"GraphiCon'2019 Proceedings. Volume 2","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GraphiCon'2019 Proceedings. Volume 2","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30987/graphicon-2019-2-207-210","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The article discusses one of the latest ways to colorize a black and white image using deep learning methods. For colorization, a convolutional neural network with a large number of layers (Deep convolutional) is used, the architecture of which includes a ResNet model. This model was pre-trained on images of the ImageNet dataset. A neural network receives a black and white image and returns a colorized color. Since, due to the characteristics of ResNet, an input multiple of 255 is received, a program was written that, using frames, enlarges the image for the required size. During the operation of the neural network, the CIE Lab color model is used, which allows to separate the black and white component of the image from the color. For training the neural network, the Place 365 dataset was used, containing 365 different classes, such as animals, landscape elements, people, and so on. The training was carried out on the Nvidia GTX 1080 video card. The result was a trained neural network capable of colorizing images of any size and format. As example we had a speed of 0.08 seconds and an image of 256 by 256 pixels in size. In connection with the concept of the dataset used for training, the resulting model is focused on the recognition of natural landscapes and urban areas.
图像彩色化
本文讨论了使用深度学习方法为黑白图像上色的最新方法之一。对于着色,使用了具有大量层的卷积神经网络(Deep convolutional),其架构包括一个ResNet模型。该模型是在ImageNet数据集的图像上进行预训练的。神经网络接收到一个黑白图像,并返回一个彩色的颜色。由于ResNet的特性,接收到的输入倍数是255,因此编写了一个程序,使用帧将图像放大到所需的大小。在神经网络的操作过程中,使用了CIE Lab颜色模型,该模型允许将图像的黑白成分与颜色分开。为了训练神经网络,使用了Place 365数据集,其中包含365个不同的类,如动物、景观元素、人物等。培训在Nvidia GTX 1080显卡上进行。结果是一个训练有素的神经网络,能够为任何大小和格式的图像着色。例如,我们的速度为0.08秒,图像大小为256 * 256像素。结合用于训练的数据集的概念,生成的模型专注于自然景观和城市地区的识别。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信