基于深度神经网络的视频色彩分级

IF 0.4 Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

IADIS-International Journal on Computer Science and Information Systems Pub Date : 2018-12-17 DOI:10.33965/IJCSIS_2018130201

J. Gibbs

{"title":"基于深度神经网络的视频色彩分级","authors":"J. Gibbs","doi":"10.33965/IJCSIS_2018130201","DOIUrl":null,"url":null,"abstract":"The task of color grading (or color correction) for film and video is significant and complex, involving aesthetic and technical decisions that require a trained operator and a good deal of time. In order to determine whether deep neural networks are capable of learning this complex aesthetic task, we compare two network frameworks—a classification network, and a conditional generative adversarial network, or cGAN—examining the quality and consistency of their output as potential automated solutions to color correction. Results are very good for both networks, though each exhibits problem areas. The classification network has issues with generalizing due to the need to collect and especially to label all data being used to train it. The cGAN on the other hand can use unlabeled data, which is much easier to collect. While the classification network does not directly affect images, only identifying image problems, the cGAN, creates a new image, introducing potential image degradation in the process; thus multiple adjustments to the network need to be made to create high quality output. We find that the data labeling issue for the classification network is a less tractable problem than the image correction and continuity issues discovered with the cGAN method, which have direct solutions. Thus we conclude the cGAN is the more promising network with which to automate color correction and grading.","PeriodicalId":41878,"journal":{"name":"IADIS-International Journal on Computer Science and Information Systems","volume":"44 1","pages":""},"PeriodicalIF":0.4000,"publicationDate":"2018-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Video color grading via deep neural networks\",\"authors\":\"J. Gibbs\",\"doi\":\"10.33965/IJCSIS_2018130201\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The task of color grading (or color correction) for film and video is significant and complex, involving aesthetic and technical decisions that require a trained operator and a good deal of time. In order to determine whether deep neural networks are capable of learning this complex aesthetic task, we compare two network frameworks—a classification network, and a conditional generative adversarial network, or cGAN—examining the quality and consistency of their output as potential automated solutions to color correction. Results are very good for both networks, though each exhibits problem areas. The classification network has issues with generalizing due to the need to collect and especially to label all data being used to train it. The cGAN on the other hand can use unlabeled data, which is much easier to collect. While the classification network does not directly affect images, only identifying image problems, the cGAN, creates a new image, introducing potential image degradation in the process; thus multiple adjustments to the network need to be made to create high quality output. We find that the data labeling issue for the classification network is a less tractable problem than the image correction and continuity issues discovered with the cGAN method, which have direct solutions. Thus we conclude the cGAN is the more promising network with which to automate color correction and grading.\",\"PeriodicalId\":41878,\"journal\":{\"name\":\"IADIS-International Journal on Computer Science and Information Systems\",\"volume\":\"44 1\",\"pages\":\"\"},\"PeriodicalIF\":0.4000,\"publicationDate\":\"2018-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IADIS-International Journal on Computer Science and Information Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33965/IJCSIS_2018130201\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IADIS-International Journal on Computer Science and Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33965/IJCSIS_2018130201","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 1

摘要

电影和视频的色彩分级(或色彩校正)任务重要而复杂，涉及美学和技术决策，需要训练有素的操作员和大量的时间。为了确定深度神经网络是否能够学习这种复杂的美学任务，我们比较了两个网络框架——分类网络和条件生成对抗网络，或cgan——检查它们输出的质量和一致性，作为颜色校正的潜在自动化解决方案。两个网络的结果都非常好，尽管每个网络都有问题。由于需要收集，特别是需要标记用于训练的所有数据，分类网络在泛化方面存在问题。另一方面，cGAN可以使用未标记的数据，这更容易收集。虽然分类网络不直接影响图像，仅识别图像问题，但cGAN创建新图像，在此过程中引入潜在的图像退化;因此，需要对网络进行多次调整以产生高质量的输出。我们发现，与使用cGAN方法发现的图像校正和连续性问题相比，分类网络的数据标注问题更难处理，这两个问题有直接的解决方案。因此，我们得出结论，cGAN是更有前途的网络，用于自动色彩校正和分级。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Video color grading via deep neural networks

The task of color grading (or color correction) for film and video is significant and complex, involving aesthetic and technical decisions that require a trained operator and a good deal of time. In order to determine whether deep neural networks are capable of learning this complex aesthetic task, we compare two network frameworks—a classification network, and a conditional generative adversarial network, or cGAN—examining the quality and consistency of their output as potential automated solutions to color correction. Results are very good for both networks, though each exhibits problem areas. The classification network has issues with generalizing due to the need to collect and especially to label all data being used to train it. The cGAN on the other hand can use unlabeled data, which is much easier to collect. While the classification network does not directly affect images, only identifying image problems, the cGAN, creates a new image, introducing potential image degradation in the process; thus multiple adjustments to the network need to be made to create high quality output. We find that the data labeling issue for the classification network is a less tractable problem than the image correction and continuity issues discovered with the cGAN method, which have direct solutions. Thus we conclude the cGAN is the more promising network with which to automate color correction and grading.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IADIS-International Journal on Computer Science and Information Systems COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-

自引率

0.00%

发文量