AI Gets Creative

Proceedings of the 1st International Workshop on AI for Smart TV Content Production, Access and Delivery Pub Date : 2019-10-21 DOI:10.1145/3347449.3357490

M. Mrak

{"title":"AI Gets Creative","authors":"M. Mrak","doi":"10.1145/3347449.3357490","DOIUrl":null,"url":null,"abstract":"Numerous breakthroughs in multimedia signal processing are being enabled thanks to applications of machine learning in tasks such as multimedia creation, enhancement, classification and compression [1]. Notably, in the context of production and distribution of television programmes, it has been successfully demonstrated how Artificial Intelligence (AI) can support innovation in the creative sector. In the context of delivering TV programmes of stunning visual quality, the applications of deep learning have enabled significant advances when the original content is of poor quality / resolution, or when delivery channels are very limited. Examples when the enhancement of originally poor quality is needed include new content forms (e.g. user generated content) and historical content (e.g. archives), while limitations of delivery channels can, first of all, be addressed by improving content compression. As a state-of-the-art example, the benefits of deep-learning solutions have been recently demonstrated within an end-to-end platform for management of user generated content [2], where deep learning is applied to increase video resolution, evaluate video quality and enrich the video by providing automatic metadata. Within this particular application space where large amount of user generated content is available, the progress has also been made in addressing visual story editing using social media data in automatic ways, making programmes from large amount of content faster [3]. Broadcasters are also interested in restauration of historical content more cheaply. For example, adding colour to \"black and white\" content has until now been an expensive and time-consuming task. However, recently new algorithms have been developed to perform the task more efficiently. Generative Adversarial Networks (GANs) have become the baseline for many image-to-image translation tasks, including image colourisation. Aiming at the generation of more naturally coloured images from \"black and white\" sources, newest algorithms are capable of generalisation of the colour of natural images, producing realistic and plausible results [4]. In the context of content delivery, new generations of compression standards enable significant reduction of required bandwidth [5], however, with a cost of increased computational complexity. This is another area where AI can be utilised for better efficiency - either in its simple forms as decision trees [6,7] or more advanced deep convolutional neural networks [8]. Looking forward, this penetration of AI opens new challenges, such as interpretability of deep learning (to enable use AI in an accountable way as well as to enable AI-inspired low-complexity algorithms) and applicability in systems which require low-complexity solutions and/or do not have enough training data. However, overall further benefits of these new approaches include automatization of many traditional production tasks which has the potential to transform the way content providers make their programmes in cheaper and more effective ways.","PeriodicalId":276496,"journal":{"name":"Proceedings of the 1st International Workshop on AI for Smart TV Content Production, Access and Delivery","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st International Workshop on AI for Smart TV Content Production, Access and Delivery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3347449.3357490","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Numerous breakthroughs in multimedia signal processing are being enabled thanks to applications of machine learning in tasks such as multimedia creation, enhancement, classification and compression [1]. Notably, in the context of production and distribution of television programmes, it has been successfully demonstrated how Artificial Intelligence (AI) can support innovation in the creative sector. In the context of delivering TV programmes of stunning visual quality, the applications of deep learning have enabled significant advances when the original content is of poor quality / resolution, or when delivery channels are very limited. Examples when the enhancement of originally poor quality is needed include new content forms (e.g. user generated content) and historical content (e.g. archives), while limitations of delivery channels can, first of all, be addressed by improving content compression. As a state-of-the-art example, the benefits of deep-learning solutions have been recently demonstrated within an end-to-end platform for management of user generated content [2], where deep learning is applied to increase video resolution, evaluate video quality and enrich the video by providing automatic metadata. Within this particular application space where large amount of user generated content is available, the progress has also been made in addressing visual story editing using social media data in automatic ways, making programmes from large amount of content faster [3]. Broadcasters are also interested in restauration of historical content more cheaply. For example, adding colour to "black and white" content has until now been an expensive and time-consuming task. However, recently new algorithms have been developed to perform the task more efficiently. Generative Adversarial Networks (GANs) have become the baseline for many image-to-image translation tasks, including image colourisation. Aiming at the generation of more naturally coloured images from "black and white" sources, newest algorithms are capable of generalisation of the colour of natural images, producing realistic and plausible results [4]. In the context of content delivery, new generations of compression standards enable significant reduction of required bandwidth [5], however, with a cost of increased computational complexity. This is another area where AI can be utilised for better efficiency - either in its simple forms as decision trees [6,7] or more advanced deep convolutional neural networks [8]. Looking forward, this penetration of AI opens new challenges, such as interpretability of deep learning (to enable use AI in an accountable way as well as to enable AI-inspired low-complexity algorithms) and applicability in systems which require low-complexity solutions and/or do not have enough training data. However, overall further benefits of these new approaches include automatization of many traditional production tasks which has the potential to transform the way content providers make their programmes in cheaper and more effective ways.

查看原文本刊更多论文

AI变得有创意

由于机器学习在多媒体创建、增强、分类和压缩等任务中的应用，多媒体信号处理领域的许多突破正在成为可能。值得注意的是，在电视节目制作和发行的背景下，人工智能(AI)如何支持创意部门的创新已得到成功展示。在提供令人惊叹的视觉质量的电视节目的背景下，深度学习的应用在原始内容质量/分辨率较差或交付渠道非常有限的情况下实现了重大进步。需要提高原本质量较差的内容的例子包括新的内容形式(如用户生成的内容)和历史内容(如档案)，而交付渠道的限制可以首先通过改进内容压缩来解决。作为一个最先进的例子，深度学习解决方案的好处最近在用户生成内容管理的端到端平台b[2]中得到了证明，其中深度学习被应用于提高视频分辨率，评估视频质量，并通过提供自动元数据来丰富视频。在这个特定的应用领域中，大量用户生成的内容是可用的，在使用社交媒体数据以自动方式处理视觉故事编辑方面也取得了进展，使大量内容的节目更快。广播公司也对以更低的成本还原历史内容感兴趣。例如，到目前为止，为“黑白”内容添加颜色一直是一项昂贵且耗时的任务。然而，最近已经开发了新的算法来更有效地执行任务。生成对抗网络(gan)已经成为许多图像到图像翻译任务的基线，包括图像着色。为了从“黑白”源生成更自然的彩色图像，最新的算法能够对自然图像的颜色进行泛化，产生真实可信的结果[4]。在内容交付的上下文中，新一代的压缩标准能够显著减少所需的带宽，然而，其代价是增加了计算复杂性。这是人工智能可以被用于提高效率的另一个领域——无论是以决策树的简单形式[6,7]还是更高级的深度卷积神经网络[[8]]。展望未来，人工智能的这种渗透带来了新的挑战，例如深度学习的可解释性(以负责任的方式使用人工智能以及启用人工智能启发的低复杂性算法)以及在需要低复杂性解决方案和/或没有足够训练数据的系统中的适用性。然而，这些新方法的总体进一步好处包括许多传统制作任务的自动化，这有可能改变内容提供商以更便宜和更有效的方式制作节目的方式。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 1st International Workshop on AI for Smart TV Content Production, Access and Delivery

自引率

0.00%

发文量