Modeling Aesthetics and Emotions in Visual Content: From Vincent van Gogh to Robotics and Vision

Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends Pub Date : 2020-10-12 DOI:10.1145/3423268.3423586

J. Z. Wang

{"title":"Modeling Aesthetics and Emotions in Visual Content: From Vincent van Gogh to Robotics and Vision","authors":"J. Z. Wang","doi":"10.1145/3423268.3423586","DOIUrl":null,"url":null,"abstract":"As inborn characteristics, humans possess the ability to judge visual aesthetics, feel the emotions from the environment and comprehend others? emotional expressions. Many exciting applications become possible if robots or computers can be empowered with similar capabilities. Modeling aesthetics, evoked emotions, and emotional expressions automatically in unconstrained situations, however, is daunting due to the lack of a full understanding of the relationship between low-level visual content and high-level aesthetics or emotional expressions. With the growing availability of data, it is possible to tackle these problems using machine learning and statistical modeling approaches. In the talk, I provide an overview of our research in the last two decades on data-driven analyses of visual artworks and digital visual content for modeling aesthetics and emotions. First, I discuss our analyses of styles in visual artworks. Art historians have long observed the highly characteristic brushstroke styles of Vincent van Gogh and have relied on discerning these styles for authenticating and dating his works. In our work, we compared van Gogh with his contemporaries by statistically analyzing a massive set of automatically extracted brushstrokes. A novel extraction method is developed by exploiting an integration of edge detection and clustering-based segmentation. Evidence substantiates that van Gogh's brushstrokes are strongly rhythmic. Next, I describe an effort to model the aesthetic and emotional characteristics in visual contents such as photographs. By taking a data-driven approach, using the Internet as the data source, we show that computers can be trained to recognize various characteristics that are highly relevant to aesthetics and emotions. Future computer systems equipped with such capabilities are expected to help millions of users with unimagined ways. Finally, I highlight our research on automated recognition of bodily expression of emotion. We propose a scalable and reliable crowdsourcing approach for collecting in-the-wild perceived emotion data for computers to learn to recognize the body language of humans. Comprehensive statistical analysis revealed many interesting insights from the dataset. A system to model the emotional expressions based on bodily movements, named ARBEE (Automated Recognition of Bodily Expression of Emotion), has also been developed and evaluated.","PeriodicalId":393702,"journal":{"name":"Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3423268.3423586","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

As inborn characteristics, humans possess the ability to judge visual aesthetics, feel the emotions from the environment and comprehend others? emotional expressions. Many exciting applications become possible if robots or computers can be empowered with similar capabilities. Modeling aesthetics, evoked emotions, and emotional expressions automatically in unconstrained situations, however, is daunting due to the lack of a full understanding of the relationship between low-level visual content and high-level aesthetics or emotional expressions. With the growing availability of data, it is possible to tackle these problems using machine learning and statistical modeling approaches. In the talk, I provide an overview of our research in the last two decades on data-driven analyses of visual artworks and digital visual content for modeling aesthetics and emotions. First, I discuss our analyses of styles in visual artworks. Art historians have long observed the highly characteristic brushstroke styles of Vincent van Gogh and have relied on discerning these styles for authenticating and dating his works. In our work, we compared van Gogh with his contemporaries by statistically analyzing a massive set of automatically extracted brushstrokes. A novel extraction method is developed by exploiting an integration of edge detection and clustering-based segmentation. Evidence substantiates that van Gogh's brushstrokes are strongly rhythmic. Next, I describe an effort to model the aesthetic and emotional characteristics in visual contents such as photographs. By taking a data-driven approach, using the Internet as the data source, we show that computers can be trained to recognize various characteristics that are highly relevant to aesthetics and emotions. Future computer systems equipped with such capabilities are expected to help millions of users with unimagined ways. Finally, I highlight our research on automated recognition of bodily expression of emotion. We propose a scalable and reliable crowdsourcing approach for collecting in-the-wild perceived emotion data for computers to learn to recognize the body language of humans. Comprehensive statistical analysis revealed many interesting insights from the dataset. A system to model the emotional expressions based on bodily movements, named ARBEE (Automated Recognition of Bodily Expression of Emotion), has also been developed and evaluated.

查看原文本刊更多论文

视觉内容中的建模美学和情感:从文森特·梵高到机器人和视觉

人类具有判断视觉美学、感受环境情绪和理解他人的能力，这是人类与生俱来的特征。情感的表达。如果机器人或计算机能够被赋予类似的能力，许多令人兴奋的应用将成为可能。然而，由于缺乏对低级视觉内容与高级美学或情感表达之间关系的充分理解，在不受约束的情况下自动建模美学、唤起的情感和情感表达是令人望而生畏的。随着数据可用性的增加，可以使用机器学习和统计建模方法来解决这些问题。在演讲中，我概述了我们在过去二十年中对视觉艺术品和数字视觉内容的数据驱动分析的研究，以建模美学和情感。首先，我讨论了我们对视觉艺术作品风格的分析。长期以来，艺术史学家一直在观察文森特·梵高极具特色的笔触风格，并依靠对这些风格的辨别来鉴定和确定他的作品的年代。在我们的工作中，我们通过统计分析大量自动提取的笔触，将梵高与同时代的画家进行比较。将边缘检测与聚类分割相结合，提出了一种新的图像提取方法。有证据证明梵高的笔触具有强烈的节奏感。接下来，我描述了在视觉内容(如照片)中建模美学和情感特征的努力。通过采用数据驱动的方法，使用互联网作为数据源，我们展示了计算机可以被训练来识别与美学和情感高度相关的各种特征。未来配备这种能力的计算机系统有望以难以想象的方式帮助数百万用户。最后，我重点介绍了我们在情感身体表达的自动识别方面的研究。我们提出了一种可扩展且可靠的众包方法，用于收集野外感知情绪数据，以供计算机学习识别人类的肢体语言。全面的统计分析揭示了数据集中许多有趣的见解。一种基于身体动作来模拟情绪表达的系统，被称为ARBEE(自动识别身体情绪表达)，也被开发和评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends

自引率

0.00%

发文量