{"title":"Affective Computing of Image Emotion Perceptions","authors":"Sicheng Zhao","doi":"10.1145/2835776.2855082","DOIUrl":null,"url":null,"abstract":"Images can convey rich semantics and evoke strong emotions in viewers. The research of my PhD thesis focuses on image emotion computing (IEC), which aims to predict the emotion perceptions of given images. The development of IEC is greatly constrained by two main challenges: affective gap and subjective evaluation [5]. Previous works mainly focused on finding features that can express emotions better to bridge the affective gap, such as elements-of-art based features [2] and shape features [1]. Based on the emotion representation models, including categorical emotion states (CES) and dimensional emotion space (DES) [5], three different tasks are traditionally performed on IEC: affective image classification, regression and retrieval. The state-of-the-art methods on the three above tasks are image-centric, focusing on the dominant emotions for the majority of viewers. For my PhD thesis, I plan to answer the following questions: 1. Compared to the low-level elements-of-art based features, can we find some higher level features that are more interpretable and have stronger link to emotions? 2. Are the emotions that are evoked in viewers by an image subjective and different? If they are, how can we tackle the user-centric emotion prediction? 3. For imagecentric emotion computing, can we predict the emotion distribution instead of the dominant emotion category? 1. The artistic elements must be carefully arranged and orchestrated into meaningful regions and images to describe specific semantics and emotions. The rules, tools or guidelines of arranging and orchestrating the elements-of-art in an artwork are known as the principles-of-art, which consider various artistic aspects, including balance, emphasis, harmony, variety, gradation, movement, rhythm, and proportion [5]. We systematically study and formulize the former 6 artistic principles, explaining the concepts and translating these concepts into mathematical formulae. 2. The images in Abstract dataset [2] were labeled by 14 people on average. 81% images are assigned with 5 to 8 emotions. So the perceived emotions of different viewers may vary. To further demonstrate this observation, we set up a large-scale dataset, named Image-Emotion-Social-Net dataset, with over 1 million images downloaded from Flickr. To get the personalized emotion labels, firstly we use traditional lexicon-based methods as in [4] to obtain the text segmentation results of the title, tags and descrip-","PeriodicalId":20567,"journal":{"name":"Proceedings of the Ninth ACM International Conference on Web Search and Data Mining","volume":"63 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2016-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Ninth ACM International Conference on Web Search and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2835776.2855082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1
Abstract
Images can convey rich semantics and evoke strong emotions in viewers. My PhD thesis focuses on image emotion computing (IEC), which aims to predict the emotion perceptions of given images. The development of IEC is greatly constrained by two main challenges: the affective gap and subjective evaluation [5]. Previous work mainly focused on finding features that express emotions better to bridge the affective gap, such as elements-of-art based features [2] and shape features [1]. Based on the emotion representation models, namely categorical emotion states (CES) and dimensional emotion space (DES) [5], three different tasks are traditionally performed in IEC: affective image classification, regression, and retrieval. State-of-the-art methods for these three tasks are image-centric, focusing on the dominant emotions perceived by the majority of viewers.

For my PhD thesis, I plan to answer the following questions:
1. Compared to the low-level elements-of-art based features, can we find higher-level features that are more interpretable and have a stronger link to emotions?
2. Are the emotions evoked in different viewers by the same image subjective and varied? If so, how can we tackle user-centric emotion prediction?
3. For image-centric emotion computing, can we predict the emotion distribution instead of only the dominant emotion category?

1. Artistic elements must be carefully arranged and orchestrated into meaningful regions and images to convey specific semantics and emotions. The rules, tools, and guidelines for arranging and orchestrating the elements of art in an artwork are known as the principles of art, which cover various artistic aspects, including balance, emphasis, harmony, variety, gradation, movement, rhythm, and proportion [5]. We systematically study and formalize the first six of these principles, explaining the concepts and translating them into mathematical formulae (a toy sketch of one such principle is given below).

2. The images in the Abstract dataset [2] were labeled by 14 people on average, and 81% of the images are assigned 5 to 8 different emotion labels, so the emotions perceived by different viewers may indeed vary. To further examine this observation, we set up a large-scale dataset, named the Image-Emotion-Social-Net dataset, with over 1 million images downloaded from Flickr. To obtain personalized emotion labels, we first use traditional lexicon-based methods, as in [4], to obtain text segmentation results for the title, tags, and description of each image (a sketch of this labeling step, and of the emotion distribution it yields, is given below).
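As a rough illustration of what "translating a principle of art into a formula" can look like, the toy score below treats balance as left-right symmetry of pixel intensities. This is only a plausible proxy under my own assumption; the thesis formalizes six principles (balance, emphasis, harmony, variety, gradation, movement), and its actual formulae are not reproduced here.

```python
# Illustrative sketch only: a toy "balance" score based on left-right symmetry
# of grayscale intensities. This is an assumed proxy, not the thesis formula.
import numpy as np

def balance_score(gray: np.ndarray) -> float:
    """Return a value in [0, 1]; 1 means the left half mirrors the right half."""
    h, w = gray.shape
    left = gray[:, : w // 2].astype(float)
    right = np.fliplr(gray[:, w - w // 2 :]).astype(float)
    # Mean absolute difference between the left half and the mirrored right half,
    # normalized by the 8-bit intensity range.
    diff = np.abs(left - right).mean() / 255.0
    return 1.0 - diff

if __name__ == "__main__":
    # A horizontally symmetric ramp image scores exactly 1.0.
    ramp = np.tile(np.concatenate([np.arange(50), np.arange(50)[::-1]]), (100, 1))
    print(round(balance_score(ramp.astype(np.uint8)), 3))
```

A real feature extractor would likely work on saliency maps or segmented regions rather than raw pixels, but the shape of the computation, i.e. one interpretable number per principle per image, is the point of the sketch.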
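For the personalized labels, a minimal sketch of lexicon-based labeling is shown below, assuming a hand-made keyword lexicon over Mikels' eight emotion categories (the categories used by the Abstract dataset [2]). The actual lexicon, segmentation pipeline, and method of [4] are not reproduced here; `EMOTION_LEXICON` and `personalized_labels` are hypothetical names for illustration.

```python
# A toy lexicon-based labeler: count emotion-keyword hits in the text a viewer
# attached to an image (title, tags, description). The lexicon is made up.
import re
from collections import Counter

EMOTION_LEXICON = {
    "amusement":   {"fun", "funny", "lol"},
    "awe":         {"amazing", "majestic", "stunning"},
    "contentment": {"peaceful", "calm", "cozy"},
    "excitement":  {"party", "thrill", "wow"},
    "anger":       {"angry", "furious", "rage"},
    "disgust":     {"gross", "nasty"},
    "fear":        {"scary", "creepy", "horror"},
    "sadness":     {"sad", "lonely", "grief"},
}

def personalized_labels(title: str, tags: list[str], description: str) -> Counter:
    """Count lexicon hits in one viewer's title/tags/description text."""
    tokens = re.findall(r"[a-z]+", " ".join([title, *tags, description]).lower())
    hits = Counter()
    for emotion, words in EMOTION_LEXICON.items():
        hits[emotion] += sum(token in words for token in tokens)
    return hits

print(personalized_labels("A peaceful lake at dawn", ["calm", "nature"], "so majestic"))
# e.g. Counter({'contentment': 2, 'awe': 1, ...})
```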
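Finally, question 3 asks for an emotion distribution rather than a single dominant label. The sketch below shows the simplest version of that idea: normalize the votes of an image's annotators into a probability vector over the eight categories. The vote counts are invented for illustration; only the category set and the "14 annotators per image" figure come from the abstract.

```python
# Turn per-viewer emotion votes for one image into a distribution instead of
# collapsing them to the dominant category. Votes below are made-up examples.
from collections import Counter

CATEGORIES = ["amusement", "awe", "contentment", "excitement",
              "anger", "disgust", "fear", "sadness"]

def emotion_distribution(votes: list[str]) -> dict[str, float]:
    """Normalize all annotators' votes for one image into a probability vector."""
    counts = Counter(votes)
    total = sum(counts.values()) or 1
    return {c: counts[c] / total for c in CATEGORIES}

# 14 hypothetical annotators: "awe" dominates, but the disagreement is kept.
votes = ["awe"] * 6 + ["contentment"] * 4 + ["sadness"] * 3 + ["fear"]
dist = emotion_distribution(votes)
print(max(dist, key=dist.get))                            # dominant: 'awe'
print({k: round(v, 2) for k, v in dist.items() if v > 0}) # full distribution
```

An image-centric predictor trained against such vectors can then be evaluated on how well it matches the whole distribution, not just the argmax.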