A Bayesian framework for video affective representation

2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops Pub Date : 2009-12-08 DOI:10.1109/ACII.2009.5349563

M. Soleymani, Joep J. M. Kierkels, G. Chanel, T. Pun

{"title":"A Bayesian framework for video affective representation","authors":"M. Soleymani, Joep J. M. Kierkels, G. Chanel, T. Pun","doi":"10.1109/ACII.2009.5349563","DOIUrl":null,"url":null,"abstract":"Emotions that are elicited in response to a video scene contain valuable information for multimedia tagging and indexing. The novelty of this paper is to introduce a Bayesian classification framework for affective video tagging that allows taking contextual information into account. A set of 21 full length movies was first segmented and informative content-based features were extracted from each shot and scene. Shots were then emotionally annotated, providing ground truth affect. The arousal of shots was computed using a linear regression on the content-based features. Bayesian classification based on the shots arousal and content-based features allowed tagging these scenes into three affective classes, namely calm, positive excited and negative excited. To improve classification accuracy, two contextual priors have been proposed: the movie genre prior, and the temporal dimension prior consisting of the probability of transition between emotions in consecutive scenes. The f1 classification measure of 54.9% that was obtained on three emotional classes with a naïve Bayes classifier was improved to 63.4% after utilizing all the priors.","PeriodicalId":330737,"journal":{"name":"2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"76","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACII.2009.5349563","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 76

Abstract

Emotions that are elicited in response to a video scene contain valuable information for multimedia tagging and indexing. The novelty of this paper is to introduce a Bayesian classification framework for affective video tagging that allows taking contextual information into account. A set of 21 full length movies was first segmented and informative content-based features were extracted from each shot and scene. Shots were then emotionally annotated, providing ground truth affect. The arousal of shots was computed using a linear regression on the content-based features. Bayesian classification based on the shots arousal and content-based features allowed tagging these scenes into three affective classes, namely calm, positive excited and negative excited. To improve classification accuracy, two contextual priors have been proposed: the movie genre prior, and the temporal dimension prior consisting of the probability of transition between emotions in consecutive scenes. The f1 classification measure of 54.9% that was obtained on three emotional classes with a naïve Bayes classifier was improved to 63.4% after utilizing all the priors.

查看原文本刊更多论文

视频情感表示的贝叶斯框架

对视频场景的反应所引起的情绪包含有价值的信息，可用于多媒体标记和索引。本文的新颖之处在于为情感视频标记引入了一个贝叶斯分类框架，该框架允许将上下文信息考虑在内。首先对一组21部完整长度的电影进行分割，并从每个镜头和场景中提取基于内容的信息特征。然后对镜头进行情感注释，提供真实的影响。使用基于内容的特征的线性回归计算射击的唤醒。基于镜头唤醒和基于内容的特征的贝叶斯分类允许将这些场景标记为三个情感类别，即平静，积极兴奋和消极兴奋。为了提高分类精度，本文提出了两种语境先验:电影类型先验和由连续场景中情绪转换概率组成的时间维度先验。使用naïve贝叶斯分类器对三个情感类获得的f1分类度量为54.9%，在利用所有先验后提高到63.4%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops

自引率

0.00%

发文量