Title: Understanding Discrete Facial Expressions in Video Using an Emotion Avatar Image
Authors: Songfan Yang, B. Bhanu
Journal: IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), pp. 980-992
DOI: 10.1109/TSMCB.2012.2192269
Published: 2012-08-01 (Epub 2012-05-07)
Citations: 108
Abstract
Existing video-based facial expression recognition techniques analyze the geometry-based and appearance-based information in every frame and explore the temporal relations among frames. In contrast, we present a new image-based representation, the emotion avatar image (EAI), together with an associated reference image, the avatar reference. This representation leverages the out-of-plane head rotation. It is not only robust to outliers but also provides a way to aggregate dynamic information from expressions of varying lengths. The approach to facial expression analysis consists of the following steps: 1) face detection; 2) face registration of video frames with the avatar reference to form the EAI representation; 3) computation of features from EAIs using both local binary patterns (LBP) and local phase quantization (LPQ); and 4) classification of the features as one of the emotion types using a linear support vector machine (SVM) classifier. Our system is tested on the Facial Expression Recognition and Analysis Challenge (FERA2011) data, i.e., the Geneva Multimodal Emotion Portrayal-Facial Expression Recognition and Analysis Challenge (GEMEP-FERA) data set. The experimental results demonstrate that the information captured in an EAI for a facial expression is a very strong cue for emotion inference. Moreover, our method suppresses person-specific information irrelevant to emotion and performs well on unseen data.
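Steps 3 and 4 of the pipeline can be illustrated with a minimal sketch: extracting uniform LBP histograms from grayscale images and classifying them with a linear SVM. This is not the authors' implementation; the EAI construction (steps 1 and 2) and the LPQ descriptor are omitted, and the two synthetic texture classes below merely stand in for EAIs of two emotions.

```python
import numpy as np
from scipy.ndimage import gaussian_filter
from skimage.feature import local_binary_pattern
from sklearn.svm import LinearSVC

def lbp_histogram(image, n_points=8, radius=1):
    # Uniform LBP assigns each pixel a pattern code in [0, n_points + 1];
    # the normalized histogram of these codes is the texture descriptor.
    lbp = local_binary_pattern(image, n_points, radius, method="uniform")
    n_bins = n_points + 2
    hist, _ = np.histogram(lbp, bins=n_bins, range=(0, n_bins), density=True)
    return hist

rng = np.random.default_rng(0)
# Synthetic stand-ins for EAIs: class 0 = smoothed noise (locally
# correlated texture), class 1 = raw noise (unstructured texture).
class0 = [gaussian_filter(rng.normal(size=(32, 32)), sigma=2) for _ in range(20)]
class1 = [rng.normal(size=(32, 32)) for _ in range(20)]
X = np.array([lbp_histogram(im) for im in class0 + class1])
y = np.array([0] * 20 + [1] * 20)

# Step 4: a linear SVM on the LBP histograms.
clf = LinearSVC().fit(X, y)
score = clf.score(X, y)
```

In the paper's actual setup, LBP and LPQ histograms would be computed over the aligned EAIs rather than synthetic textures, and evaluation is on held-out subjects rather than training data.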