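Emotion recognition from facial image analysis using composite similarity measure aided bidimensional empirical mode decomposition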

Arghya Bhattacharya, Dwaipayan Choudhury, D. Dey
2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI)
DOI: 10.1109/CMI.2016.7413766 (https://doi.org/10.1109/CMI.2016.7413766)
Citations: 5

Abstract

The aim of this work is to automatically detect and analyse emotions in digital videos and images. Images are first extracted from pre-recorded videos, and the faces are cropped from them automatically. The training dataset is formed with a minimal number of images per subject for each emotion. Bi-dimensional Empirical Mode Decomposition (BEMD) is used to decompose the images into their Intrinsic Mode Functions (IMFs). Composite Similarity Measure (CSM) based classification is then employed to detect the correct emotion from the images. The "ENTERFACE'05 Audio-Visual Emotion Database", the "JAFFE Database" and a laboratory-developed database called the "DCAB database" are used to test the performance of the proposed method. The advantage of this method is that it can classify or rank the emotions found in an image or a video even when the face is subject to feature occlusion, for example when the subject wears spectacles or sunglasses. Moreover, it is robust to changes in illumination, viewpoint and background colour of the image or video. The performance is also invariant to the subject's dress, hairstyle, facial hair or moustache. The method is also able to overcome ageing-related problems to some extent.
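
The abstract does not specify which BEMD variant the authors use or how the Composite Similarity Measure is defined, so the pipeline can only be illustrated under assumptions. The following is a minimal sketch, not the authors' implementation: it assumes envelopes are estimated with sliding max/min windows followed by box smoothing (instead of extrema-based surface interpolation), a fixed number of sifting iterations, and a hypothetical stand-in for CSM that averages a correlation score with a distance-based score. All function names (`sift_once`, `bemd`, `composite_similarity`, `classify`) and parameters (`k`, `n_imfs`, `n_sifts`) are illustrative.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def _windows(img, k):
    """Return every k x k neighbourhood of img (edge-padded, k odd)."""
    padded = np.pad(img, k // 2, mode="edge")
    return sliding_window_view(padded, (k, k))


def sift_once(img, k=5):
    """One simplified sifting step: subtract the mean of two envelopes.

    Assumption: envelopes come from max/min windows plus box smoothing,
    not from interpolating surfaces through local extrema.
    """
    upper = _windows(img, k).max(axis=(-1, -2))
    lower = _windows(img, k).min(axis=(-1, -2))
    upper = _windows(upper, k).mean(axis=(-1, -2))
    lower = _windows(lower, k).mean(axis=(-1, -2))
    return img - (upper + lower) / 2.0


def bemd(img, n_imfs=3, n_sifts=3, k=5):
    """Split img into n_imfs IMF-like components and a residue."""
    residue = np.asarray(img, dtype=float)
    imfs = []
    for _ in range(n_imfs):
        h = residue
        for _ in range(n_sifts):  # fixed count instead of a stopping criterion
            h = sift_once(h, k)
        imfs.append(h)
        residue = residue - h
    return imfs, residue


def composite_similarity(a, b):
    """Hypothetical stand-in for CSM: mean of two similarity scores."""
    a, b = a.ravel(), b.ravel()
    a0, b0 = a - a.mean(), b - b.mean()
    denom = np.linalg.norm(a0) * np.linalg.norm(b0)
    corr = float(a0 @ b0 / denom) if denom > 0 else 0.0
    dist = 1.0 / (1.0 + np.linalg.norm(a - b) / np.sqrt(a.size))
    return 0.5 * (corr + dist)


def classify(test_img, train, n_imfs=2):
    """Rank emotion labels by best IMF-wise similarity to the test image.

    train is a list of (label, image) pairs with images of equal shape.
    """
    test_imfs, _ = bemd(test_img, n_imfs)
    scores = {}
    for label, img in train:
        imfs, _ = bemd(img, n_imfs)
        s = np.mean([composite_similarity(x, y)
                     for x, y in zip(test_imfs, imfs)])
        scores[label] = max(scores.get(label, -np.inf), s)
    return sorted(scores, key=scores.get, reverse=True)
```

Calling `classify(face, [("happy", img1), ("sad", img2)])` on equally sized grayscale crops returns the labels ordered from most to least similar, which mirrors the "classify or rank" behaviour described in the abstract. By construction, the components returned by `bemd` sum back to the input image together with the residue.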