基于复合相似测度辅助二维经验模态分解的面部图像情感识别

2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI) Pub Date : 1900-01-01 DOI:10.1109/CMI.2016.7413766

Arghya Bhattacharya, Dwaipayan Choudhury, D. Dey

{"title":"基于复合相似测度辅助二维经验模态分解的面部图像情感识别","authors":"Arghya Bhattacharya, Dwaipayan Choudhury, D. Dey","doi":"10.1109/CMI.2016.7413766","DOIUrl":null,"url":null,"abstract":"The aim of this work is to automatically detect and analyse the emotions from the digital videos and images. Initially the images are extracted from pre-recorded videos, from which the faces are cropped automatically. The training dataset is formed with minimal number of images per subject for each emotion. Bi-dimensional Empirical Mode Decomposition (BEMD) is used to decompose the images in its Intrinsic Mode Functions (IMF). Composite Similarity Measure (CSM) based classification has been employed to detect the correct emotion from the images. \"ENTERFACE'05 Audio-Visual Emotion Database\", \"JAFFE Database\" and a database developed in laboratory called \"DCAB database\" are used to test the performance of the proposed method. The advantage of this method is to be able to classify or rank the emotions found in an image or a video even when the image or video is subjected to feature occlusion such as the subject putting on spectacles or sunglasses. Moreover, it is robust to illumination, different view point and background colour of the image or video. The performance is also invariant to the dress, hair style, facial hair or moustache of the subject. This method is also able to overcome the problem related to ageing to some extent.","PeriodicalId":244262,"journal":{"name":"2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Emotion recognition from facial image analysis using composite similarity measure aided bidimensional empirical mode decomposition\",\"authors\":\"Arghya Bhattacharya, Dwaipayan Choudhury, D. Dey\",\"doi\":\"10.1109/CMI.2016.7413766\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The aim of this work is to automatically detect and analyse the emotions from the digital videos and images. Initially the images are extracted from pre-recorded videos, from which the faces are cropped automatically. The training dataset is formed with minimal number of images per subject for each emotion. Bi-dimensional Empirical Mode Decomposition (BEMD) is used to decompose the images in its Intrinsic Mode Functions (IMF). Composite Similarity Measure (CSM) based classification has been employed to detect the correct emotion from the images. \\\"ENTERFACE'05 Audio-Visual Emotion Database\\\", \\\"JAFFE Database\\\" and a database developed in laboratory called \\\"DCAB database\\\" are used to test the performance of the proposed method. The advantage of this method is to be able to classify or rank the emotions found in an image or a video even when the image or video is subjected to feature occlusion such as the subject putting on spectacles or sunglasses. Moreover, it is robust to illumination, different view point and background colour of the image or video. The performance is also invariant to the dress, hair style, facial hair or moustache of the subject. This method is also able to overcome the problem related to ageing to some extent.\",\"PeriodicalId\":244262,\"journal\":{\"name\":\"2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CMI.2016.7413766\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CMI.2016.7413766","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

这项工作的目的是从数字视频和图像中自动检测和分析情感。最初，图像是从预先录制的视频中提取出来的，人脸会被自动裁剪。训练数据集由每个主题的每个情感的最小数量的图像组成。采用二维经验模态分解(BEMD)对图像进行内禀模态函数分解。采用基于复合相似度度量的分类方法从图像中检测出正确的情感。使用“ENTERFACE’05视听情感数据库”、“JAFFE数据库”和实验室开发的数据库“DCAB数据库”来测试所提出方法的性能。这种方法的优点是能够对图像或视频中的情绪进行分类或排序，即使图像或视频受到特征遮挡，例如戴眼镜或太阳镜的对象。此外，该算法对图像或视频的光照、不同视点和背景颜色具有较强的鲁棒性。表演也不受服装、发型、面部毛发或小胡子的影响。这种方法在一定程度上也能够克服与衰老有关的问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Emotion recognition from facial image analysis using composite similarity measure aided bidimensional empirical mode decomposition

The aim of this work is to automatically detect and analyse the emotions from the digital videos and images. Initially the images are extracted from pre-recorded videos, from which the faces are cropped automatically. The training dataset is formed with minimal number of images per subject for each emotion. Bi-dimensional Empirical Mode Decomposition (BEMD) is used to decompose the images in its Intrinsic Mode Functions (IMF). Composite Similarity Measure (CSM) based classification has been employed to detect the correct emotion from the images. "ENTERFACE'05 Audio-Visual Emotion Database", "JAFFE Database" and a database developed in laboratory called "DCAB database" are used to test the performance of the proposed method. The advantage of this method is to be able to classify or rank the emotions found in an image or a video even when the image or video is subjected to feature occlusion such as the subject putting on spectacles or sunglasses. Moreover, it is robust to illumination, different view point and background colour of the image or video. The performance is also invariant to the dress, hair style, facial hair or moustache of the subject. This method is also able to overcome the problem related to ageing to some extent.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 IEEE First International Conference on Control, Measurement and Instrumentation (CMI)

自引率

0.00%

发文量