表达式检测的注册不变表示

2010 International Conference on Digital Image Computing: Techniques and Applications Pub Date : 2010-12-01 DOI:10.1109/DICTA.2010.53

P. Lucey, S. Lucey, J. Cohn

{"title":"表达式检测的注册不变表示","authors":"P. Lucey, S. Lucey, J. Cohn","doi":"10.1109/DICTA.2010.53","DOIUrl":null,"url":null,"abstract":"Active appearance model (AAM) representations have been used to great effect recently in the accurate detection of expression events (e.g., action units, pain, broad expressions, etc.). The motivation for their use, and rationale for their success, lies in their ability to: (i) provide dense (i.e. 60- 70 points on the face) registration accuracy on par with a human labeler, and (ii) the ability to decompose the registered face image to separate appearance and shape representations. Unfortunately, this human-like registration performance is isolated to registration algorithms that are specifically tuned to the illumination, camera and subject being tracked (i.e. \"subject dependent'' algorithms). As a result, it is rare, to see AAM representations being employed in the far more useful \"subject independent'' situations (i.e., where illumination, camera and subject is unknown) due to the inherent increased geometric noise present in the estimated registration. In this paper we argue that \"AAM like'' expression detection results can be obtained in the presence of noisy dense registration through the employment of registration invariant representations (e.g., Gabor magnitudes and HOG features). We demonstrate that good expression detection performance can still be enjoyed over the types of geometric noise often encountered with the more geometrically noisy state of the art generic algorithms (e.g., Bayesian Tangent Shape Models (BTSM), Constrained Local Models (CLM), etc). We show these results on the extended Cohn-Kanade (CK+) database over all facial action units.","PeriodicalId":246460,"journal":{"name":"2010 International Conference on Digital Image Computing: Techniques and Applications","volume":"53-54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":"{\"title\":\"Registration Invariant Representations for Expression Detection\",\"authors\":\"P. Lucey, S. Lucey, J. Cohn\",\"doi\":\"10.1109/DICTA.2010.53\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Active appearance model (AAM) representations have been used to great effect recently in the accurate detection of expression events (e.g., action units, pain, broad expressions, etc.). The motivation for their use, and rationale for their success, lies in their ability to: (i) provide dense (i.e. 60- 70 points on the face) registration accuracy on par with a human labeler, and (ii) the ability to decompose the registered face image to separate appearance and shape representations. Unfortunately, this human-like registration performance is isolated to registration algorithms that are specifically tuned to the illumination, camera and subject being tracked (i.e. \\\"subject dependent'' algorithms). As a result, it is rare, to see AAM representations being employed in the far more useful \\\"subject independent'' situations (i.e., where illumination, camera and subject is unknown) due to the inherent increased geometric noise present in the estimated registration. In this paper we argue that \\\"AAM like'' expression detection results can be obtained in the presence of noisy dense registration through the employment of registration invariant representations (e.g., Gabor magnitudes and HOG features). We demonstrate that good expression detection performance can still be enjoyed over the types of geometric noise often encountered with the more geometrically noisy state of the art generic algorithms (e.g., Bayesian Tangent Shape Models (BTSM), Constrained Local Models (CLM), etc). We show these results on the extended Cohn-Kanade (CK+) database over all facial action units.\",\"PeriodicalId\":246460,\"journal\":{\"name\":\"2010 International Conference on Digital Image Computing: Techniques and Applications\",\"volume\":\"53-54 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"20\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 International Conference on Digital Image Computing: Techniques and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DICTA.2010.53\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Digital Image Computing: Techniques and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2010.53","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 20

摘要

近年来，活动外观模型(AAM)表征在表情事件(如动作单元、疼痛、广义表情等)的准确检测中发挥了重要作用。使用它们的动机和它们成功的基本原理在于它们能够:(i)提供与人类标记器相当的密集(即面部上的60- 70个点)注册精度，以及(ii)分解注册的人脸图像以分离外观和形状表示的能力。不幸的是，这种类似人类的注册性能与专门针对照明、相机和被跟踪对象进行调整的注册算法是隔离的。“主题相关”算法)。因此，由于估计配准中存在固有的几何噪声增加，因此很少看到AAM表示被用于更有用的“主体独立”情况(即，照明，相机和主体未知的情况)。在本文中，我们认为通过使用配准不变量表示(例如，Gabor幅度和HOG特征)，可以在存在噪声密集配准的情况下获得“AAM样”表达检测结果。我们证明了良好的表达检测性能仍然可以享受到更多几何噪声的最先进的通用算法(例如，贝叶斯切线形状模型(BTSM)，约束局部模型(CLM)等)经常遇到的几何噪声类型。我们在所有面部动作单元的扩展Cohn-Kanade (CK+)数据库上展示了这些结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Registration Invariant Representations for Expression Detection

Active appearance model (AAM) representations have been used to great effect recently in the accurate detection of expression events (e.g., action units, pain, broad expressions, etc.). The motivation for their use, and rationale for their success, lies in their ability to: (i) provide dense (i.e. 60- 70 points on the face) registration accuracy on par with a human labeler, and (ii) the ability to decompose the registered face image to separate appearance and shape representations. Unfortunately, this human-like registration performance is isolated to registration algorithms that are specifically tuned to the illumination, camera and subject being tracked (i.e. "subject dependent'' algorithms). As a result, it is rare, to see AAM representations being employed in the far more useful "subject independent'' situations (i.e., where illumination, camera and subject is unknown) due to the inherent increased geometric noise present in the estimated registration. In this paper we argue that "AAM like'' expression detection results can be obtained in the presence of noisy dense registration through the employment of registration invariant representations (e.g., Gabor magnitudes and HOG features). We demonstrate that good expression detection performance can still be enjoyed over the types of geometric noise often encountered with the more geometrically noisy state of the art generic algorithms (e.g., Bayesian Tangent Shape Models (BTSM), Constrained Local Models (CLM), etc). We show these results on the extended Cohn-Kanade (CK+) database over all facial action units.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2010 International Conference on Digital Image Computing: Techniques and Applications

自引率

0.00%

发文量