使用Kinect的面部和骨骼模型进行可理解的人物识别

2016 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR) Pub Date : 2016-05-19 DOI:10.1109/AQTR.2016.7501309

F. Gossen, T. Margaria

{"title":"使用Kinect的面部和骨骼模型进行可理解的人物识别","authors":"F. Gossen, T. Margaria","doi":"10.1109/AQTR.2016.7501309","DOIUrl":null,"url":null,"abstract":"In recent years, significant progress has been made in facial recognition. New systems achieve remarkable recognition rates even in unconstrained environments and overcome humanlevel performance. However, these systems need millions of images to learn facial features. As a consequence, the learning process is computationally expensive. Although the resulting face representation consists of very few features only, it can hardly be understood as it is the result of a complex learning process. In an attempt to overcome these issues, we limit our approach to comprehensible features with clear interpretation. In this paper, we propose a set of comprehensible and stable geometric features based on the Kinect's face and skeleton model, that allow for the use of simple classifiers such as k-NN and LDA. These features include the estimated distances between facial feature points in space that are obtained from the Kinect's face model and geometric distances between joints of the Kinect's skeleton model. In order to obtain more stable features, we propose to aggregate sequences of these measures into a more stable and still comprehensible representation. We test the quality of the proposed features with two classification methods, k-Nearest Neighbour (k-NN) and Linear Discrimination Analysis (LDA) and report a recognition rate of up to 88% for k-NN and up to 89% for LDA on a data set with 37 individuals. The results show that the proposed features can be used to contribute to reliable and comprehensible people recognition.","PeriodicalId":110627,"journal":{"name":"2016 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Comprehensible people recognition using the Kinect's face and skeleton model\",\"authors\":\"F. Gossen, T. Margaria\",\"doi\":\"10.1109/AQTR.2016.7501309\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, significant progress has been made in facial recognition. New systems achieve remarkable recognition rates even in unconstrained environments and overcome humanlevel performance. However, these systems need millions of images to learn facial features. As a consequence, the learning process is computationally expensive. Although the resulting face representation consists of very few features only, it can hardly be understood as it is the result of a complex learning process. In an attempt to overcome these issues, we limit our approach to comprehensible features with clear interpretation. In this paper, we propose a set of comprehensible and stable geometric features based on the Kinect's face and skeleton model, that allow for the use of simple classifiers such as k-NN and LDA. These features include the estimated distances between facial feature points in space that are obtained from the Kinect's face model and geometric distances between joints of the Kinect's skeleton model. In order to obtain more stable features, we propose to aggregate sequences of these measures into a more stable and still comprehensible representation. We test the quality of the proposed features with two classification methods, k-Nearest Neighbour (k-NN) and Linear Discrimination Analysis (LDA) and report a recognition rate of up to 88% for k-NN and up to 89% for LDA on a data set with 37 individuals. The results show that the proposed features can be used to contribute to reliable and comprehensible people recognition.\",\"PeriodicalId\":110627,\"journal\":{\"name\":\"2016 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AQTR.2016.7501309\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AQTR.2016.7501309","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

近年来，人脸识别技术取得了重大进展。新系统即使在不受约束的环境中也能取得显著的识别率，并超越人类水平的表现。然而，这些系统需要数以百万计的图像来学习面部特征。因此，学习过程在计算上是昂贵的。虽然最终的人脸表征只包含很少的特征，但它很难被理解，因为它是一个复杂的学习过程的结果。为了克服这些问题，我们将方法限制在具有清晰解释的可理解特征上。在本文中，我们提出了一组基于Kinect的面部和骨骼模型的可理解和稳定的几何特征，允许使用简单的分类器，如k-NN和LDA。这些特征包括从Kinect面部模型中获得的面部特征点在空间中的估计距离，以及Kinect骨骼模型中关节之间的几何距离。为了获得更稳定的特征，我们建议将这些度量序列聚合成更稳定且仍然可理解的表示。我们用k-最近邻(k-NN)和线性判别分析(LDA)两种分类方法测试了所提出特征的质量，并报告了k-NN的识别率高达88%，LDA的识别率高达89%，数据集中有37个个体。结果表明，所提出的特征可以用于可靠和可理解的人物识别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Comprehensible people recognition using the Kinect's face and skeleton model

In recent years, significant progress has been made in facial recognition. New systems achieve remarkable recognition rates even in unconstrained environments and overcome humanlevel performance. However, these systems need millions of images to learn facial features. As a consequence, the learning process is computationally expensive. Although the resulting face representation consists of very few features only, it can hardly be understood as it is the result of a complex learning process. In an attempt to overcome these issues, we limit our approach to comprehensible features with clear interpretation. In this paper, we propose a set of comprehensible and stable geometric features based on the Kinect's face and skeleton model, that allow for the use of simple classifiers such as k-NN and LDA. These features include the estimated distances between facial feature points in space that are obtained from the Kinect's face model and geometric distances between joints of the Kinect's skeleton model. In order to obtain more stable features, we propose to aggregate sequences of these measures into a more stable and still comprehensible representation. We test the quality of the proposed features with two classification methods, k-Nearest Neighbour (k-NN) and Linear Discrimination Analysis (LDA) and report a recognition rate of up to 88% for k-NN and up to 89% for LDA on a data set with 37 individuals. The results show that the proposed features can be used to contribute to reliable and comprehensible people recognition.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR)

自引率

0.00%

发文量