基于深度信念网络的深度和颜色图像手语拼写分类

2014 Canadian Conference on Computer and Robot Vision Pub Date : 2014-05-06 DOI:10.1109/CRV.2014.20

Lucas Rioux-Maldague, P. Giguère

{"title":"基于深度信念网络的深度和颜色图像手语拼写分类","authors":"Lucas Rioux-Maldague, P. Giguère","doi":"10.1109/CRV.2014.20","DOIUrl":null,"url":null,"abstract":"Automatic sign language recognition is an open problem that has received a lot of attention recently, not only because of its usefulness to signers, but also due to the numerous applications a sign classifier can have. In this article, we present a new feature extraction technique for hand pose recognition using depth and intensity images captured from a Microsoft Kinect sensor. We applied our technique to American Sign Language finger spelling classification using a Deep Belief Network, for which our feature extraction technique is tailored. We evaluated our results on a multi-user data set with two scenarios: one with all known users and one with an unseen user. We achieved 99% recall and precision on the first, and 77% recall and 79% precision on the second. Our method is also capable of real-time sign classification and is adaptive to any environment or lightning intensity.","PeriodicalId":385422,"journal":{"name":"2014 Canadian Conference on Computer and Robot Vision","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"45","resultStr":"{\"title\":\"Sign Language Fingerspelling Classification from Depth and Color Images Using a Deep Belief Network\",\"authors\":\"Lucas Rioux-Maldague, P. Giguère\",\"doi\":\"10.1109/CRV.2014.20\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic sign language recognition is an open problem that has received a lot of attention recently, not only because of its usefulness to signers, but also due to the numerous applications a sign classifier can have. In this article, we present a new feature extraction technique for hand pose recognition using depth and intensity images captured from a Microsoft Kinect sensor. We applied our technique to American Sign Language finger spelling classification using a Deep Belief Network, for which our feature extraction technique is tailored. We evaluated our results on a multi-user data set with two scenarios: one with all known users and one with an unseen user. We achieved 99% recall and precision on the first, and 77% recall and 79% precision on the second. Our method is also capable of real-time sign classification and is adaptive to any environment or lightning intensity.\",\"PeriodicalId\":385422,\"journal\":{\"name\":\"2014 Canadian Conference on Computer and Robot Vision\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"45\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 Canadian Conference on Computer and Robot Vision\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CRV.2014.20\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Canadian Conference on Computer and Robot Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CRV.2014.20","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 45

摘要

自动手语识别是最近受到广泛关注的一个开放问题，不仅是因为它对手语者的有用性，还因为符号分类器可以有许多应用。在这篇文章中，我们提出了一种新的特征提取技术，用于手部姿势识别，使用从微软Kinect传感器捕获的深度和强度图像。我们将该技术应用于使用深度信念网络的美国手语手指拼写分类，我们的特征提取技术是为此量身定制的。我们用两种场景在一个多用户数据集上评估我们的结果:一种是所有已知用户，另一种是不可见用户。我们在第一次上达到了99%的查全率和查准率，在第二次上达到了77%的查全率和79%的查准率。我们的方法还能够实时标记分类，并适应任何环境或闪电强度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Sign Language Fingerspelling Classification from Depth and Color Images Using a Deep Belief Network

Automatic sign language recognition is an open problem that has received a lot of attention recently, not only because of its usefulness to signers, but also due to the numerous applications a sign classifier can have. In this article, we present a new feature extraction technique for hand pose recognition using depth and intensity images captured from a Microsoft Kinect sensor. We applied our technique to American Sign Language finger spelling classification using a Deep Belief Network, for which our feature extraction technique is tailored. We evaluated our results on a multi-user data set with two scenarios: one with all known users and one with an unseen user. We achieved 99% recall and precision on the first, and 77% recall and 79% precision on the second. Our method is also capable of real-time sign classification and is adaptive to any environment or lightning intensity.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2014 Canadian Conference on Computer and Robot Vision

自引率

0.00%

发文量