基于深度神经网络的智能视觉单手手势识别系统

R. Suguna, N. Rupavathy, R. Asmetha
{"title":"基于深度神经网络的智能视觉单手手势识别系统","authors":"R. Suguna, N. Rupavathy, R. Asmetha","doi":"10.4108/EAI.7-6-2021.2308783","DOIUrl":null,"url":null,"abstract":". The primary and expressive mode of human communications are gestures. Human can interact with machines using body postures and finger pointing. Advancements in human-computer interaction (HCI) has presented new innovations in technology making the users to communicate with computers in an instinctual manner. Evidences clearly state that future living space will be dominated by sensor-based devices and hence an efficient human-computer interfaces are required to exchange information. Hand gesture interfaces have been employed in multiple domains and has won social acceptance. System requirements for gesture recognition vary with the intended application areas. Responsiveness, Learnability, Cost and Accuracy are major drivers for success of hand gesture recognition systems. This paper suggest a HCI design that requires no wearable markers or gloves. A noninvasive vision based framework has been suggested for human-machine interface. Deep Neural Networks have provided promising results in vision based tasks. Convolutional Neural Networks (CNN) are claimed for image recognition problems as they learn features from images gradually and automatically. An optimal CNN architecture has been proposed to recognize single handed gestures. The images of hand gestures convey a numerical representation of ten digits. Image augmentation has been performed to increase the size of training data for deep learning. Depending on application, the interpretation of gesture can be customized. The classification performance has been analyzed with metrics reported by confusion matrix. The proposed architecture performs well both in training and testing reporting the accuracy of 98.2% and 96.2% respectively. Tuning the hyper parameter has improved test accuracy. Hand gesture recognition, Deep Learning, Convolutional Neural Network (CNN).","PeriodicalId":422301,"journal":{"name":"Proceedings of the First International Conference on Computing, Communication and Control System, I3CAC 2021, 7-8 June 2021, Bharath University, Chennai, India","volume":"104 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Smart Vision Based Single Handed Gesture Recognition system using deep neural networks\",\"authors\":\"R. Suguna, N. Rupavathy, R. Asmetha\",\"doi\":\"10.4108/EAI.7-6-2021.2308783\",\"DOIUrl\":null,\"url\":null,\"abstract\":\". The primary and expressive mode of human communications are gestures. Human can interact with machines using body postures and finger pointing. Advancements in human-computer interaction (HCI) has presented new innovations in technology making the users to communicate with computers in an instinctual manner. Evidences clearly state that future living space will be dominated by sensor-based devices and hence an efficient human-computer interfaces are required to exchange information. Hand gesture interfaces have been employed in multiple domains and has won social acceptance. System requirements for gesture recognition vary with the intended application areas. Responsiveness, Learnability, Cost and Accuracy are major drivers for success of hand gesture recognition systems. This paper suggest a HCI design that requires no wearable markers or gloves. A noninvasive vision based framework has been suggested for human-machine interface. Deep Neural Networks have provided promising results in vision based tasks. Convolutional Neural Networks (CNN) are claimed for image recognition problems as they learn features from images gradually and automatically. An optimal CNN architecture has been proposed to recognize single handed gestures. The images of hand gestures convey a numerical representation of ten digits. Image augmentation has been performed to increase the size of training data for deep learning. Depending on application, the interpretation of gesture can be customized. The classification performance has been analyzed with metrics reported by confusion matrix. The proposed architecture performs well both in training and testing reporting the accuracy of 98.2% and 96.2% respectively. Tuning the hyper parameter has improved test accuracy. Hand gesture recognition, Deep Learning, Convolutional Neural Network (CNN).\",\"PeriodicalId\":422301,\"journal\":{\"name\":\"Proceedings of the First International Conference on Computing, Communication and Control System, I3CAC 2021, 7-8 June 2021, Bharath University, Chennai, India\",\"volume\":\"104 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the First International Conference on Computing, Communication and Control System, I3CAC 2021, 7-8 June 2021, Bharath University, Chennai, India\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4108/EAI.7-6-2021.2308783\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the First International Conference on Computing, Communication and Control System, I3CAC 2021, 7-8 June 2021, Bharath University, Chennai, India","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4108/EAI.7-6-2021.2308783","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

. 人类交流的主要和表达方式是手势。人类可以通过身体姿势和手指与机器互动。人机交互(HCI)的发展带来了新的技术创新,使用户以一种本能的方式与计算机交流。有证据清楚地表明,未来的生活空间将由基于传感器的设备主导,因此需要高效的人机界面来交换信息。手势界面已经应用于多个领域,并赢得了社会的认可。手势识别的系统要求因预期的应用领域而异。响应性、易学性、成本和准确性是手势识别系统成功的主要驱动因素。本文提出了一种不需要可穿戴标记或手套的HCI设计。提出了一种基于无创视觉的人机界面框架。深度神经网络在基于视觉的任务中提供了有希望的结果。卷积神经网络(CNN)是一种逐渐自动地从图像中学习特征的技术,被认为可以解决图像识别问题。提出了一种最优的CNN架构来识别单手手势。手势的图像传达了十位数的数字表示。图像增强已被用于增加深度学习训练数据的大小。根据应用程序,手势的解释可以定制。用混淆矩阵报告的指标对分类性能进行了分析。该体系结构在训练和测试中均表现良好,准确率分别为98.2%和96.2%。超参数的调整提高了测试精度。手势识别,深度学习,卷积神经网络(CNN)
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Smart Vision Based Single Handed Gesture Recognition system using deep neural networks
. The primary and expressive mode of human communications are gestures. Human can interact with machines using body postures and finger pointing. Advancements in human-computer interaction (HCI) has presented new innovations in technology making the users to communicate with computers in an instinctual manner. Evidences clearly state that future living space will be dominated by sensor-based devices and hence an efficient human-computer interfaces are required to exchange information. Hand gesture interfaces have been employed in multiple domains and has won social acceptance. System requirements for gesture recognition vary with the intended application areas. Responsiveness, Learnability, Cost and Accuracy are major drivers for success of hand gesture recognition systems. This paper suggest a HCI design that requires no wearable markers or gloves. A noninvasive vision based framework has been suggested for human-machine interface. Deep Neural Networks have provided promising results in vision based tasks. Convolutional Neural Networks (CNN) are claimed for image recognition problems as they learn features from images gradually and automatically. An optimal CNN architecture has been proposed to recognize single handed gestures. The images of hand gestures convey a numerical representation of ten digits. Image augmentation has been performed to increase the size of training data for deep learning. Depending on application, the interpretation of gesture can be customized. The classification performance has been analyzed with metrics reported by confusion matrix. The proposed architecture performs well both in training and testing reporting the accuracy of 98.2% and 96.2% respectively. Tuning the hyper parameter has improved test accuracy. Hand gesture recognition, Deep Learning, Convolutional Neural Network (CNN).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信