Automatic Hand Gesture Recognition with Semantic Segmentation and Deep Learning

Bristy Chanda, H. Nyeem
{"title":"Automatic Hand Gesture Recognition with Semantic Segmentation and Deep Learning","authors":"Bristy Chanda, H. Nyeem","doi":"10.1109/icaeee54957.2022.9836425","DOIUrl":null,"url":null,"abstract":"Automatic Hand Gesture Recognition is a key requirement for variety of applications, including translation of Sign Language, Human-Computer Interaction (HCI) and, ubiquitous vision-based systems. Due to the lighting variance and complicated background in the input image set of gestures, meeting this criterion remains a challenge. This paper introduces semantic segmentation to deep learning-based hand gesture recognition system for sign language translation. Building on the U - Net architecture, the proposed model obtains the semantically segmented mask of the input image, which is then fed to convolutional neural networks (CNNs) for multiclass classification. The proposed model is trained and tested for four different depths of the CNN architectures followed by the comparison with some pre-trained CNN architectures such as Inception V3, VGG16, VGG19, ResNet50. The proposed model is evaluated on National University of Singapore (NUS) hand posture dataset II (subset A), which contains 2000 images in 10 classes. A significant recognition rate of 97.15 % is achieved for the proposed model outperforming a set of prominent models and demonstrating its promises for sign language translation.","PeriodicalId":383872,"journal":{"name":"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icaeee54957.2022.9836425","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Automatic Hand Gesture Recognition is a key requirement for variety of applications, including translation of Sign Language, Human-Computer Interaction (HCI) and, ubiquitous vision-based systems. Due to the lighting variance and complicated background in the input image set of gestures, meeting this criterion remains a challenge. This paper introduces semantic segmentation to deep learning-based hand gesture recognition system for sign language translation. Building on the U - Net architecture, the proposed model obtains the semantically segmented mask of the input image, which is then fed to convolutional neural networks (CNNs) for multiclass classification. The proposed model is trained and tested for four different depths of the CNN architectures followed by the comparison with some pre-trained CNN architectures such as Inception V3, VGG16, VGG19, ResNet50. The proposed model is evaluated on National University of Singapore (NUS) hand posture dataset II (subset A), which contains 2000 images in 10 classes. A significant recognition rate of 97.15 % is achieved for the proposed model outperforming a set of prominent models and demonstrating its promises for sign language translation.
基于语义分割和深度学习的自动手势识别
自动手势识别是各种应用的关键要求,包括手语翻译,人机交互(HCI)和无处不在的基于视觉的系统。由于手势输入图像集的光照变化和背景复杂,满足这一标准仍然是一个挑战。将语义分割引入到基于深度学习的手势识别系统中,用于手语翻译。该模型基于U - Net架构,获取输入图像的语义分割掩码,然后将其送入卷积神经网络进行多类分类。该模型对四种不同深度的CNN架构进行了训练和测试,然后与一些预训练的CNN架构(如Inception V3, VGG16, VGG19, ResNet50)进行了比较。该模型在新加坡国立大学(NUS)的手部姿势数据集II(子集A)上进行了评估,该数据集包含10个类别的2000张图像。该模型的识别率达到了97.15%,超过了一组著名的模型,证明了它在手语翻译方面的前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信