用卷积神经网络进行手势识别,帮助学龄儿童学习基本的算术运算

Alvaro Teran-Quezada, Victor Lopez-Cabrera, José Carlos Rangel, J. Sánchez-Galán
{"title":"用卷积神经网络进行手势识别,帮助学龄儿童学习基本的算术运算","authors":"Alvaro Teran-Quezada, Victor Lopez-Cabrera, José Carlos Rangel, J. Sánchez-Galán","doi":"10.1109/CONCAPAN48024.2022.9997680","DOIUrl":null,"url":null,"abstract":"Hand Gesture and Sign Language Recognition are recognized as non-trivial problems in the computer vision community. Deep Learning (DL) has provided both, a novel analytical framework and solutions. Moreover, Convolutional Neural Networks (CNN) have shown there are still goals to be met and solutions to be given regarding Hand Posture Recognition with the architectures and technologies we have today. This work proposes building a Hand Gesture Recognizer able to identify 13 classes: numbers from 0 to 9, and the signs corresponding to the arithmetic operations of addition, subtraction, and multiplication. The core model aims to translate digits and math symbols signed by students and teachers in a school environment to both make the teaching-learning process more engaging and to promote sign language learning in the student community. Based on Object Detection and leveraging a pretrained model using Transfer Learning, this DL model was retrained with a data set of 1,365 images (105 per class/sign) of Panamanian Sign Language hand-shapes. An accuracy of 88% was reached on the validation data, with proved usefulness on American Sign Language (ASL) for these hand-shapes, and being easily adaptable to other sign systems, including two-handed finger spelling ones like International Sign System (IS).","PeriodicalId":138415,"journal":{"name":"2022 IEEE 40th Central America and Panama Convention (CONCAPAN)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hand Gesture Recognition with ConvNets for School-Aged Children to Learn Basic Arithmetic Operations\",\"authors\":\"Alvaro Teran-Quezada, Victor Lopez-Cabrera, José Carlos Rangel, J. Sánchez-Galán\",\"doi\":\"10.1109/CONCAPAN48024.2022.9997680\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hand Gesture and Sign Language Recognition are recognized as non-trivial problems in the computer vision community. Deep Learning (DL) has provided both, a novel analytical framework and solutions. Moreover, Convolutional Neural Networks (CNN) have shown there are still goals to be met and solutions to be given regarding Hand Posture Recognition with the architectures and technologies we have today. This work proposes building a Hand Gesture Recognizer able to identify 13 classes: numbers from 0 to 9, and the signs corresponding to the arithmetic operations of addition, subtraction, and multiplication. The core model aims to translate digits and math symbols signed by students and teachers in a school environment to both make the teaching-learning process more engaging and to promote sign language learning in the student community. Based on Object Detection and leveraging a pretrained model using Transfer Learning, this DL model was retrained with a data set of 1,365 images (105 per class/sign) of Panamanian Sign Language hand-shapes. An accuracy of 88% was reached on the validation data, with proved usefulness on American Sign Language (ASL) for these hand-shapes, and being easily adaptable to other sign systems, including two-handed finger spelling ones like International Sign System (IS).\",\"PeriodicalId\":138415,\"journal\":{\"name\":\"2022 IEEE 40th Central America and Panama Convention (CONCAPAN)\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 40th Central America and Panama Convention (CONCAPAN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CONCAPAN48024.2022.9997680\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 40th Central America and Panama Convention (CONCAPAN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONCAPAN48024.2022.9997680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

手势和手语识别是计算机视觉界公认的重要问题。深度学习(DL)提供了一种新的分析框架和解决方案。此外,卷积神经网络(CNN)已经表明,在我们今天拥有的架构和技术下,手部姿势识别仍然有目标要实现,也有解决方案要给出。这项工作提出建立一个能够识别13类的手势识别器:从0到9的数字,以及与加法、减法和乘法的算术运算相对应的符号。核心模型旨在翻译由学生和教师在学校环境中签名的数字和数学符号,使教学过程更具吸引力,并促进学生群体的手语学习。基于对象检测并利用迁移学习的预训练模型,该深度学习模型使用1365张巴拿马手语手形图像(每个类/标志105张)的数据集进行了重新训练。验证数据的准确率达到88%,证明了这些手型在美国手语(ASL)上的有效性,并且很容易适应其他符号系统,包括像国际标志系统(IS)这样的双手手指拼写系统。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Hand Gesture Recognition with ConvNets for School-Aged Children to Learn Basic Arithmetic Operations
Hand Gesture and Sign Language Recognition are recognized as non-trivial problems in the computer vision community. Deep Learning (DL) has provided both, a novel analytical framework and solutions. Moreover, Convolutional Neural Networks (CNN) have shown there are still goals to be met and solutions to be given regarding Hand Posture Recognition with the architectures and technologies we have today. This work proposes building a Hand Gesture Recognizer able to identify 13 classes: numbers from 0 to 9, and the signs corresponding to the arithmetic operations of addition, subtraction, and multiplication. The core model aims to translate digits and math symbols signed by students and teachers in a school environment to both make the teaching-learning process more engaging and to promote sign language learning in the student community. Based on Object Detection and leveraging a pretrained model using Transfer Learning, this DL model was retrained with a data set of 1,365 images (105 per class/sign) of Panamanian Sign Language hand-shapes. An accuracy of 88% was reached on the validation data, with proved usefulness on American Sign Language (ASL) for these hand-shapes, and being easily adaptable to other sign systems, including two-handed finger spelling ones like International Sign System (IS).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信