用卷积神经网络进行手势识别，帮助学龄儿童学习基本的算术运算

2022 IEEE 40th Central America and Panama Convention (CONCAPAN) Pub Date : 2022-11-09 DOI:10.1109/CONCAPAN48024.2022.9997680

Alvaro Teran-Quezada, Victor Lopez-Cabrera, José Carlos Rangel, J. Sánchez-Galán

{"title":"用卷积神经网络进行手势识别，帮助学龄儿童学习基本的算术运算","authors":"Alvaro Teran-Quezada, Victor Lopez-Cabrera, José Carlos Rangel, J. Sánchez-Galán","doi":"10.1109/CONCAPAN48024.2022.9997680","DOIUrl":null,"url":null,"abstract":"Hand Gesture and Sign Language Recognition are recognized as non-trivial problems in the computer vision community. Deep Learning (DL) has provided both, a novel analytical framework and solutions. Moreover, Convolutional Neural Networks (CNN) have shown there are still goals to be met and solutions to be given regarding Hand Posture Recognition with the architectures and technologies we have today. This work proposes building a Hand Gesture Recognizer able to identify 13 classes: numbers from 0 to 9, and the signs corresponding to the arithmetic operations of addition, subtraction, and multiplication. The core model aims to translate digits and math symbols signed by students and teachers in a school environment to both make the teaching-learning process more engaging and to promote sign language learning in the student community. Based on Object Detection and leveraging a pretrained model using Transfer Learning, this DL model was retrained with a data set of 1,365 images (105 per class/sign) of Panamanian Sign Language hand-shapes. An accuracy of 88% was reached on the validation data, with proved usefulness on American Sign Language (ASL) for these hand-shapes, and being easily adaptable to other sign systems, including two-handed finger spelling ones like International Sign System (IS).","PeriodicalId":138415,"journal":{"name":"2022 IEEE 40th Central America and Panama Convention (CONCAPAN)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hand Gesture Recognition with ConvNets for School-Aged Children to Learn Basic Arithmetic Operations\",\"authors\":\"Alvaro Teran-Quezada, Victor Lopez-Cabrera, José Carlos Rangel, J. Sánchez-Galán\",\"doi\":\"10.1109/CONCAPAN48024.2022.9997680\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hand Gesture and Sign Language Recognition are recognized as non-trivial problems in the computer vision community. Deep Learning (DL) has provided both, a novel analytical framework and solutions. Moreover, Convolutional Neural Networks (CNN) have shown there are still goals to be met and solutions to be given regarding Hand Posture Recognition with the architectures and technologies we have today. This work proposes building a Hand Gesture Recognizer able to identify 13 classes: numbers from 0 to 9, and the signs corresponding to the arithmetic operations of addition, subtraction, and multiplication. The core model aims to translate digits and math symbols signed by students and teachers in a school environment to both make the teaching-learning process more engaging and to promote sign language learning in the student community. Based on Object Detection and leveraging a pretrained model using Transfer Learning, this DL model was retrained with a data set of 1,365 images (105 per class/sign) of Panamanian Sign Language hand-shapes. An accuracy of 88% was reached on the validation data, with proved usefulness on American Sign Language (ASL) for these hand-shapes, and being easily adaptable to other sign systems, including two-handed finger spelling ones like International Sign System (IS).\",\"PeriodicalId\":138415,\"journal\":{\"name\":\"2022 IEEE 40th Central America and Panama Convention (CONCAPAN)\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 40th Central America and Panama Convention (CONCAPAN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CONCAPAN48024.2022.9997680\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 40th Central America and Panama Convention (CONCAPAN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONCAPAN48024.2022.9997680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

手势和手语识别是计算机视觉界公认的重要问题。深度学习(DL)提供了一种新的分析框架和解决方案。此外，卷积神经网络(CNN)已经表明，在我们今天拥有的架构和技术下，手部姿势识别仍然有目标要实现，也有解决方案要给出。这项工作提出建立一个能够识别13类的手势识别器:从0到9的数字，以及与加法、减法和乘法的算术运算相对应的符号。核心模型旨在翻译由学生和教师在学校环境中签名的数字和数学符号，使教学过程更具吸引力，并促进学生群体的手语学习。基于对象检测并利用迁移学习的预训练模型，该深度学习模型使用1365张巴拿马手语手形图像(每个类/标志105张)的数据集进行了重新训练。验证数据的准确率达到88%，证明了这些手型在美国手语(ASL)上的有效性，并且很容易适应其他符号系统，包括像国际标志系统(IS)这样的双手手指拼写系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Hand Gesture Recognition with ConvNets for School-Aged Children to Learn Basic Arithmetic Operations

Hand Gesture and Sign Language Recognition are recognized as non-trivial problems in the computer vision community. Deep Learning (DL) has provided both, a novel analytical framework and solutions. Moreover, Convolutional Neural Networks (CNN) have shown there are still goals to be met and solutions to be given regarding Hand Posture Recognition with the architectures and technologies we have today. This work proposes building a Hand Gesture Recognizer able to identify 13 classes: numbers from 0 to 9, and the signs corresponding to the arithmetic operations of addition, subtraction, and multiplication. The core model aims to translate digits and math symbols signed by students and teachers in a school environment to both make the teaching-learning process more engaging and to promote sign language learning in the student community. Based on Object Detection and leveraging a pretrained model using Transfer Learning, this DL model was retrained with a data set of 1,365 images (105 per class/sign) of Panamanian Sign Language hand-shapes. An accuracy of 88% was reached on the validation data, with proved usefulness on American Sign Language (ASL) for these hand-shapes, and being easily adaptable to other sign systems, including two-handed finger spelling ones like International Sign System (IS).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE 40th Central America and Panama Convention (CONCAPAN)

自引率

0.00%

发文量