American Sign Language Recognition Based on Machine Learning and Neural Network

Lanxi Li, Da Liu, Chenlin Shen, Jing Sun
{"title":"American Sign Language Recognition Based on Machine Learning and Neural Network","authors":"Lanxi Li, Da Liu, Chenlin Shen, Jing Sun","doi":"10.1109/mlise57402.2022.00096","DOIUrl":null,"url":null,"abstract":"Numerous disabilities such as deaf and mute are suffered from not being capable of communicating with normal people, it is necessary to find a way to solve this problem. A feasible method is Sign Language Recognition (SLR) which is a sort of pattern recognition technique. In this paper, machine learning and deep learning methods are applied to recognize and classify American Sign Language (ASL), and only 24 English letters are classified because letter J and Z require fingers to move. First, Principal Component Analysis (PCA) and manifold algorithms are used to do dimension reduction to accelerate the training of machine learning and visualize it. Second, various machine learning methods such as Random Forest Classification (RFC), K-Nearest Neighbor (KNN), Gaussian Naïve Bayes (GNB), Support Vector Machine (SVM), and Stochastic Gradient Descent (SGD) are applied to classify the pattern. Since the SVM algorithm has several hyperparameters, this study uses the Grid Search method to find the best combination of hyperparameter which lead to predicting more accurately. It is found that different dimensionality reduction algorithms have unequal effects on the accuracy of each prediction model, and it can be concluded that the manifold algorithm is the best dimension reduction algorithm only for KNN but not for other prediction models, and PCA is much more feasible than KNN applied in such machine learning algorithms except KNN. Two deep learning algorithms such as Convolutional Neural Networks (CNN) and Deep Neural Networks (DNN) are also used in classification and their accuracy is highest among such algorithms mentioned above.","PeriodicalId":350291,"journal":{"name":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/mlise57402.2022.00096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Numerous disabilities such as deaf and mute are suffered from not being capable of communicating with normal people, it is necessary to find a way to solve this problem. A feasible method is Sign Language Recognition (SLR) which is a sort of pattern recognition technique. In this paper, machine learning and deep learning methods are applied to recognize and classify American Sign Language (ASL), and only 24 English letters are classified because letter J and Z require fingers to move. First, Principal Component Analysis (PCA) and manifold algorithms are used to do dimension reduction to accelerate the training of machine learning and visualize it. Second, various machine learning methods such as Random Forest Classification (RFC), K-Nearest Neighbor (KNN), Gaussian Naïve Bayes (GNB), Support Vector Machine (SVM), and Stochastic Gradient Descent (SGD) are applied to classify the pattern. Since the SVM algorithm has several hyperparameters, this study uses the Grid Search method to find the best combination of hyperparameter which lead to predicting more accurately. It is found that different dimensionality reduction algorithms have unequal effects on the accuracy of each prediction model, and it can be concluded that the manifold algorithm is the best dimension reduction algorithm only for KNN but not for other prediction models, and PCA is much more feasible than KNN applied in such machine learning algorithms except KNN. Two deep learning algorithms such as Convolutional Neural Networks (CNN) and Deep Neural Networks (DNN) are also used in classification and their accuracy is highest among such algorithms mentioned above.
基于机器学习和神经网络的美国手语识别
许多残疾人,如聋哑人,都遭受着无法与正常人交流的痛苦,有必要找到一种方法来解决这个问题。一种可行的方法是手语识别,它是一种模式识别技术。本文采用机器学习和深度学习方法对美国手语(ASL)进行识别和分类,由于字母J和Z需要手指移动,因此只对24个英文字母进行了分类。首先,利用主成分分析(PCA)和流形算法进行降维,以加速机器学习的训练并使其可视化。其次,采用随机森林分类(RFC)、k近邻(KNN)、高斯Naïve贝叶斯(GNB)、支持向量机(SVM)和随机梯度下降(SGD)等多种机器学习方法对模式进行分类。由于支持向量机算法有多个超参数,本研究采用网格搜索方法寻找超参数的最佳组合,使预测更加准确。研究发现,不同的降维算法对每个预测模型的准确率影响不均匀,可以得出流形算法仅对KNN是最好的降维算法,而对其他预测模型则不是,PCA在除KNN外的机器学习算法中应用比KNN可行得多。卷积神经网络(Convolutional Neural Networks, CNN)和深度神经网络(deep Neural Networks, DNN)这两种深度学习算法也被用于分类,其准确率是上述算法中最高的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信