Leveraging edge detection techniques to enhance Arabic sign language static-gesture recognition using deep learning

IF 2.2 4区 工程技术 Q3 ENGINEERING, MULTIDISCIPLINARY
Journal of Engineering Research Pub Date : 2026-03-01 Epub Date: 2025-09-24 DOI:10.1016/j.jer.2025.09.011
Wahiba Ismaiel , Lilia kechiche , Yassine Aribi , Omer Salih Dawood Omer , Walied Merghani
{"title":"Leveraging edge detection techniques to enhance Arabic sign language static-gesture recognition using deep learning","authors":"Wahiba Ismaiel ,&nbsp;Lilia kechiche ,&nbsp;Yassine Aribi ,&nbsp;Omer Salih Dawood Omer ,&nbsp;Walied Merghani","doi":"10.1016/j.jer.2025.09.011","DOIUrl":null,"url":null,"abstract":"<div><div>Recently, there has been a growing interest in developing solutions to address communication barriers for the deaf and hard-of-hearing community. Sign language is the primary language of this group. Computer vision technology is used to process sign language due to its ease of application. Sign language recognition involves the use of technology to bridge communication gaps and enhance accessibility for individuals who use sign language as their primary form of communication. Many researchers have presented various methods to facilitate communication, among others. These methods include sign language recognition techniques, translation between sign and text or audio, and hand gesture identification, among others. We proposed an effective approach to improve the feature extraction process for Arabic sign gesture recognition. Feature extraction is a crucial aspect of deep learning models because it facilitates data processing, improves performance, and helps interpret results. This process also enables models to manage large datasets more efficiently. We presented two deep learning models: the agile convolutional neural network (ASLR_CNN) and ResNet50, to improve the comprehensiveness of the extracted features. These models were combined with the Canny Edge Detector (CED), which identifies the edges of Arabic hand gestures, as well as the complex features extracted from the edges by the proposed models. To evaluate the effectiveness of our methodology, we trained the proposed models on two public datasets: AASL and ArASL. The performance of these models was evaluated using a variety of metrics, including accuracy, precision, recall, F-score, and confusion matrix. The results indicated that both the ASLR_CNN and ResNet50 models achieved high accuracy on the ArASL dataset, reaching 97.14 % and 96.88 %, respectively. In contrast, the accuracy dropped to 89.49 % and 86.12 % for the ASLR_CNN and ResNet50 models, respectively, when using the AASL dataset.</div></div>","PeriodicalId":48803,"journal":{"name":"Journal of Engineering Research","volume":"14 1","pages":"Pages 896-915"},"PeriodicalIF":2.2000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Engineering Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2307187725005711","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/24 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Recently, there has been a growing interest in developing solutions to address communication barriers for the deaf and hard-of-hearing community. Sign language is the primary language of this group. Computer vision technology is used to process sign language due to its ease of application. Sign language recognition involves the use of technology to bridge communication gaps and enhance accessibility for individuals who use sign language as their primary form of communication. Many researchers have presented various methods to facilitate communication, among others. These methods include sign language recognition techniques, translation between sign and text or audio, and hand gesture identification, among others. We proposed an effective approach to improve the feature extraction process for Arabic sign gesture recognition. Feature extraction is a crucial aspect of deep learning models because it facilitates data processing, improves performance, and helps interpret results. This process also enables models to manage large datasets more efficiently. We presented two deep learning models: the agile convolutional neural network (ASLR_CNN) and ResNet50, to improve the comprehensiveness of the extracted features. These models were combined with the Canny Edge Detector (CED), which identifies the edges of Arabic hand gestures, as well as the complex features extracted from the edges by the proposed models. To evaluate the effectiveness of our methodology, we trained the proposed models on two public datasets: AASL and ArASL. The performance of these models was evaluated using a variety of metrics, including accuracy, precision, recall, F-score, and confusion matrix. The results indicated that both the ASLR_CNN and ResNet50 models achieved high accuracy on the ArASL dataset, reaching 97.14 % and 96.88 %, respectively. In contrast, the accuracy dropped to 89.49 % and 86.12 % for the ASLR_CNN and ResNet50 models, respectively, when using the AASL dataset.
利用边缘检测技术增强使用深度学习的阿拉伯手语静态手势识别
最近,人们对开发解决聋人和听障群体沟通障碍的解决方案越来越感兴趣。手语是这个群体的主要语言。计算机视觉技术因其易于应用而被用于处理手语。手语识别涉及使用技术来弥合沟通差距,并提高使用手语作为主要沟通形式的个人的可及性。许多研究人员提出了各种各样的方法来促进交流。这些方法包括手语识别技术、手语与文本或音频之间的翻译以及手势识别等。提出了一种改进阿拉伯手势识别特征提取过程的有效方法。特征提取是深度学习模型的一个关键方面,因为它简化了数据处理,提高了性能,并有助于解释结果。这个过程还使模型能够更有效地管理大型数据集。为了提高特征提取的全面性,我们提出了两种深度学习模型:敏捷卷积神经网络(ASLR_CNN)和ResNet50。这些模型与Canny边缘检测器(CED)相结合,Canny边缘检测器识别阿拉伯手势的边缘,以及由所提出的模型从边缘提取的复杂特征。为了评估我们方法的有效性,我们在两个公共数据集上训练了所提出的模型:AASL和ArASL。使用各种指标评估这些模型的性能,包括准确性、精密度、召回率、f分数和混淆矩阵。结果表明,ASLR_CNN和ResNet50模型在ArASL数据集上均取得了较高的准确率,分别达到97.14 %和96.88 %。相比之下,当使用AASL数据集时,ASLR_CNN和ResNet50模型的准确率分别下降到89.49 %和86.12 %。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Engineering Research
Journal of Engineering Research ENGINEERING, MULTIDISCIPLINARY-
CiteScore
1.60
自引率
10.00%
发文量
181
审稿时长
20 weeks
期刊介绍: Journal of Engineering Research (JER) is a international, peer reviewed journal which publishes full length original research papers, reviews, case studies related to all areas of Engineering such as: Civil, Mechanical, Industrial, Electrical, Computer, Chemical, Petroleum, Aerospace, Architectural, Biomedical, Coastal, Environmental, Marine & Ocean, Metallurgical & Materials, software, Surveying, Systems and Manufacturing Engineering. In particular, JER focuses on innovative approaches and methods that contribute to solving the environmental and manufacturing problems, which exist primarily in the Arabian Gulf region and the Middle East countries. Kuwait University used to publish the Journal "Kuwait Journal of Science and Engineering" (ISSN: 1024-8684), which included Science and Engineering articles since 1974. In 2011 the decision was taken to split KJSE into two independent Journals - "Journal of Engineering Research "(JER) and "Kuwait Journal of Science" (KJS).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书