Title: Leveraging edge detection techniques to enhance Arabic sign language static-gesture recognition using deep learning
Authors: Wahiba Ismaiel, Lilia Kechiche, Yassine Aribi, Omer Salih Dawood Omer, Walied Merghani
Journal: Journal of Engineering Research, Volume 14, Issue 1, Pages 896-915 (publication date: 2026-03-01; Epub: 2025-09-24)
DOI: 10.1016/j.jer.2025.09.011
URL: https://www.sciencedirect.com/science/article/pii/S2307187725005711
Impact factor: 2.2 (JCR Q3, Engineering, Multidisciplinary)
Citations: 0
Abstract
Recently, there has been growing interest in developing solutions that address communication barriers for the deaf and hard-of-hearing community, for whom sign language is the primary language. Computer vision is widely used to process sign language because it is easy to apply. Sign language recognition uses technology to bridge communication gaps and improve accessibility for individuals who rely on sign language as their primary form of communication. Researchers have proposed many methods to facilitate this communication, including sign language recognition techniques, translation between sign and text or audio, and hand gesture identification. We propose an effective approach that improves the feature extraction process for Arabic sign gesture recognition. Feature extraction is a crucial stage of deep learning models: it facilitates data processing, improves performance, helps interpret results, and enables models to handle large datasets more efficiently. We present two deep learning models, an agile convolutional neural network (ASLR_CNN) and ResNet50, to improve the comprehensiveness of the extracted features. Both models are combined with the Canny Edge Detector (CED), which identifies the edges of Arabic hand gestures; the models then extract complex features from those edges. To evaluate the effectiveness of the methodology, we trained the proposed models on two public datasets, AASL and ArASL, and measured performance with a variety of metrics, including accuracy, precision, recall, F-score, and the confusion matrix. Both ASLR_CNN and ResNet50 achieved high accuracy on the ArASL dataset, reaching 97.14 % and 96.88 %, respectively. In contrast, accuracy dropped to 89.49 % and 86.12 % for ASLR_CNN and ResNet50, respectively, on the AASL dataset.
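The abstract describes feeding gesture images through the Canny Edge Detector before the CNNs extract features. The paper's implementation is not shown here; the following is a minimal, simplified sketch of that style of preprocessing using only NumPy: Sobel gradient magnitude followed by double-threshold hysteresis (keeping weak edge pixels only when they touch a strong one). The threshold values and function names are illustrative assumptions, and the full Canny pipeline also includes Gaussian smoothing and non-maximum suppression, which are omitted for brevity.

```python
import numpy as np

def sobel_magnitude(img):
    """Gradient magnitude via 3x3 Sobel kernels (valid convolution)."""
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)
    ky = kx.T
    h, w = img.shape
    gx = np.zeros((h - 2, w - 2))
    gy = np.zeros((h - 2, w - 2))
    for i in range(h - 2):
        for j in range(w - 2):
            patch = img[i:i + 3, j:j + 3]
            gx[i, j] = np.sum(patch * kx)
            gy[i, j] = np.sum(patch * ky)
    return np.hypot(gx, gy)

def edge_map(img, low=100.0, high=600.0):
    """Simplified Canny-style edge map: double-threshold the gradient
    magnitude; weak pixels survive only if 8-adjacent to a strong pixel.
    The low/high thresholds here are hypothetical, not from the paper."""
    mag = sobel_magnitude(img.astype(float))
    strong = mag >= high
    weak = (mag >= low) & ~strong
    # Dilate the strong mask by one pixel to test 8-adjacency.
    padded = np.pad(strong, 1)
    h, w = strong.shape
    near_strong = np.zeros_like(strong)
    for di in (-1, 0, 1):
        for dj in (-1, 0, 1):
            near_strong |= padded[1 + di:1 + di + h, 1 + dj:1 + dj + w]
    return strong | (weak & near_strong)
```

In practice such an edge map (or the edge-filtered image) would be stacked with or substituted for the raw gesture image before being passed to ASLR_CNN or ResNet50; production code would typically call an optimized implementation such as OpenCV's `cv2.Canny` instead.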
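The evaluation metrics listed in the abstract (accuracy, precision, recall, F-score, confusion matrix) all derive from a single confusion matrix. As an illustrative sketch (not the authors' evaluation code), the helper below builds the matrix from integer class labels and reports accuracy plus macro-averaged precision, recall, and F-score; the function name and averaging choice are assumptions.

```python
import numpy as np

def evaluate(y_true, y_pred, n_classes):
    """Confusion matrix plus accuracy and macro-averaged
    precision, recall, and F-score for integer class labels."""
    cm = np.zeros((n_classes, n_classes), dtype=int)
    for t, p in zip(y_true, y_pred):
        cm[t, p] += 1  # rows: true class, columns: predicted class
    accuracy = np.trace(cm) / cm.sum()
    diag = np.diag(cm)
    col = cm.sum(axis=0)  # predicted counts per class
    row = cm.sum(axis=1)  # true counts per class
    # Guard against classes that were never predicted / never occurred.
    precision = np.where(col > 0, diag / np.maximum(col, 1), 0.0)
    recall = np.where(row > 0, diag / np.maximum(row, 1), 0.0)
    denom = precision + recall
    f1 = np.where(denom > 0, 2 * precision * recall / np.maximum(denom, 1e-12), 0.0)
    return {
        "confusion_matrix": cm,
        "accuracy": accuracy,
        "precision": precision.mean(),  # macro average
        "recall": recall.mean(),
        "f_score": f1.mean(),
    }
```

With real experiments one would normally rely on `sklearn.metrics.classification_report` and `confusion_matrix`, which implement the same quantities with more averaging options.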
Journal introduction:
Journal of Engineering Research (JER) is an international, peer-reviewed journal that publishes full-length original research papers, reviews, and case studies related to all areas of engineering, such as: Civil, Mechanical, Industrial, Electrical, Computer, Chemical, Petroleum, Aerospace, Architectural, Biomedical, Coastal, Environmental, Marine & Ocean, Metallurgical & Materials, Software, Surveying, Systems, and Manufacturing Engineering. In particular, JER focuses on innovative approaches and methods that contribute to solving environmental and manufacturing problems, primarily those found in the Arabian Gulf region and the Middle East. Kuwait University formerly published the journal "Kuwait Journal of Science and Engineering" (ISSN: 1024-8684), which had carried science and engineering articles since 1974. In 2011, the decision was taken to split KJSE into two independent journals: "Journal of Engineering Research" (JER) and "Kuwait Journal of Science" (KJS).