{"title":"Two stream GRU model with ELU activation function for sign language recognition","authors":"Kasian Myagila , Devotha Godfrey Nyambo , Mussa Ally Dida","doi":"10.1016/j.iswa.2025.200513","DOIUrl":null,"url":null,"abstract":"<div><div>Pose Estimation features have been successfully used in human activity recognition including sign language recognition. One of the key challenges in sign language recognition is handling signer-independent modes and hand dominance of signer. This paper proposes the use of the Gated Recurrent Unit (GRU) with the ELU activation function to improve computation efficiency and to enhance model learning efficiency. In addition, the paper proposes two stream model architecture to address the challenge of left and right-hand dominance. The study developed model using a Tanzania Sign language datasets collected using mobile devices and extracted pose estimation feature using MediaPipe holistic framework. According to the results, the proposed model not only achieves an impressive overall accuracy of 95%, but also trains more efficiently than comparable algorithms. Particularly in the signer-independent mode, the two-stream approach led to substantial improvements, achieving a maximum accuracy of 92% and a minimum accuracy of 70% with significant increase on the left handed signer accuracy by 37%. The results highlight the effectiveness of the two-stream approach in overcoming challenges related to left and right-hand dominance, which often arise from signer-specific hand dominance. Additionally, the results indicate that, the proposed model can have a positive impact on limited computational resources while also enhancing the model’s overall performance.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"26 ","pages":"Article 200513"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Systems with Applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667305325000390","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Pose Estimation features have been successfully used in human activity recognition including sign language recognition. One of the key challenges in sign language recognition is handling signer-independent modes and hand dominance of signer. This paper proposes the use of the Gated Recurrent Unit (GRU) with the ELU activation function to improve computation efficiency and to enhance model learning efficiency. In addition, the paper proposes two stream model architecture to address the challenge of left and right-hand dominance. The study developed model using a Tanzania Sign language datasets collected using mobile devices and extracted pose estimation feature using MediaPipe holistic framework. According to the results, the proposed model not only achieves an impressive overall accuracy of 95%, but also trains more efficiently than comparable algorithms. Particularly in the signer-independent mode, the two-stream approach led to substantial improvements, achieving a maximum accuracy of 92% and a minimum accuracy of 70% with significant increase on the left handed signer accuracy by 37%. The results highlight the effectiveness of the two-stream approach in overcoming challenges related to left and right-hand dominance, which often arise from signer-specific hand dominance. Additionally, the results indicate that, the proposed model can have a positive impact on limited computational resources while also enhancing the model’s overall performance.