{"title":"Lightweight sign language intelligent recognition model based on improved R-C3D","authors":"Haofei Chen, Chang’an Di","doi":"10.1016/j.eij.2025.100801","DOIUrl":null,"url":null,"abstract":"<div><div>The study proposes a continuous dynamic sign language recognition model based on an improved regional 3D convolutional network. A 3D convolutional network is taken as a special extraction sub-network, and the depth separable convolution is introduced into the 3D convolutional network to reduce computational costs. The inverted residual results are taken to avoid information loss issues. In addition, the pre-selection box size of the optimized region 3D convolutional network is shortened, and the action judgment threshold is increased to improve the action accuracy. The average accuracy of the improved 3D convolutional network was 44.2 %, which was higher than that of other types of feature extraction sub-networks. After reducing the pre-selection box, the average accuracy of the time suggestion sub-network increased from 41.6 % to 44.5 %. The loss value also decreased from 0.5 to 0.46. After increasing the action judgment threshold from 0.5 to 0.7, the loss value decreased from 0.58 to 0.17. The loss value of the 3D convolutional network in the entire improved area was only 0.15, the sign language recognition speed was 183 ms, and the average accuracy was 44.6 %, which was better than those of other sign language recognition schemes. The above results indicate that the improved regional 3D convolutional network can accurately and quickly recognize continuous sign language actions.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"32 ","pages":"Article 100801"},"PeriodicalIF":4.3000,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S111086652500194X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The study proposes a continuous dynamic sign language recognition model based on an improved regional 3D convolutional network. A 3D convolutional network is taken as a special extraction sub-network, and the depth separable convolution is introduced into the 3D convolutional network to reduce computational costs. The inverted residual results are taken to avoid information loss issues. In addition, the pre-selection box size of the optimized region 3D convolutional network is shortened, and the action judgment threshold is increased to improve the action accuracy. The average accuracy of the improved 3D convolutional network was 44.2 %, which was higher than that of other types of feature extraction sub-networks. After reducing the pre-selection box, the average accuracy of the time suggestion sub-network increased from 41.6 % to 44.5 %. The loss value also decreased from 0.5 to 0.46. After increasing the action judgment threshold from 0.5 to 0.7, the loss value decreased from 0.58 to 0.17. The loss value of the 3D convolutional network in the entire improved area was only 0.15, the sign language recognition speed was 183 ms, and the average accuracy was 44.6 %, which was better than those of other sign language recognition schemes. The above results indicate that the improved regional 3D convolutional network can accurately and quickly recognize continuous sign language actions.
期刊介绍:
The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.