基于凸性缺陷和霍夫线的手语识别

International Journal of Software Engineering and Computer Systems Pub Date : 2020-09-28 DOI:10.15344/2456-4451/2020/158

M. Cheok, Z. Omar, M. Jaward

{"title":"基于凸性缺陷和霍夫线的手语识别","authors":"M. Cheok, Z. Omar, M. Jaward","doi":"10.15344/2456-4451/2020/158","DOIUrl":null,"url":null,"abstract":"This paper presents a novel feature extraction framework for sign language recognition application to assist the deaf and speech-impaired in daily communication. Our method extracts visual features from the hand gesture using convexity defects, K-curvature and Hough line which can differentiate visually similar signs. In segmentation stage, Canny edge detection and skin color segmentation with histogram backprojection method are used to extract the hand region from the background. The features to be extracted are categorized into shape, orientation and motion. The shape features of the sign language will then be extracted using convexity defect, K-curvature and Hough line techniques. The orientation feature will be extracted by calculating the palm center to wrist angle. The trajectory motion of the dynamic gesture is extracted using the chain code method. Lastly, Decision Tree classification is employed in classification of both static and dynamic gestures. The proposed framework is carried out in smartphone platform to recognize 26 alphabets, 10 numbers and eight dynamic American Sign Language (ASL). The average accuracy of 29 static ASL achieved is 85.72% and average accuracy of 10 dynamic ASL achieved is 77%. Through this research, it is found that the proposed framework can recognize larger sign languages database as compared with previous convexity defect-based sign language recognition research. *Corresponding Author: Ming Jin Cheok, Universiti Teknologi Malaysia, Skudai, 81300, Malaysia; E-mail: mingjin_91@hotmail.com Citation: Cheok MJ, Omar Z, Jaward MH (2020) Sign Language Recognition Using Convexity Defects and Hough Line. Int J Comput Softw Eng 5: 158. doi: https:// doi.org/10.15344/2456-4451/2020/158 Copyright: © 2020 Cheok et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Some notable appearance-based feature extraction method includes the extraction of Shift-Invariant Feature Transform (SIFT) features from the gesture. For example, in paper [13], SIFT features are extracted from six signs and the features are rotational invariant. Research in [14,15] also extracted SIFT features from the hand gestures, and the features are simplified by first quantizing with K-means clustering and then mapped into Bag-of-Features (BoF). Representing SIFT features using BoF reduces and uniforms the dimensionality of each SIFT features extracted. Speeded Up Robust Feature (SURF) is another notable feature extraction method proposed in [16]. The authors in paper [17] extracted SURF features from a moving hand gestures of consecutive frames, and by analyzing correlation between SURF points, classification of 26 gestures achieves 84.6% accuracy. Computational performance of SIFT and SURF has been compared in paper [15], as the extraction of features is usually a computationally heavy process and hence the efficiency of the technique plays an important role. It is showed that SURF has a faster processing speed. Principle Component Analysis (PCA) is another commonly extracted features in hand gesture recognition research. In paper [18], PCA is extracted together with kurtosis position and chain code, where kurtosis position is used in finding edges and chain code is used in tracking the hand trajectory. The hybrid of these three features has shown to outperform any features alone with an error rate of 10.91%. Model-based sign language recognition based on convexity defect features has been widely applied in research in this field [19-30]. The advantage of using this approach as compared to other appearanceInternational Journal of Computer & Software Engineering Ming Jin Cheok1,*, Zaid Omar1 and Mohamed H. Jaward2 1Universiti Teknologi Malaysia, Skudai, 81300, Malaysia 2School of Engineering, Monash University Subang Jaya, 47500, Malaysia Int J Comput Softw Eng IJCSE, an open access journal ISSN: 2456-4451 Volume 5. 2020. 158 Cheok et al,. Int J Comput Softw Eng 2020, 5: 158 https://doi.org/10.15344/2456-4451/2020/158","PeriodicalId":31240,"journal":{"name":"International Journal of Software Engineering and Computer Systems","volume":"34 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sign Language Recognition Using Convexity Defects and Hough Line\",\"authors\":\"M. Cheok, Z. Omar, M. Jaward\",\"doi\":\"10.15344/2456-4451/2020/158\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a novel feature extraction framework for sign language recognition application to assist the deaf and speech-impaired in daily communication. Our method extracts visual features from the hand gesture using convexity defects, K-curvature and Hough line which can differentiate visually similar signs. In segmentation stage, Canny edge detection and skin color segmentation with histogram backprojection method are used to extract the hand region from the background. The features to be extracted are categorized into shape, orientation and motion. The shape features of the sign language will then be extracted using convexity defect, K-curvature and Hough line techniques. The orientation feature will be extracted by calculating the palm center to wrist angle. The trajectory motion of the dynamic gesture is extracted using the chain code method. Lastly, Decision Tree classification is employed in classification of both static and dynamic gestures. The proposed framework is carried out in smartphone platform to recognize 26 alphabets, 10 numbers and eight dynamic American Sign Language (ASL). The average accuracy of 29 static ASL achieved is 85.72% and average accuracy of 10 dynamic ASL achieved is 77%. Through this research, it is found that the proposed framework can recognize larger sign languages database as compared with previous convexity defect-based sign language recognition research. *Corresponding Author: Ming Jin Cheok, Universiti Teknologi Malaysia, Skudai, 81300, Malaysia; E-mail: mingjin_91@hotmail.com Citation: Cheok MJ, Omar Z, Jaward MH (2020) Sign Language Recognition Using Convexity Defects and Hough Line. Int J Comput Softw Eng 5: 158. doi: https:// doi.org/10.15344/2456-4451/2020/158 Copyright: © 2020 Cheok et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Some notable appearance-based feature extraction method includes the extraction of Shift-Invariant Feature Transform (SIFT) features from the gesture. For example, in paper [13], SIFT features are extracted from six signs and the features are rotational invariant. Research in [14,15] also extracted SIFT features from the hand gestures, and the features are simplified by first quantizing with K-means clustering and then mapped into Bag-of-Features (BoF). Representing SIFT features using BoF reduces and uniforms the dimensionality of each SIFT features extracted. Speeded Up Robust Feature (SURF) is another notable feature extraction method proposed in [16]. The authors in paper [17] extracted SURF features from a moving hand gestures of consecutive frames, and by analyzing correlation between SURF points, classification of 26 gestures achieves 84.6% accuracy. Computational performance of SIFT and SURF has been compared in paper [15], as the extraction of features is usually a computationally heavy process and hence the efficiency of the technique plays an important role. It is showed that SURF has a faster processing speed. Principle Component Analysis (PCA) is another commonly extracted features in hand gesture recognition research. In paper [18], PCA is extracted together with kurtosis position and chain code, where kurtosis position is used in finding edges and chain code is used in tracking the hand trajectory. The hybrid of these three features has shown to outperform any features alone with an error rate of 10.91%. Model-based sign language recognition based on convexity defect features has been widely applied in research in this field [19-30]. The advantage of using this approach as compared to other appearanceInternational Journal of Computer & Software Engineering Ming Jin Cheok1,*, Zaid Omar1 and Mohamed H. Jaward2 1Universiti Teknologi Malaysia, Skudai, 81300, Malaysia 2School of Engineering, Monash University Subang Jaya, 47500, Malaysia Int J Comput Softw Eng IJCSE, an open access journal ISSN: 2456-4451 Volume 5. 2020. 158 Cheok et al,. Int J Comput Softw Eng 2020, 5: 158 https://doi.org/10.15344/2456-4451/2020/158\",\"PeriodicalId\":31240,\"journal\":{\"name\":\"International Journal of Software Engineering and Computer Systems\",\"volume\":\"34 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Software Engineering and Computer Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15344/2456-4451/2020/158\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Software Engineering and Computer Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15344/2456-4451/2020/158","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种新的特征提取框架，用于手语识别应用，以帮助聋人和语言障碍者进行日常交流。我们的方法利用凸度缺陷、k曲率和霍夫线来区分视觉上相似的符号，从手势中提取视觉特征。在分割阶段，采用Canny边缘检测和直方图反投影法进行肤色分割，从背景中提取手部区域。要提取的特征分为形状、方向和运动。然后使用凸性缺陷、k曲率和霍夫线技术提取手语的形状特征。方向特征将通过计算手掌中心到手腕的角度来提取。采用链码法提取动态手势的运动轨迹。最后，采用决策树分类方法对静态和动态手势进行分类。该框架在智能手机平台上实现了对26个字母、10个数字和8种动态美国手语的识别。29例静态ASL的平均准确率为85.72%，10例动态ASL的平均准确率为77%。通过研究发现，与以往基于凸性缺陷的手语识别研究相比，所提出的框架可以识别更大的手语数据库。*通讯作者:Ming Jin Cheok，马来西亚科技大学，Skudai, 81300，马来西亚;引用本文:Cheok MJ, Omar Z, Jaward MH(2020)基于凸性缺陷和霍夫线的手语识别。[J] .计算机工程与软件学报(英文版)，第5期。doi: https:// doi.org/10.15344/2456-4451/2020/158版权所有:©2020 Cheok等。这是一篇根据知识共享署名许可协议发布的开放获取文章，该协议允许在任何媒体上不受限制地使用、分发和复制，前提是要注明原作者和来源。一些值得注意的基于外观的特征提取方法包括从手势中提取移位不变特征变换(SIFT)特征。例如，在论文[13]中，从六个符号中提取SIFT特征，特征是旋转不变性的。[14,15]的研究也从手势中提取了SIFT特征，并首先使用K-means聚类对特征进行量化，然后将特征映射到特征袋(Bag-of-Features, BoF)中。用BoF表示SIFT特征降低并统一了提取的每个SIFT特征的维数。加速鲁棒特征(SURF)是[16]中提出的另一种值得注意的特征提取方法。论文[17]从连续帧的移动手势中提取SURF特征，通过分析SURF点之间的相关性，对26个手势进行分类，准确率达到84.6%。论文[15]对SIFT和SURF的计算性能进行了比较，因为特征提取通常是一个计算量很大的过程，因此该技术的效率起着重要的作用。结果表明，SURF具有较快的处理速度。主成分分析(PCA)是手势识别研究中常用的另一种特征提取方法。论文[18]将PCA与峰度位置和链码一起提取，其中峰度位置用于寻找边缘，链码用于跟踪手部轨迹。这三种特征的混合表现优于单独使用任何特征，错误率为10.91%。基于凸性缺陷特征的基于模型的手语识别在该领域的研究中得到了广泛的应用[19-30]。国际计算机与软件工程学报，Ming Jin Cheok1，*， Zaid Omar1 and Mohamed H. Jaward2 1马来西亚理工大学，Skudai, 81300，马来西亚2莫纳什大学工程学院，Subang Jaya, 47500，马来西亚国际计算机与软件工程学报，ISSN: 2456-4451卷5。2020. 158 Cheok等人，。计算机工程，2020，(5):158 https://doi.org/10.15344/2456-4451/2020/158

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Sign Language Recognition Using Convexity Defects and Hough Line

This paper presents a novel feature extraction framework for sign language recognition application to assist the deaf and speech-impaired in daily communication. Our method extracts visual features from the hand gesture using convexity defects, K-curvature and Hough line which can differentiate visually similar signs. In segmentation stage, Canny edge detection and skin color segmentation with histogram backprojection method are used to extract the hand region from the background. The features to be extracted are categorized into shape, orientation and motion. The shape features of the sign language will then be extracted using convexity defect, K-curvature and Hough line techniques. The orientation feature will be extracted by calculating the palm center to wrist angle. The trajectory motion of the dynamic gesture is extracted using the chain code method. Lastly, Decision Tree classification is employed in classification of both static and dynamic gestures. The proposed framework is carried out in smartphone platform to recognize 26 alphabets, 10 numbers and eight dynamic American Sign Language (ASL). The average accuracy of 29 static ASL achieved is 85.72% and average accuracy of 10 dynamic ASL achieved is 77%. Through this research, it is found that the proposed framework can recognize larger sign languages database as compared with previous convexity defect-based sign language recognition research. *Corresponding Author: Ming Jin Cheok, Universiti Teknologi Malaysia, Skudai, 81300, Malaysia; E-mail: mingjin_91@hotmail.com Citation: Cheok MJ, Omar Z, Jaward MH (2020) Sign Language Recognition Using Convexity Defects and Hough Line. Int J Comput Softw Eng 5: 158. doi: https:// doi.org/10.15344/2456-4451/2020/158 Copyright: © 2020 Cheok et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Some notable appearance-based feature extraction method includes the extraction of Shift-Invariant Feature Transform (SIFT) features from the gesture. For example, in paper [13], SIFT features are extracted from six signs and the features are rotational invariant. Research in [14,15] also extracted SIFT features from the hand gestures, and the features are simplified by first quantizing with K-means clustering and then mapped into Bag-of-Features (BoF). Representing SIFT features using BoF reduces and uniforms the dimensionality of each SIFT features extracted. Speeded Up Robust Feature (SURF) is another notable feature extraction method proposed in [16]. The authors in paper [17] extracted SURF features from a moving hand gestures of consecutive frames, and by analyzing correlation between SURF points, classification of 26 gestures achieves 84.6% accuracy. Computational performance of SIFT and SURF has been compared in paper [15], as the extraction of features is usually a computationally heavy process and hence the efficiency of the technique plays an important role. It is showed that SURF has a faster processing speed. Principle Component Analysis (PCA) is another commonly extracted features in hand gesture recognition research. In paper [18], PCA is extracted together with kurtosis position and chain code, where kurtosis position is used in finding edges and chain code is used in tracking the hand trajectory. The hybrid of these three features has shown to outperform any features alone with an error rate of 10.91%. Model-based sign language recognition based on convexity defect features has been widely applied in research in this field [19-30]. The advantage of using this approach as compared to other appearanceInternational Journal of Computer & Software Engineering Ming Jin Cheok1,*, Zaid Omar1 and Mohamed H. Jaward2 1Universiti Teknologi Malaysia, Skudai, 81300, Malaysia 2School of Engineering, Monash University Subang Jaya, 47500, Malaysia Int J Comput Softw Eng IJCSE, an open access journal ISSN: 2456-4451 Volume 5. 2020. 158 Cheok et al,. Int J Comput Softw Eng 2020, 5: 158 https://doi.org/10.15344/2456-4451/2020/158

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Software Engineering and Computer Systems

自引率

0.00%

发文量