Static hand gesture recognition using stacked Denoising Sparse Autoencoders
Varun Kumar, G. Nandi, R. Kala
2014 Seventh International Conference on Contemporary Computing (IC3), 2014-09-15
DOI: 10.1109/IC3.2014.6897155 (https://doi.org/10.1109/IC3.2014.6897155)
Citations: 16
Abstract
Since the advent of personal computers, people have wanted to communicate with them either in natural language or through gestures. This gave rise to the field of Human-Computer Interaction and its subfield, Automatic Sign Language Recognition. This paper proposes a method for automatic feature extraction from hand images. Five stacked Denoising Sparse Autoencoders (DSAE), trained in an unsupervised fashion, extract features from each image, and the extracted features are then used to train a Softmax classifier that assigns each image to one of 20 classes. The proposed architecture is trained and tested on a standard dataset [1], extended by adding random jitter such as rotation and Gaussian noise. The proposed architecture achieves a performance of 83%, which is better than the benchmark: a shallow neural network trained on hand-engineered features, namely principal components.
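The pipeline described in the abstract can be illustrated with a minimal, untrained sketch: one denoising sparse autoencoder objective, followed by a five-encoder stack feeding a 20-way softmax. The abstract does not give layer sizes, activation functions, the form of the sparsity penalty, or any hyperparameters, so the sigmoid units, the KL-divergence sparsity term, the values of `rho`, `beta`, and `noise_std`, the layer widths, and all function names below are assumptions chosen for illustration, not the authors' implementation.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def corrupt(x, noise_std, rng):
    """Denoising step: add zero-mean Gaussian noise to every input value."""
    return [v + rng.gauss(0.0, noise_std) for v in x]

def encode(x, W, b):
    """One sigmoid layer: sigmoid(W x + b)."""
    return [sigmoid(sum(w * v for w, v in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

def kl_sparsity(rho, rho_hat):
    """KL divergence between target sparsity rho and mean activations rho_hat."""
    return sum(rho * math.log(rho / r)
               + (1 - rho) * math.log((1 - rho) / (1 - r)) for r in rho_hat)

def dsae_loss(batch, W_enc, b_enc, W_dec, b_dec,
              rho=0.05, beta=3.0, noise_std=0.1, rng=None):
    """Reconstruct the CLEAN input from a corrupted copy, plus sparsity penalty.

    rho, beta and noise_std are assumed values, not taken from the paper.
    """
    rng = rng or random.Random(0)
    hidden, recon_err = [], 0.0
    for x in batch:
        h = encode(corrupt(x, noise_std, rng), W_enc, b_enc)
        x_hat = encode(h, W_dec, b_dec)  # sigmoid decoder (assumption)
        recon_err += 0.5 * sum((a - t) ** 2 for a, t in zip(x_hat, x))
        hidden.append(h)
    n = len(batch)
    rho_hat = [sum(h[j] for h in hidden) / n for j in range(len(b_enc))]
    return recon_err / n + beta * kl_sparsity(rho, rho_hat)

def stacked_features(x, layers):
    """Pass an input through each encoder in turn (decoders are discarded)."""
    for W, b in layers:
        x = encode(x, W, b)
    return x

def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def random_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)]
            for _ in range(rows)]

# Demo with random (untrained) weights and hypothetical layer widths.
rng = random.Random(0)
sizes = [16, 12, 10, 8, 6, 5]  # input width followed by five encoder widths
layers = [(random_matrix(sizes[i + 1], sizes[i], rng), [0.0] * sizes[i + 1])
          for i in range(5)]
batch = [[rng.random() for _ in range(sizes[0])] for _ in range(4)]

# Objective of the first autoencoder in the stack.
W_dec = random_matrix(sizes[0], sizes[1], rng)
loss = dsae_loss(batch, layers[0][0], layers[0][1], W_dec, [0.0] * sizes[0])

# Class probabilities over 20 gesture classes from the top-level features.
W_out = random_matrix(20, sizes[-1], rng)
feats = stacked_features(batch[0], layers)
probs = softmax([sum(w * f for w, f in zip(row, feats)) for row in W_out])
```

In a full implementation, each autoencoder would be trained greedily by minimizing `dsae_loss` on the outputs of the layer below it, and the softmax layer would then be fitted on the top-level features using the class labels. The sketch omits training and only shows the forward pass and the objective.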