Shivraj Sharma, H. Pallab Jyoti Dutta, M. Bhuyan, R. Laskar
{"title":"Hand Gesture Localization and Classification by Deep Neural Network for Online Text Entry","authors":"Shivraj Sharma, H. Pallab Jyoti Dutta, M. Bhuyan, R. Laskar","doi":"10.1109/ASPCON49795.2020.9276713","DOIUrl":null,"url":null,"abstract":"Hand gesture recognition is an important aspect of human-computer interaction. A proper hand gesture recognizing system can be used to build a robust text entry system for human-computer interface. This work proposes a real-time hand localization and recognition system. For hand localization, YOLOv3 is used that predicts a bounding box around the hand, and for hand gestures classification, a pretrained VGG16 network is employed. The bounding box regression technique helped localize the ROI (region of interest) and reduced the complexity, that aided in the classification task. The experimental results show that the proposed method is capable of recognizing the gestures with high testing accuracy on three benchmark datasets, namely, ASL (American sign language), Libras and NUS (National University of Singapore) datasets.","PeriodicalId":193814,"journal":{"name":"2020 IEEE Applied Signal Processing Conference (ASPCON)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Applied Signal Processing Conference (ASPCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASPCON49795.2020.9276713","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Hand gesture recognition is an important aspect of human-computer interaction. A proper hand gesture recognizing system can be used to build a robust text entry system for human-computer interface. This work proposes a real-time hand localization and recognition system. For hand localization, YOLOv3 is used that predicts a bounding box around the hand, and for hand gestures classification, a pretrained VGG16 network is employed. The bounding box regression technique helped localize the ROI (region of interest) and reduced the complexity, that aided in the classification task. The experimental results show that the proposed method is capable of recognizing the gestures with high testing accuracy on three benchmark datasets, namely, ASL (American sign language), Libras and NUS (National University of Singapore) datasets.