Yiyang Zhang, X. Pu, Xiaolu Wang, Haopeng Guo, Ke Liu, Qian-Ying Yang, Lili Wang
{"title":"Design concept of sign language recognition translation and gesture recognition control system based on deep learning and machine vision","authors":"Yiyang Zhang, X. Pu, Xiaolu Wang, Haopeng Guo, Ke Liu, Qian-Ying Yang, Lili Wang","doi":"10.1117/12.2653702","DOIUrl":null,"url":null,"abstract":"With the development of society, gestures are used in many aspects, but the computer's functionality for gesture recognition is still to be improved. This article is mainly a preliminary idea of a basic gesture recognition system built based on the existing Google deep learning framework TensorFlow and gesture recognition components in MediaPipe and OpenCv machine vision open-source library. The training dataset is first subjected to skeleton key point coordinate extraction, then the pre-processed dataset is used to train the neural network and constitute the preliminary model, and finally the model is corrected and changed in the end.","PeriodicalId":253792,"journal":{"name":"Conference on Optics and Communication Technology","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Optics and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2653702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the development of society, gestures are used in many aspects, but the computer's functionality for gesture recognition is still to be improved. This article is mainly a preliminary idea of a basic gesture recognition system built based on the existing Google deep learning framework TensorFlow and gesture recognition components in MediaPipe and OpenCv machine vision open-source library. The training dataset is first subjected to skeleton key point coordinate extraction, then the pre-processed dataset is used to train the neural network and constitute the preliminary model, and finally the model is corrected and changed in the end.