{"title":"基于更快R-CNN网络的3D手势识别方法*","authors":"Hua Chai, M. Fei, Aolei Yang, Ling Chen","doi":"10.1109/ANZCC.2018.8606613","DOIUrl":null,"url":null,"abstract":"The traditional static gesture recognition algorithm is easily affected by the complex environment which can cause low recognition rate. Therefore, an improved Faster R-CNN network is adopted to improve the gesture recognition accuracy based on 3D TOF (Time of Flight) camera. The contribution of this paper includes: 1) The group own TOF camera is used for data acquisition of 300 images for each of the five gestures, and the images collected by the TOF camera can reduce the effect of complex environments and provide depth information of gestures; 2) The gesture classification is improved by adding the Inception v3 in the Faster R-CNN. Experiments are carried out to show that the improved network has higher accuracy and stronger robustness, compared with traditional Faster R-CNN, and the localization of gestures in space is accurate with a small number of datasets.","PeriodicalId":358801,"journal":{"name":"2018 Australian & New Zealand Control Conference (ANZCC)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"3D Gesture Recognition Method Based on Faster R-CNN Network*\",\"authors\":\"Hua Chai, M. Fei, Aolei Yang, Ling Chen\",\"doi\":\"10.1109/ANZCC.2018.8606613\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The traditional static gesture recognition algorithm is easily affected by the complex environment which can cause low recognition rate. Therefore, an improved Faster R-CNN network is adopted to improve the gesture recognition accuracy based on 3D TOF (Time of Flight) camera. The contribution of this paper includes: 1) The group own TOF camera is used for data acquisition of 300 images for each of the five gestures, and the images collected by the TOF camera can reduce the effect of complex environments and provide depth information of gestures; 2) The gesture classification is improved by adding the Inception v3 in the Faster R-CNN. Experiments are carried out to show that the improved network has higher accuracy and stronger robustness, compared with traditional Faster R-CNN, and the localization of gestures in space is accurate with a small number of datasets.\",\"PeriodicalId\":358801,\"journal\":{\"name\":\"2018 Australian & New Zealand Control Conference (ANZCC)\",\"volume\":\"54 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Australian & New Zealand Control Conference (ANZCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ANZCC.2018.8606613\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Australian & New Zealand Control Conference (ANZCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ANZCC.2018.8606613","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
传统的静态手势识别算法容易受到复杂环境的影响,导致识别率较低。因此,采用改进的Faster R-CNN网络来提高基于3D TOF (Time of Flight)相机的手势识别精度。本文的贡献包括:1)使用集团自有的TOF相机对五种手势各采集300幅图像,TOF相机采集的图像可以减少复杂环境的影响,提供手势的深度信息;2)通过在Faster R-CNN中加入Inception v3来改进手势分类。实验表明,与传统的Faster R-CNN相比,改进后的网络具有更高的精度和更强的鲁棒性,并且在少量数据集下就能准确地定位手势在空间中的位置。
3D Gesture Recognition Method Based on Faster R-CNN Network*
The traditional static gesture recognition algorithm is easily affected by the complex environment which can cause low recognition rate. Therefore, an improved Faster R-CNN network is adopted to improve the gesture recognition accuracy based on 3D TOF (Time of Flight) camera. The contribution of this paper includes: 1) The group own TOF camera is used for data acquisition of 300 images for each of the five gestures, and the images collected by the TOF camera can reduce the effect of complex environments and provide depth information of gestures; 2) The gesture classification is improved by adding the Inception v3 in the Faster R-CNN. Experiments are carried out to show that the improved network has higher accuracy and stronger robustness, compared with traditional Faster R-CNN, and the localization of gestures in space is accurate with a small number of datasets.