复杂环境下基于深度学习的实时手势识别

Weixin Wu, Meiping Shi, Tao Wu, Dawei Zhao, Shuai Zhang, Junxiang Li
{"title":"复杂环境下基于深度学习的实时手势识别","authors":"Weixin Wu, Meiping Shi, Tao Wu, Dawei Zhao, Shuai Zhang, Junxiang Li","doi":"10.1109/CCDC.2019.8833328","DOIUrl":null,"url":null,"abstract":"Real-time hand gesture recognition in complex environments has many challenges, such as poor real-time performances and robustness to environmental changes. This paper takes the hand gesture control of the unmanned vehicle as the application background, and focuses on the gesture detection and recognition of video streams based on deep learning in the complex environment. In this paper, we detect the hand in a complex environment by training the ssd_mobilenet model, and initialize the tracking with kalman filter. Then, we detect the hand keypoints by following the architecture of Convolutional Pose Machines (CPMs), in order to obtain the belief maps for all keypoints that are used as the train sets of Convolutional Neural Networks (CNNs). Finally, based on the results obtained by our classification, this paper proposes a method of multi-frame recursion to minimize the influences of redundant frames and error frames. In this paper, eight kinds of gestures for controlling vehicle are identified. The experimental results show that our method can successfully realize real-time hand gesture recognition in the video streams. The recognition accuracy can reach 96.7%, and the average recognition speed reaches 12 fps, which basically meets the real-time requirements and successfully applys to mobile terminals such as TX2 for engineering practice.","PeriodicalId":254705,"journal":{"name":"2019 Chinese Control And Decision Conference (CCDC)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Real-time Hand Gesture Recognition Based on Deep Learning in Complex Environments\",\"authors\":\"Weixin Wu, Meiping Shi, Tao Wu, Dawei Zhao, Shuai Zhang, Junxiang Li\",\"doi\":\"10.1109/CCDC.2019.8833328\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Real-time hand gesture recognition in complex environments has many challenges, such as poor real-time performances and robustness to environmental changes. This paper takes the hand gesture control of the unmanned vehicle as the application background, and focuses on the gesture detection and recognition of video streams based on deep learning in the complex environment. In this paper, we detect the hand in a complex environment by training the ssd_mobilenet model, and initialize the tracking with kalman filter. Then, we detect the hand keypoints by following the architecture of Convolutional Pose Machines (CPMs), in order to obtain the belief maps for all keypoints that are used as the train sets of Convolutional Neural Networks (CNNs). Finally, based on the results obtained by our classification, this paper proposes a method of multi-frame recursion to minimize the influences of redundant frames and error frames. In this paper, eight kinds of gestures for controlling vehicle are identified. The experimental results show that our method can successfully realize real-time hand gesture recognition in the video streams. The recognition accuracy can reach 96.7%, and the average recognition speed reaches 12 fps, which basically meets the real-time requirements and successfully applys to mobile terminals such as TX2 for engineering practice.\",\"PeriodicalId\":254705,\"journal\":{\"name\":\"2019 Chinese Control And Decision Conference (CCDC)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Chinese Control And Decision Conference (CCDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCDC.2019.8833328\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Chinese Control And Decision Conference (CCDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCDC.2019.8833328","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

复杂环境下的实时手势识别存在实时性差、对环境变化的鲁棒性差等问题。本文以无人驾驶汽车的手势控制为应用背景,重点研究复杂环境下基于深度学习的视频流手势检测与识别。在本文中,我们通过训练ssd_mobilenet模型来检测复杂环境中的手,并使用卡尔曼滤波初始化跟踪。然后,我们按照卷积姿态机(Convolutional Pose Machines, cpm)的架构检测手部关键点,从而得到所有关键点的信念映射,作为卷积神经网络(Convolutional Neural Networks, cnn)的训练集。最后,基于我们的分类结果,本文提出了一种多帧递归方法,以最小化冗余帧和错误帧的影响。本文识别了八种用于车辆控制的手势。实验结果表明,该方法可以成功地实现视频流中的实时手势识别。识别精度可达96.7%,平均识别速度达到12fps,基本满足实时性要求,成功应用于TX2等移动终端进行工程实践。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Real-time Hand Gesture Recognition Based on Deep Learning in Complex Environments
Real-time hand gesture recognition in complex environments has many challenges, such as poor real-time performances and robustness to environmental changes. This paper takes the hand gesture control of the unmanned vehicle as the application background, and focuses on the gesture detection and recognition of video streams based on deep learning in the complex environment. In this paper, we detect the hand in a complex environment by training the ssd_mobilenet model, and initialize the tracking with kalman filter. Then, we detect the hand keypoints by following the architecture of Convolutional Pose Machines (CPMs), in order to obtain the belief maps for all keypoints that are used as the train sets of Convolutional Neural Networks (CNNs). Finally, based on the results obtained by our classification, this paper proposes a method of multi-frame recursion to minimize the influences of redundant frames and error frames. In this paper, eight kinds of gestures for controlling vehicle are identified. The experimental results show that our method can successfully realize real-time hand gesture recognition in the video streams. The recognition accuracy can reach 96.7%, and the average recognition speed reaches 12 fps, which basically meets the real-time requirements and successfully applys to mobile terminals such as TX2 for engineering practice.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信