基于计算机视觉的人机手势交互技术研究

He Guo, Rui Zhang, Y. Li, Ying Cheng, Peng Xia
{"title":"基于计算机视觉的人机手势交互技术研究","authors":"He Guo, Rui Zhang, Y. Li, Ying Cheng, Peng Xia","doi":"10.1109/IAEAC54830.2022.9929908","DOIUrl":null,"url":null,"abstract":"With the development of human-computer interaction technology, gesture recognition is becoming more and more important. At the same time, due to the rapid development of automotive intelligence, the introduction of human-computer interaction technology into intelligent vehicles has increasingly become an important work. Aiming at the problems of low accuracy, low recognition efficiency and weak anti-interference ability of previous gesture recognition applications in driving scenes. This paper presents an improved yolov5 algorithm. By adding the improved and optimized k-means++ clustering optimization algorithm, the problems of unstable clustering effect of K-means clustering algorithm in yolov5 model and slow convergence to large-scale data are solved. In addition, by combining the C3 module in the backbone network with the attention mechanism (CBAM), the effect of target gesture recognition under complex background is improved. Finally, the latest optimization method of loss function (EIOU) is added to the algorithm model to improve the accuracy of training convergence. The average recognition accuracy of the algorithm proposed in this paper is 4.8% higher than 88.19% of the original yolov5s algorithm when the intersection to union ratio threshold is 0.5 to 0.95. The practical availability of the improved gesture recognition algorithm is verified by the simulation scene based on ROS (robot operating system) and unity.","PeriodicalId":349113,"journal":{"name":"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )","volume":"25 8","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Research on human-vehicle gesture interaction technology based on computer visionbility\",\"authors\":\"He Guo, Rui Zhang, Y. Li, Ying Cheng, Peng Xia\",\"doi\":\"10.1109/IAEAC54830.2022.9929908\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of human-computer interaction technology, gesture recognition is becoming more and more important. At the same time, due to the rapid development of automotive intelligence, the introduction of human-computer interaction technology into intelligent vehicles has increasingly become an important work. Aiming at the problems of low accuracy, low recognition efficiency and weak anti-interference ability of previous gesture recognition applications in driving scenes. This paper presents an improved yolov5 algorithm. By adding the improved and optimized k-means++ clustering optimization algorithm, the problems of unstable clustering effect of K-means clustering algorithm in yolov5 model and slow convergence to large-scale data are solved. In addition, by combining the C3 module in the backbone network with the attention mechanism (CBAM), the effect of target gesture recognition under complex background is improved. Finally, the latest optimization method of loss function (EIOU) is added to the algorithm model to improve the accuracy of training convergence. The average recognition accuracy of the algorithm proposed in this paper is 4.8% higher than 88.19% of the original yolov5s algorithm when the intersection to union ratio threshold is 0.5 to 0.95. The practical availability of the improved gesture recognition algorithm is verified by the simulation scene based on ROS (robot operating system) and unity.\",\"PeriodicalId\":349113,\"journal\":{\"name\":\"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )\",\"volume\":\"25 8\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IAEAC54830.2022.9929908\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAEAC54830.2022.9929908","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

随着人机交互技术的发展,手势识别变得越来越重要。同时,由于汽车智能化的快速发展,将人机交互技术引入智能汽车越来越成为一项重要的工作。针对以往驾驶场景中手势识别应用准确率低、识别效率低、抗干扰能力弱的问题。本文提出了一种改进的yolov5算法。通过加入改进优化的k-means++聚类优化算法,解决了k-means聚类算法在yolov5模型中聚类效果不稳定、对大规模数据收敛速度慢的问题。此外,通过将骨干网络中的C3模块与注意机制(CBAM)相结合,提高了复杂背景下目标手势识别的效果。最后,在算法模型中加入最新的损失函数(EIOU)优化方法,提高训练收敛的精度。当交并比阈值为0.5 ~ 0.95时,本文算法的平均识别准确率比原yolov5s算法的88.19%提高了4.8%。通过基于ROS(机器人操作系统)和unity的仿真场景验证了改进手势识别算法的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Research on human-vehicle gesture interaction technology based on computer visionbility
With the development of human-computer interaction technology, gesture recognition is becoming more and more important. At the same time, due to the rapid development of automotive intelligence, the introduction of human-computer interaction technology into intelligent vehicles has increasingly become an important work. Aiming at the problems of low accuracy, low recognition efficiency and weak anti-interference ability of previous gesture recognition applications in driving scenes. This paper presents an improved yolov5 algorithm. By adding the improved and optimized k-means++ clustering optimization algorithm, the problems of unstable clustering effect of K-means clustering algorithm in yolov5 model and slow convergence to large-scale data are solved. In addition, by combining the C3 module in the backbone network with the attention mechanism (CBAM), the effect of target gesture recognition under complex background is improved. Finally, the latest optimization method of loss function (EIOU) is added to the algorithm model to improve the accuracy of training convergence. The average recognition accuracy of the algorithm proposed in this paper is 4.8% higher than 88.19% of the original yolov5s algorithm when the intersection to union ratio threshold is 0.5 to 0.95. The practical availability of the improved gesture recognition algorithm is verified by the simulation scene based on ROS (robot operating system) and unity.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信