Improved Convolutional 3D Networks for Micro-Movements Recognition

Rui Yuan, Lihua Zhang
{"title":"Improved Convolutional 3D Networks for Micro-Movements Recognition","authors":"Rui Yuan, Lihua Zhang","doi":"10.1109/cmvit57620.2023.00026","DOIUrl":null,"url":null,"abstract":"It is of great significance for computers to recognize the actions in videos. The human body’s action recognition has been applied in many fields. The majority of action recognition methods have relatively low precision in recognizing micro-movements. In some specific scenarios, tasks such as intelligent home companionship for the elderly and early warning for dangerous driving behaviors, the micro-actions of the observed are extremely important in the recognition task. At the same time, due to the physiological characteristics of the elderly or the limitation of the environment, the amplitude of the actions is relatively small. This research suggests an action recognition method based on deep learning to better analyze micro-movements-oriented action recognition. Inspired by transformer, we split an image into fixed-size patches. The network structure of C3D is improved. The idea of image patch is introduced to reduce the receptive field of each region in the video frame. Finally, the experimental verification is performed on two action recognition datasets, UCF101 and NTU. The average accuracies on UCF101 and NTU respectively are 91.74% and 88.01%, which show that the proposed algorithm can effectively improve the recognition ability of micro-movements and obtain better results compared with other baselines.","PeriodicalId":191655,"journal":{"name":"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cmvit57620.2023.00026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

It is of great significance for computers to recognize the actions in videos. The human body’s action recognition has been applied in many fields. The majority of action recognition methods have relatively low precision in recognizing micro-movements. In some specific scenarios, tasks such as intelligent home companionship for the elderly and early warning for dangerous driving behaviors, the micro-actions of the observed are extremely important in the recognition task. At the same time, due to the physiological characteristics of the elderly or the limitation of the environment, the amplitude of the actions is relatively small. This research suggests an action recognition method based on deep learning to better analyze micro-movements-oriented action recognition. Inspired by transformer, we split an image into fixed-size patches. The network structure of C3D is improved. The idea of image patch is introduced to reduce the receptive field of each region in the video frame. Finally, the experimental verification is performed on two action recognition datasets, UCF101 and NTU. The average accuracies on UCF101 and NTU respectively are 91.74% and 88.01%, which show that the proposed algorithm can effectively improve the recognition ability of micro-movements and obtain better results compared with other baselines.
微运动识别的改进卷积三维网络
计算机对视频中的动作进行识别具有重要意义。人体动作识别在许多领域都有应用。大多数动作识别方法在识别微动作时精度较低。在一些特定场景中,如智能居家陪伴老人、危险驾驶行为预警等任务中,被观察者的微动作在识别任务中是极其重要的。同时,由于老年人的生理特点或环境的限制,动作的幅度相对较小。为了更好地分析面向微动作的动作识别,本研究提出了一种基于深度学习的动作识别方法。受transformer的启发,我们将图像分割成固定大小的小块。改进了C3D的网络结构。引入图像patch的思想,减小视频帧中各区域的接受野。最后,在UCF101和NTU两个动作识别数据集上进行了实验验证。在UCF101和NTU上的平均准确率分别为91.74%和88.01%,表明本文算法可以有效提高微运动的识别能力,与其他基线相比效果更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信