基于混合深度学习模型和Shearlet变换的人体动作识别

Nemir Al-Azzawi
{"title":"基于混合深度学习模型和Shearlet变换的人体动作识别","authors":"Nemir Al-Azzawi","doi":"10.1109/ICITEE49829.2020.9271687","DOIUrl":null,"url":null,"abstract":"The hybrid deep learning model has become common in all recent studies dealing with machine vision and human action recognition. Most of the accuracy in revealing knowledge of machine vision is in extracting important features, including segmentation of the image. This paper proposes a new model for recognizing human actions from video sequences by integrating repetitive, gated recurrent neural networks across multiple scales with shearlet-based image segmentation extraction. Segmentations are the most critical information to distinguish human action. The feature extraction can impact the complexity of the calculation and the performance of the algorithm. The idea is to increase training robustness and improve segmentation through the use of the shearlet transform. Hence, the video classification based on a recurrent neural network and shearlet transform will work optimally. The proposed approach is evaluated on human activity videos using KTH, UCF-101, and UCF Sports Action datasets. The experimental results showed state-of-the-art performance in comparison to current methods. The average resulting classification accuracy is 95.1% for the KTH datasets. That was the optimal case in our proposed model reached.","PeriodicalId":245013,"journal":{"name":"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Human Action Recognition based on Hybrid Deep Learning Model and Shearlet Transform\",\"authors\":\"Nemir Al-Azzawi\",\"doi\":\"10.1109/ICITEE49829.2020.9271687\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The hybrid deep learning model has become common in all recent studies dealing with machine vision and human action recognition. Most of the accuracy in revealing knowledge of machine vision is in extracting important features, including segmentation of the image. This paper proposes a new model for recognizing human actions from video sequences by integrating repetitive, gated recurrent neural networks across multiple scales with shearlet-based image segmentation extraction. Segmentations are the most critical information to distinguish human action. The feature extraction can impact the complexity of the calculation and the performance of the algorithm. The idea is to increase training robustness and improve segmentation through the use of the shearlet transform. Hence, the video classification based on a recurrent neural network and shearlet transform will work optimally. The proposed approach is evaluated on human activity videos using KTH, UCF-101, and UCF Sports Action datasets. The experimental results showed state-of-the-art performance in comparison to current methods. The average resulting classification accuracy is 95.1% for the KTH datasets. That was the optimal case in our proposed model reached.\",\"PeriodicalId\":245013,\"journal\":{\"name\":\"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICITEE49829.2020.9271687\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEE49829.2020.9271687","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

混合深度学习模型在最近的机器视觉和人类行为识别研究中已经变得非常普遍。机器视觉揭示知识的准确性主要在于提取重要特征,包括图像的分割。本文提出了一种将重复、门控、多尺度递归神经网络与基于shearlet的图像分割提取相结合,从视频序列中识别人类动作的新模型。分割是区分人类行为最关键的信息。特征提取会影响计算的复杂度和算法的性能。其思想是通过使用shearlet变换来提高训练鲁棒性和改进分割。因此,基于循环神经网络和shearlet变换的视频分类效果最佳。我们使用KTH、UCF-101和UCF Sports Action数据集对人类活动视频进行了评估。实验结果表明,与现有方法相比,该方法具有最先进的性能。KTH数据集的平均分类准确率为95.1%。这是我们提出的模型所达到的最优情况。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Human Action Recognition based on Hybrid Deep Learning Model and Shearlet Transform
The hybrid deep learning model has become common in all recent studies dealing with machine vision and human action recognition. Most of the accuracy in revealing knowledge of machine vision is in extracting important features, including segmentation of the image. This paper proposes a new model for recognizing human actions from video sequences by integrating repetitive, gated recurrent neural networks across multiple scales with shearlet-based image segmentation extraction. Segmentations are the most critical information to distinguish human action. The feature extraction can impact the complexity of the calculation and the performance of the algorithm. The idea is to increase training robustness and improve segmentation through the use of the shearlet transform. Hence, the video classification based on a recurrent neural network and shearlet transform will work optimally. The proposed approach is evaluated on human activity videos using KTH, UCF-101, and UCF Sports Action datasets. The experimental results showed state-of-the-art performance in comparison to current methods. The average resulting classification accuracy is 95.1% for the KTH datasets. That was the optimal case in our proposed model reached.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信