基于混合深度学习模型和Shearlet变换的人体动作识别

2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE) Pub Date : 2020-10-06 DOI:10.1109/ICITEE49829.2020.9271687

Nemir Al-Azzawi

{"title":"基于混合深度学习模型和Shearlet变换的人体动作识别","authors":"Nemir Al-Azzawi","doi":"10.1109/ICITEE49829.2020.9271687","DOIUrl":null,"url":null,"abstract":"The hybrid deep learning model has become common in all recent studies dealing with machine vision and human action recognition. Most of the accuracy in revealing knowledge of machine vision is in extracting important features, including segmentation of the image. This paper proposes a new model for recognizing human actions from video sequences by integrating repetitive, gated recurrent neural networks across multiple scales with shearlet-based image segmentation extraction. Segmentations are the most critical information to distinguish human action. The feature extraction can impact the complexity of the calculation and the performance of the algorithm. The idea is to increase training robustness and improve segmentation through the use of the shearlet transform. Hence, the video classification based on a recurrent neural network and shearlet transform will work optimally. The proposed approach is evaluated on human activity videos using KTH, UCF-101, and UCF Sports Action datasets. The experimental results showed state-of-the-art performance in comparison to current methods. The average resulting classification accuracy is 95.1% for the KTH datasets. That was the optimal case in our proposed model reached.","PeriodicalId":245013,"journal":{"name":"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Human Action Recognition based on Hybrid Deep Learning Model and Shearlet Transform\",\"authors\":\"Nemir Al-Azzawi\",\"doi\":\"10.1109/ICITEE49829.2020.9271687\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The hybrid deep learning model has become common in all recent studies dealing with machine vision and human action recognition. Most of the accuracy in revealing knowledge of machine vision is in extracting important features, including segmentation of the image. This paper proposes a new model for recognizing human actions from video sequences by integrating repetitive, gated recurrent neural networks across multiple scales with shearlet-based image segmentation extraction. Segmentations are the most critical information to distinguish human action. The feature extraction can impact the complexity of the calculation and the performance of the algorithm. The idea is to increase training robustness and improve segmentation through the use of the shearlet transform. Hence, the video classification based on a recurrent neural network and shearlet transform will work optimally. The proposed approach is evaluated on human activity videos using KTH, UCF-101, and UCF Sports Action datasets. The experimental results showed state-of-the-art performance in comparison to current methods. The average resulting classification accuracy is 95.1% for the KTH datasets. That was the optimal case in our proposed model reached.\",\"PeriodicalId\":245013,\"journal\":{\"name\":\"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICITEE49829.2020.9271687\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEE49829.2020.9271687","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

混合深度学习模型在最近的机器视觉和人类行为识别研究中已经变得非常普遍。机器视觉揭示知识的准确性主要在于提取重要特征，包括图像的分割。本文提出了一种将重复、门控、多尺度递归神经网络与基于shearlet的图像分割提取相结合，从视频序列中识别人类动作的新模型。分割是区分人类行为最关键的信息。特征提取会影响计算的复杂度和算法的性能。其思想是通过使用shearlet变换来提高训练鲁棒性和改进分割。因此，基于循环神经网络和shearlet变换的视频分类效果最佳。我们使用KTH、UCF-101和UCF Sports Action数据集对人类活动视频进行了评估。实验结果表明，与现有方法相比，该方法具有最先进的性能。KTH数据集的平均分类准确率为95.1%。这是我们提出的模型所达到的最优情况。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Human Action Recognition based on Hybrid Deep Learning Model and Shearlet Transform

The hybrid deep learning model has become common in all recent studies dealing with machine vision and human action recognition. Most of the accuracy in revealing knowledge of machine vision is in extracting important features, including segmentation of the image. This paper proposes a new model for recognizing human actions from video sequences by integrating repetitive, gated recurrent neural networks across multiple scales with shearlet-based image segmentation extraction. Segmentations are the most critical information to distinguish human action. The feature extraction can impact the complexity of the calculation and the performance of the algorithm. The idea is to increase training robustness and improve segmentation through the use of the shearlet transform. Hence, the video classification based on a recurrent neural network and shearlet transform will work optimally. The proposed approach is evaluated on human activity videos using KTH, UCF-101, and UCF Sports Action datasets. The experimental results showed state-of-the-art performance in comparison to current methods. The average resulting classification accuracy is 95.1% for the KTH datasets. That was the optimal case in our proposed model reached.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 12th International Conference on Information Technology and Electrical Engineering (ICITEE)

自引率

0.00%

发文量