基于骨架动作识别的洗牌图卷积网络

IF 0.8 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Advanced Computational Intelligence and Intelligent Informatics Pub Date : 2023-09-20 DOI:10.20965/jaciii.2023.p0790

Qiwei Yu, Yaping Dai, Kaoru Hirota, Shuai Shao, Wei Dai

{"title":"基于骨架动作识别的洗牌图卷积网络","authors":"Qiwei Yu, Yaping Dai, Kaoru Hirota, Shuai Shao, Wei Dai","doi":"10.20965/jaciii.2023.p0790","DOIUrl":null,"url":null,"abstract":"A shuffle graph convolutional network (Shuffle-GCN) is proposed to recognize human action by analyzing skeleton data. It uses channel split and channel shuffle operations to process multi-feature channels of skeleton data, which reduces the computational cost of graph convolution operation. Compared with the classical two-stream adaptive graph convolutional network model, the proposed method achieves a higher precision with 1/3 of the floating-point operations (FLOPs). Even more, a channel-level topology modeling method is designed to extract more motion information of human skeleton by learning the graph topology from different channels dynamically. The performance of Shuffle-GCN is tested under 56,880 action clips from the NTU RGB+D dataset with the accuracy 96.0% and the computational complexity 12.8 GFLOPs. The proposed method offers feasible solutions for developing practical applications of action recognition.","PeriodicalId":45921,"journal":{"name":"Journal of Advanced Computational Intelligence and Intelligent Informatics","volume":"26 1","pages":"0"},"PeriodicalIF":0.8000,"publicationDate":"2023-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Shuffle Graph Convolutional Network for Skeleton-Based Action Recognition\",\"authors\":\"Qiwei Yu, Yaping Dai, Kaoru Hirota, Shuai Shao, Wei Dai\",\"doi\":\"10.20965/jaciii.2023.p0790\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A shuffle graph convolutional network (Shuffle-GCN) is proposed to recognize human action by analyzing skeleton data. It uses channel split and channel shuffle operations to process multi-feature channels of skeleton data, which reduces the computational cost of graph convolution operation. Compared with the classical two-stream adaptive graph convolutional network model, the proposed method achieves a higher precision with 1/3 of the floating-point operations (FLOPs). Even more, a channel-level topology modeling method is designed to extract more motion information of human skeleton by learning the graph topology from different channels dynamically. The performance of Shuffle-GCN is tested under 56,880 action clips from the NTU RGB+D dataset with the accuracy 96.0% and the computational complexity 12.8 GFLOPs. The proposed method offers feasible solutions for developing practical applications of action recognition.\",\"PeriodicalId\":45921,\"journal\":{\"name\":\"Journal of Advanced Computational Intelligence and Intelligent Informatics\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2023-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Advanced Computational Intelligence and Intelligent Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20965/jaciii.2023.p0790\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advanced Computational Intelligence and Intelligent Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20965/jaciii.2023.p0790","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

提出了一种洗牌图卷积网络(shuffle - gcn)，通过分析骨架数据来识别人体动作。它采用通道分割和通道shuffle操作来处理骨架数据的多特征通道，降低了图卷积运算的计算成本。与经典的双流自适应图卷积网络模型相比，该方法以1/3的浮点运算(FLOPs)实现了更高的精度。在此基础上，设计了通道级拓扑建模方法，通过动态学习不同通道的图拓扑，提取人体骨骼的更多运动信息。在NTU RGB+D数据集的56,880个动作片段上测试了Shuffle-GCN的性能，准确率为96.0%，计算复杂度为12.8 GFLOPs。该方法为动作识别的实际应用提供了可行的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Shuffle Graph Convolutional Network for Skeleton-Based Action Recognition

A shuffle graph convolutional network (Shuffle-GCN) is proposed to recognize human action by analyzing skeleton data. It uses channel split and channel shuffle operations to process multi-feature channels of skeleton data, which reduces the computational cost of graph convolution operation. Compared with the classical two-stream adaptive graph convolutional network model, the proposed method achieves a higher precision with 1/3 of the floating-point operations (FLOPs). Even more, a channel-level topology modeling method is designed to extract more motion information of human skeleton by learning the graph topology from different channels dynamically. The performance of Shuffle-GCN is tested under 56,880 action clips from the NTU RGB+D dataset with the accuracy 96.0% and the computational complexity 12.8 GFLOPs. The proposed method offers feasible solutions for developing practical applications of action recognition.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Advanced Computational Intelligence and Intelligent Informatics COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

1.50

自引率

14.30%

发文量

期刊介绍： JACIII focuses on advanced computational intelligence and intelligent informatics. The topics include, but are not limited to; Fuzzy logic, Fuzzy control, Neural Networks, GA and Evolutionary Computation, Hybrid Systems, Adaptation and Learning Systems, Distributed Intelligent Systems, Network systems, Multi-media, Human interface, Biologically inspired evolutionary systems, Artificial life, Chaos, Complex systems, Fractals, Robotics, Medical applications, Pattern recognition, Virtual reality, Wavelet analysis, Scientific applications, Industrial applications, and Artistic applications.