Spatial-Temporal Local Augmentation Graph Convolutional Networks

2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT) Pub Date : 2023-06-16 DOI:10.1109/AINIT59027.2023.10212579

Siyu Chen, Huahu Xu, Cheng Chen, Zhe Zhu



Abstract

Skeleton-based action recognition has received wide attention in computer vision in recent years. Most previous methods track only the trajectory of the same joint across frames and give little consideration to correlations between different joints during motion, and many current action recognition models do not adequately model local relationships. This paper therefore constructs a more generalized spatial-temporal skeleton graph that accounts for the inter-frame dependence of neighboring skeletons, and introduces a local enhancement module. The module performs local aggregation at each node, combining the node's own features with the aggregated features of its neighboring nodes, so as to better capture the local relationships between nodes. By combining global and local information, the model provides a more comprehensive feature representation, which improves its performance. Introducing local relationships also makes the model more flexible and more sensitive to detail. Finally, we validate the proposed Spatial-Temporal Local Augmentation Graph Convolutional Networks (ST-LAGCN) model on two skeleton datasets, NTU RGB+D and Kinetics, and compare it with several state-of-the-art graph neural network models for action recognition; it shows improved performance on both datasets.
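The abstract does not give the exact formulation, so the following is only a minimal NumPy sketch of the two ideas it names: a skeleton graph with inter-frame edges, and a local aggregation step that combines each node's own features with the aggregated features of its neighbors. The function names (`st_adjacency`, `local_augment`), the choice to link each joint to the same joint and its bone neighbors in adjacent frames, the mean aggregator, and the additive combination with weights `w_self` and `w_neigh` are all illustrative assumptions, not the authors' ST-LAGCN implementation.

```python
import numpy as np

def st_adjacency(a_spatial, num_frames):
    """Hypothetical spatial-temporal adjacency over num_frames skeleton frames.

    a_spatial: (V, V) symmetric 0/1 bone adjacency without self-loops.
    Node (t, v) is stored at index t * V + v.
    Assumption: a joint is linked to the same joint and to its bone
    neighbors in the previous and next frame.
    """
    v = a_spatial.shape[0]
    # Within-frame bone edges: block-diagonal copies of a_spatial.
    intra = np.kron(np.eye(num_frames), a_spatial)
    # Frame-to-frame links between t and t +/- 1.
    shift = np.eye(num_frames, k=1) + np.eye(num_frames, k=-1)
    inter = np.kron(shift, a_spatial + np.eye(v))
    return intra + inter

def local_augment(x, adj, w_self, w_neigh):
    """Combine each node's own features with the mean of its neighbors.

    x: (N, C) node features; adj: (N, N) 0/1 adjacency without self-loops;
    w_self, w_neigh: (C, C_out) weight matrices (assumed, would be learned).
    """
    deg = np.maximum(adj.sum(axis=1, keepdims=True), 1)  # guard isolated nodes
    neigh_mean = (adj @ x) / deg                         # local aggregation
    return x @ w_self + neigh_mean @ w_neigh             # self + neighborhood

# Toy example: a 3-joint chain 0 - 1 - 2 with 2-dimensional features.
chain = np.array([[0, 1, 0],
                  [1, 0, 1],
                  [0, 1, 0]], dtype=float)
features = np.array([[1.0, 0.0],
                     [0.0, 1.0],
                     [1.0, 1.0]])
augmented = local_augment(features, chain, np.eye(2), np.eye(2))
graph_two_frames = st_adjacency(chain, num_frames=2)  # shape (6, 6)
```

With identity weights, the middle joint's output is its own feature plus the average of its two neighbors, which makes the effect of the local term easy to inspect. A learned model would replace the identity matrices with trained weights and might use a different aggregator.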



