{"title":"Spatial-Temporal Local Augmentation Graph Convolutional Networks","authors":"Siyu Chen, Huahu Xu, Cheng Chen, Zhe Zhu","doi":"10.1109/AINIT59027.2023.10212579","DOIUrl":null,"url":null,"abstract":"Action recognition based on skeleton models has been widely focused in the field of computer vision in recent years. Most of the previous methods only focus on the change trajectory of the same joint point in the moving process, with less consideration of the correlation between joints in the moving process, and many of the current action recognition models lack sufficient consideration of local relationships, so this paper constructs a more generalized spatial-temporal skeleton graph considering the inter-frame dependence of neighboring skeletons, and introduces a local enhancement module, using the idea of local aggregation on each node for local aggregation, combining the node's own features with the aggregated features of neighboring nodes, so as to better capture the local relationships between nodes. The model can combine global and local information to provide a more comprehensive feature representation, thus improving the performance of the model. The introduction of local relationships can also increase the flexibility and sensitivity to the details of the model. Finally, we validate the proposed Spatial-Temporal Local Augmentation Graph Convolutional Networks (ST-LAGCN) model in two skeleton datasets, NTURGB+D and Kinetics, and compare it with several state-of-the-art graph neural network models for action recognition, both of which show improved performance.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AINIT59027.2023.10212579","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Action recognition based on skeleton models has been widely focused in the field of computer vision in recent years. Most of the previous methods only focus on the change trajectory of the same joint point in the moving process, with less consideration of the correlation between joints in the moving process, and many of the current action recognition models lack sufficient consideration of local relationships, so this paper constructs a more generalized spatial-temporal skeleton graph considering the inter-frame dependence of neighboring skeletons, and introduces a local enhancement module, using the idea of local aggregation on each node for local aggregation, combining the node's own features with the aggregated features of neighboring nodes, so as to better capture the local relationships between nodes. The model can combine global and local information to provide a more comprehensive feature representation, thus improving the performance of the model. The introduction of local relationships can also increase the flexibility and sensitivity to the details of the model. Finally, we validate the proposed Spatial-Temporal Local Augmentation Graph Convolutional Networks (ST-LAGCN) model in two skeleton datasets, NTURGB+D and Kinetics, and compare it with several state-of-the-art graph neural network models for action recognition, both of which show improved performance.