{"title":"SlowFast-GCN: A Novel Skeleton-Based Action Recognition Framework","authors":"Cheng-Hung Lin, Po-Yung Chou, Cheng-Hsien Lin, Min-Yen Tsai","doi":"10.1109/ICPAI51961.2020.00039","DOIUrl":null,"url":null,"abstract":"Human action recognition plays an important role in video surveillance, human-computer interaction, video understanding, and virtual reality. Different from two-dimensional object recognition, human action recognition is a dynamic object recognition with a time series relationship, and it faces many challenges from complex environments, such as color shift, light and shadow changes, and sampling angles. In order to improve the accuracy of human action recognition, many studies have proposed skeleton-based action recognition methods that are not affected by the background, but the current framework does not have much discussion on the integration of the time dimension.In this paper, we propose a novel SlowFast-GCN framework which combines the advantages of ST-GCN and SlowFastNet with dynamic human skeleton to improve the accuracy of human action recognition. The proposed framework uses two streams, one stream captures fine-grained motion changes, and the other stream captures static semantics. Through these two streams, we can merge the human skeleton features from two different time dimensions. Experimental results show that the proposed framework outperforms to state-of-the-art approaches on the NTU-RGBD dataset.","PeriodicalId":330198,"journal":{"name":"2020 International Conference on Pervasive Artificial Intelligence (ICPAI)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Pervasive Artificial Intelligence (ICPAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPAI51961.2020.00039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Human action recognition plays an important role in video surveillance, human-computer interaction, video understanding, and virtual reality. Different from two-dimensional object recognition, human action recognition is a dynamic object recognition with a time series relationship, and it faces many challenges from complex environments, such as color shift, light and shadow changes, and sampling angles. In order to improve the accuracy of human action recognition, many studies have proposed skeleton-based action recognition methods that are not affected by the background, but the current framework does not have much discussion on the integration of the time dimension.In this paper, we propose a novel SlowFast-GCN framework which combines the advantages of ST-GCN and SlowFastNet with dynamic human skeleton to improve the accuracy of human action recognition. The proposed framework uses two streams, one stream captures fine-grained motion changes, and the other stream captures static semantics. Through these two streams, we can merge the human skeleton features from two different time dimensions. Experimental results show that the proposed framework outperforms to state-of-the-art approaches on the NTU-RGBD dataset.