{"title":"Early Fusion Graph Convolutional Network for Skeleton-Based Action Recognition","authors":"Xiaoxue Zhao, Cuiwei Liu, Xiangbin Shi","doi":"10.1109/mlsp52302.2021.9596448","DOIUrl":null,"url":null,"abstract":"Skeleton-based action recognition has attracted much attention in computer vision. Recently, Graph Convolutional Networks (GCNs) with multi-stream fusion strategies have obtained remarkable performance. Most of these models make decisions of action recognition by merging the prediction scores of multiple streams, while ignoring the complementary properties of different data streams for building representative features. In this paper, we propose a novel Early Fusion Graph Convolutional Network (EF-GCN), which fuses hidden features extracted from multiple skeleton data streams at different levels to enhance the discriminative power of features. Unlike the previous GCN-based models that train networks corresponding to different streams independently, all the subnetworks in the proposed EF-GCN are jointly learned in an end-to-end manner. Experiments conducted on two skeleton datasets (i.e., NTU-RGB+D and NTU-120 RGB+D) show the superior performance of EF-GCN.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/mlsp52302.2021.9596448","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Skeleton-based action recognition has attracted much attention in computer vision. Recently, Graph Convolutional Networks (GCNs) with multi-stream fusion strategies have obtained remarkable performance. Most of these models make decisions of action recognition by merging the prediction scores of multiple streams, while ignoring the complementary properties of different data streams for building representative features. In this paper, we propose a novel Early Fusion Graph Convolutional Network (EF-GCN), which fuses hidden features extracted from multiple skeleton data streams at different levels to enhance the discriminative power of features. Unlike the previous GCN-based models that train networks corresponding to different streams independently, all the subnetworks in the proposed EF-GCN are jointly learned in an end-to-end manner. Experiments conducted on two skeleton datasets (i.e., NTU-RGB+D and NTU-120 RGB+D) show the superior performance of EF-GCN.