Tianming Zhuang, Pengbiao Zhao, Peng Xiao, Bin Wang
{"title":"基于分割策略的多流CNN-LSTM网络人体动作识别","authors":"Tianming Zhuang, Pengbiao Zhao, Peng Xiao, Bin Wang","doi":"10.1145/3448748.3448815","DOIUrl":null,"url":null,"abstract":"The wide application of human action recognition in the field of computer vision makes it a hot research topic in the past decades. In recent years, the prevalence of deep sensors and the proposal of real-time skeleton estimation algorithm based on deep images make human action recognition based on skeleton sequence attract increasing attention of researchers. Most of the existing work is aimed at extracting the spatial information of different joint nodes in a frame, but they do not fully consider the combination of temporal and spatial features. At the same time, the different joints were regarded as equally significant in most previous work, which is obviously not in line with the physiological characteristics and kinematics of human body. Therefore, in this paper, a human joint partition strategy is proposed to divide 25 human joints. In addition, a cnn-lstm framework is designed, which can simultaneously model the spatio-temporal characteristics of human skeleton sequence data, and extract the spatial domain information of different joints in a frame and the temporal domain information embedded in consecutive frames.","PeriodicalId":89223,"journal":{"name":"Proceedings ... International Joint Conference on Bioinformatics, Systems Biology and Intellgent Computing. International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing","volume":"76 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Stream CNN-LSTM Network with Partition Strategy for Human Action Recognition\",\"authors\":\"Tianming Zhuang, Pengbiao Zhao, Peng Xiao, Bin Wang\",\"doi\":\"10.1145/3448748.3448815\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The wide application of human action recognition in the field of computer vision makes it a hot research topic in the past decades. In recent years, the prevalence of deep sensors and the proposal of real-time skeleton estimation algorithm based on deep images make human action recognition based on skeleton sequence attract increasing attention of researchers. Most of the existing work is aimed at extracting the spatial information of different joint nodes in a frame, but they do not fully consider the combination of temporal and spatial features. At the same time, the different joints were regarded as equally significant in most previous work, which is obviously not in line with the physiological characteristics and kinematics of human body. Therefore, in this paper, a human joint partition strategy is proposed to divide 25 human joints. In addition, a cnn-lstm framework is designed, which can simultaneously model the spatio-temporal characteristics of human skeleton sequence data, and extract the spatial domain information of different joints in a frame and the temporal domain information embedded in consecutive frames.\",\"PeriodicalId\":89223,\"journal\":{\"name\":\"Proceedings ... International Joint Conference on Bioinformatics, Systems Biology and Intellgent Computing. International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing\",\"volume\":\"76 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings ... International Joint Conference on Bioinformatics, Systems Biology and Intellgent Computing. International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3448748.3448815\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings ... International Joint Conference on Bioinformatics, Systems Biology and Intellgent Computing. International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3448748.3448815","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-Stream CNN-LSTM Network with Partition Strategy for Human Action Recognition
The wide application of human action recognition in the field of computer vision makes it a hot research topic in the past decades. In recent years, the prevalence of deep sensors and the proposal of real-time skeleton estimation algorithm based on deep images make human action recognition based on skeleton sequence attract increasing attention of researchers. Most of the existing work is aimed at extracting the spatial information of different joint nodes in a frame, but they do not fully consider the combination of temporal and spatial features. At the same time, the different joints were regarded as equally significant in most previous work, which is obviously not in line with the physiological characteristics and kinematics of human body. Therefore, in this paper, a human joint partition strategy is proposed to divide 25 human joints. In addition, a cnn-lstm framework is designed, which can simultaneously model the spatio-temporal characteristics of human skeleton sequence data, and extract the spatial domain information of different joints in a frame and the temporal domain information embedded in consecutive frames.