{"title":"Motion Information Image for Action Recognition","authors":"Yue Yan, Jianming Liu, Qin Cheng, Zhenshan Lu","doi":"10.1109/ICSP54964.2022.9778732","DOIUrl":null,"url":null,"abstract":"The paper proposes an effective motion information computation method, called Motion Information Image, which can aggregate motion information of multiple frames into one frame to represent the dynamic process of compact motion in video for action recognition. Motion Information Image can adopt an average sampling strategy to represent the temporal modeling relationship of the video, which allows the clever conversion of motion information into image information. Besides, we use convolutional neural networks (CNNs) combined with Motion Information Image to build a new Motion Information Image network for end-to-end action recognition. The performance of the method is improved by 2.0%~3.0% in NTU RGB+D and NTU RGB+D 120 datasets with extensive experimental validation and exhibits significant advantages for recognizing fine-grained actions.","PeriodicalId":363766,"journal":{"name":"2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSP54964.2022.9778732","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The paper proposes an effective motion information computation method, called Motion Information Image, which can aggregate motion information of multiple frames into one frame to represent the dynamic process of compact motion in video for action recognition. Motion Information Image can adopt an average sampling strategy to represent the temporal modeling relationship of the video, which allows the clever conversion of motion information into image information. Besides, we use convolutional neural networks (CNNs) combined with Motion Information Image to build a new Motion Information Image network for end-to-end action recognition. The performance of the method is improved by 2.0%~3.0% in NTU RGB+D and NTU RGB+D 120 datasets with extensive experimental validation and exhibits significant advantages for recognizing fine-grained actions.