{"title":"基于模式理论的动作识别视频时间结构学习","authors":"Xiaoyu Zhang","doi":"10.1145/3404555.3404628","DOIUrl":null,"url":null,"abstract":"Aiming at the problem that a large amount of background information in the videos cause low judgment of actions, this paper proposed a graph model based on pattern theory for human complex action recognition. Firstly, a video is divided into video units and each video unit corresponds to an atomic action. The atomic action labels of videos are initialized by k-Means. Secondly, the key generator proposal module and the interpretative operation module are proposed to select important foreground information and obtain a reasonable representation of atomic action sequences. In the inference stage, the atomic action sequences of test videos are matched with template sequences by the Dynamic Time Warping algorithm (DTW) to obtain the action categories. The experimental results show that compared with the most existing human action recognition models, our model can explain the temporal process of action occurrence and obtain a more discriminatory sequence representation, which can effectively improve the accuracy of action recognition.","PeriodicalId":220526,"journal":{"name":"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning Temporal Structure of Videos for Action Recognition Using Pattern Theory\",\"authors\":\"Xiaoyu Zhang\",\"doi\":\"10.1145/3404555.3404628\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the problem that a large amount of background information in the videos cause low judgment of actions, this paper proposed a graph model based on pattern theory for human complex action recognition. Firstly, a video is divided into video units and each video unit corresponds to an atomic action. The atomic action labels of videos are initialized by k-Means. Secondly, the key generator proposal module and the interpretative operation module are proposed to select important foreground information and obtain a reasonable representation of atomic action sequences. In the inference stage, the atomic action sequences of test videos are matched with template sequences by the Dynamic Time Warping algorithm (DTW) to obtain the action categories. The experimental results show that compared with the most existing human action recognition models, our model can explain the temporal process of action occurrence and obtain a more discriminatory sequence representation, which can effectively improve the accuracy of action recognition.\",\"PeriodicalId\":220526,\"journal\":{\"name\":\"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence\",\"volume\":\"98 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3404555.3404628\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3404555.3404628","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Learning Temporal Structure of Videos for Action Recognition Using Pattern Theory
Aiming at the problem that a large amount of background information in the videos cause low judgment of actions, this paper proposed a graph model based on pattern theory for human complex action recognition. Firstly, a video is divided into video units and each video unit corresponds to an atomic action. The atomic action labels of videos are initialized by k-Means. Secondly, the key generator proposal module and the interpretative operation module are proposed to select important foreground information and obtain a reasonable representation of atomic action sequences. In the inference stage, the atomic action sequences of test videos are matched with template sequences by the Dynamic Time Warping algorithm (DTW) to obtain the action categories. The experimental results show that compared with the most existing human action recognition models, our model can explain the temporal process of action occurrence and obtain a more discriminatory sequence representation, which can effectively improve the accuracy of action recognition.