Learning Temporal Structure of Videos for Action Recognition Using Pattern Theory

Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence. Pub Date: 2020-04-23. DOI: 10.1145/3404555.3404628

Xiaoyu Zhang

Citations: 0

Abstract

To address the problem that large amounts of background information in videos degrade action recognition, this paper proposes a graph model based on pattern theory for recognizing complex human actions. First, a video is divided into video units, each corresponding to an atomic action; the atomic action labels of the videos are initialized by k-means. Second, a key generator proposal module and an interpretative operation module are proposed to select important foreground information and obtain a reasonable representation of atomic action sequences. In the inference stage, the atomic action sequences of test videos are matched against template sequences using the Dynamic Time Warping (DTW) algorithm to obtain the action categories. The experimental results show that, compared with most existing human action recognition models, our model can explain the temporal process by which an action unfolds and obtains a more discriminative sequence representation, which effectively improves the accuracy of action recognition.
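The DTW-based inference step can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes atomic actions are represented as integer cluster labels (for instance, the output of a k-means step not shown here), uses a simple 0/1 mismatch cost between labels, and classifies by nearest template. The function names `dtw_distance` and `classify` and the action names "wave" and "jump" are hypothetical.

```python
from typing import Dict, List, Optional, Sequence


def dtw_distance(a: Sequence[int], b: Sequence[int]) -> float:
    """Standard dynamic-programming DTW between two label sequences.

    Assumption: the local cost is 0 when two atomic-action labels match
    and 1 otherwise. The abstract does not specify the cost function.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] holds the best alignment cost of a[:i] against b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = 0.0 if a[i - 1] == b[j - 1] else 1.0
            cost[i][j] = step + min(
                cost[i - 1][j],      # a advances, b[j-1] is reused
                cost[i][j - 1],      # b advances, a[i-1] is reused
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def classify(
    sequence: Sequence[int],
    templates: Dict[str, List[Sequence[int]]],
) -> Optional[str]:
    """Return the action whose closest template has the smallest DTW distance."""
    best_label: Optional[str] = None
    best_dist = float("inf")
    for label, seqs in templates.items():
        for template in seqs:
            dist = dtw_distance(sequence, template)
            if dist < best_dist:
                best_label, best_dist = label, dist
    return best_label


# Hypothetical templates: each action maps to one or more label sequences.
TEMPLATES: Dict[str, List[Sequence[int]]] = {
    "wave": [[0, 1, 2]],
    "jump": [[3, 4, 3]],
}

if __name__ == "__main__":
    # A test video whose atomic actions dwell longer on labels 0 and 2.
    print(classify([0, 0, 1, 2, 2], TEMPLATES))
```

Because DTW lets one label in a sequence align with a run of repeated labels in the other, a test video that lingers on an atomic action can still match a shorter template at zero cost, which is why the method suits sequences of varying duration.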
