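3-Dimensional Convolution Based Iterative Model for Efficient Motion Map Generation for Representing Video Discriminative Information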

2017 International Conference on Virtual Reality and Visualization (ICVRV) | Pub Date: 2017-10-01 | DOI: 10.1109/ICVRV.2017.00111

Sheeraz Arif, Wang Wangjing


Citations: 0

Abstract



In this paper, we present a simple method for integrating the discriminative information of a video for action recognition tasks. We introduce the concept of a motion map, which represents the prefix of a video sequence and is obtained by optimizing the recognition accuracy on the original video. A 3-dimensional convolution (3Dconv) based model generates a new motion map by integrating the current motion map with the future video frame. The model can increase the length of the training video in an iterative manner, which allows us to generate the final motion map. Experimental results on the widely used HMDB51 and UCF101 datasets show the effectiveness and flexibility of the proposed method compared with other baseline schemes.
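The iterative update described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names (`conv3d_full_depth`, `update_motion_map`, `generate_motion_map`), the single-channel frames, and the hand-set kernel are all assumptions made for the example. In the paper the 3Dconv weights would be learned by optimizing recognition accuracy, which is not reproduced here.

```python
import numpy as np


def conv3d_full_depth(volume, kernel):
    """Cross-correlate a (D, H, W) volume with a (D, kh, kw) kernel.

    The kernel spans the whole depth, so the result is one (H, W) map.
    Zero padding keeps the spatial size (odd kh and kw are assumed).
    """
    d, h, w = volume.shape
    kd, kh, kw = kernel.shape
    if kd != d:
        raise ValueError("kernel depth must match volume depth")
    ph, pw = kh // 2, kw // 2
    padded = np.pad(volume, ((0, 0), (ph, ph), (pw, pw)))
    out = np.zeros((h, w))
    for i in range(h):
        for j in range(w):
            out[i, j] = np.sum(padded[:, i:i + kh, j:j + kw] * kernel)
    return out


def update_motion_map(motion_map, next_frame, kernel):
    """Fuse the current motion map with the next frame into a new map."""
    stacked = np.stack([motion_map, next_frame])  # shape (2, H, W)
    return conv3d_full_depth(stacked, kernel)


def generate_motion_map(frames, kernel):
    """Extend the represented video prefix one frame at a time."""
    motion_map = frames[0]  # assumption: the first frame seeds the map
    for frame in frames[1:]:
        motion_map = update_motion_map(motion_map, frame, kernel)
    return motion_map


# Usage with a hypothetical kernel that averages the centre pixels of the
# current map and the next frame (a stand-in for learned weights).
frames = [np.full((4, 4), v, dtype=float) for v in (0.0, 4.0, 8.0)]
kernel = np.zeros((2, 3, 3))
kernel[0, 1, 1] = 0.5
kernel[1, 1, 1] = 0.5
final_map = generate_motion_map(frames, kernel)
print(final_map.shape)  # (4, 4)
print(final_map[0, 0])  # 5.0
```

The point of the sketch is the recurrence: each step consumes only the previous motion map and one new frame, so the final map summarizes the whole sequence without holding all frames in the convolution at once.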

