Beibei Jin, Rong Zhou, Zhisheng Zhang, Min Dai
2018 25th International Conference on Mechatronics and Machine Vision in Practice (M2VIP), November 2018
DOI: 10.1109/M2VIP.2018.8600864
Unsupervised Video Prediction Network with Spatio-temporal Deep Features
Predicting the future states of objects is an important manifestation of intelligence, and it is also of vital importance in real-time systems such as autonomous cars and robotics. This paper tackles the video prediction task. Previous methods for future-frame prediction are often constrained by environmental conditions, leading to poor accuracy and blurry prediction details. In this work, we present an unsupervised video prediction framework that iteratively anticipates the raw RGB pixel values of future video frames. Extensive experiments are conducted on the widely used KTH and KITTI datasets, and the results demonstrate that our method achieves good performance.
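The iterative prediction scheme the abstract describes can be sketched as an autoregressive rollout: each predicted frame is fed back as context for the next step. This is a minimal illustration only; `predict_next_frame` is a hypothetical stand-in (the paper's actual spatio-temporal network is not specified here), implemented as a trivial linear pixel extrapolation.

```python
import numpy as np

def predict_next_frame(context):
    # Hypothetical single-step predictor: any model mapping a window of
    # past RGB frames to the next frame fits this interface. As a trivial
    # stand-in, linearly extrapolate each pixel from the last two frames.
    prev, curr = context[-2], context[-1]
    return np.clip(curr + (curr - prev), 0.0, 1.0)

def rollout(frames, n_future, context_len=2):
    # Iterative (autoregressive) prediction: append each predicted frame
    # to the context window and use it to anticipate the next one.
    context = list(frames[-context_len:])
    preds = []
    for _ in range(n_future):
        nxt = predict_next_frame(context)
        preds.append(nxt)
        context = (context + [nxt])[-context_len:]
    return preds

# Usage: two 4x4 RGB frames whose brightness rises by 0.1 per step.
seed = [np.full((4, 4, 3), 0.2), np.full((4, 4, 3), 0.3)]
future = rollout(seed, n_future=3)
```

In a real system the stand-in predictor would be replaced by a trained network, but the rollout loop, where prediction errors can compound over successive steps, is the same.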