Aniket Aayush, Animesh Khare, Abhijnan Bajpai, M. Aman Kumar, Ramamoorthy Srinath
{"title":"基于递归神经网络的视频预测","authors":"Aniket Aayush, Animesh Khare, Abhijnan Bajpai, M. Aman Kumar, Ramamoorthy Srinath","doi":"10.1109/ICAIA57370.2023.10169630","DOIUrl":null,"url":null,"abstract":"With the advent of recent technologies, Deep Learning, a subset of machine learning, has gained popularity in solving problems in a variety of domains. One vast field of application for Deep Learning approaches is in the generation of video frames. While Video Interpolation has come a long way, Deep Learning based Video Prediction remains a prominent area of research. We implement a recurrent neural network for the task of video prediction, and then analyze the performance of the model for several different generation ratios and examine the impact of variations in the videos on the model’s ability to stick close to the ground truth. The model is tested on raw videos from Youtube of the ‘Comedian’ category. The quality of the model is evaluated using quantitative and qualitative metrics. The potential use case can be in video streaming to reduce the transmission bandwidth and to generate frames that are not present in the video.","PeriodicalId":196526,"journal":{"name":"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Video Prediction using Recurrent Neural Network\",\"authors\":\"Aniket Aayush, Animesh Khare, Abhijnan Bajpai, M. Aman Kumar, Ramamoorthy Srinath\",\"doi\":\"10.1109/ICAIA57370.2023.10169630\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advent of recent technologies, Deep Learning, a subset of machine learning, has gained popularity in solving problems in a variety of domains. One vast field of application for Deep Learning approaches is in the generation of video frames. While Video Interpolation has come a long way, Deep Learning based Video Prediction remains a prominent area of research. We implement a recurrent neural network for the task of video prediction, and then analyze the performance of the model for several different generation ratios and examine the impact of variations in the videos on the model’s ability to stick close to the ground truth. The model is tested on raw videos from Youtube of the ‘Comedian’ category. The quality of the model is evaluated using quantitative and qualitative metrics. The potential use case can be in video streaming to reduce the transmission bandwidth and to generate frames that are not present in the video.\",\"PeriodicalId\":196526,\"journal\":{\"name\":\"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIA57370.2023.10169630\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Artificial Intelligence and Applications (ICAIA) Alliance Technology Conference (ATCON-1)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIA57370.2023.10169630","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
With the advent of recent technologies, Deep Learning, a subset of machine learning, has gained popularity in solving problems in a variety of domains. One vast field of application for Deep Learning approaches is in the generation of video frames. While Video Interpolation has come a long way, Deep Learning based Video Prediction remains a prominent area of research. We implement a recurrent neural network for the task of video prediction, and then analyze the performance of the model for several different generation ratios and examine the impact of variations in the videos on the model’s ability to stick close to the ground truth. The model is tested on raw videos from Youtube of the ‘Comedian’ category. The quality of the model is evaluated using quantitative and qualitative metrics. The potential use case can be in video streaming to reduce the transmission bandwidth and to generate frames that are not present in the video.