{"title":"COME for No-Reference Video Quality Assessment","authors":"Chunfeng Wang, Li Su, W. Zhang","doi":"10.1109/MIPR.2018.00056","DOIUrl":null,"url":null,"abstract":"Nowadays, the issue of objective Video Quality Assessment (VQA) has been extensively studied. In this paper, we present an effective general-purpose VQA method named COnvolutional neural network and Multi-regression based Evaluation (COME). It requires no referred lossless video and is universal for non-specific types of distortion. A modified 2D convolutional neural network is introduced to learn the spatial features at frame level. At the same time, the motion information is extracted as temporal features at sequence level. And a multi-regression model is proposed to comprehensively assess the final video quality according to human’s psychological perception. The proposed method is tested on two commonly used databases with numerous kinds of distortions. The experimental results show that the proposed COME method is comparable with most popular full-reference VQA methods.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIPR.2018.00056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
Nowadays, the issue of objective Video Quality Assessment (VQA) has been extensively studied. In this paper, we present an effective general-purpose VQA method named COnvolutional neural network and Multi-regression based Evaluation (COME). It requires no referred lossless video and is universal for non-specific types of distortion. A modified 2D convolutional neural network is introduced to learn the spatial features at frame level. At the same time, the motion information is extracted as temporal features at sequence level. And a multi-regression model is proposed to comprehensively assess the final video quality according to human’s psychological perception. The proposed method is tested on two commonly used databases with numerous kinds of distortions. The experimental results show that the proposed COME method is comparable with most popular full-reference VQA methods.