基于新型子带间三维DWT特征的无参考视频质量评估(VQA

2020 National Conference on Communications (NCC) Pub Date : 2020-02-01 DOI:10.1109/NCC48643.2020.9056046

Anish Kumar Vishwakarma, K. Bhurchandi

{"title":"基于新型子带间三维DWT特征的无参考视频质量评估(VQA","authors":"Anish Kumar Vishwakarma, K. Bhurchandi","doi":"10.1109/NCC48643.2020.9056046","DOIUrl":null,"url":null,"abstract":"Major goal of blind video quality assessment (VQA) is to predict visual quality of videos for enhancing the quality of service (QoS) without any reference. However, the conventional VQA model uses video as a two-dimensional image sequence and extracts the features on a frame to frame basis; which completely neglects temporal nature of the video data and the corresponding distortion content as well. In this work, we come up with a novel no-reference VQA model that describes and exploits the inter sub-band statistics of the three-dimensional discrete wavelet transform (3-D DWT) coefficients of video blocks. First, the 3-D DWT transform decomposes the video block of selected size into eight 3-D DWT sub-bands. Then we propose various novel statistical features using the sub-band coefficients. Inter sub-band statistics depicts the spread of the various frequency components and correlation between them. 3-D DWT features automatically take care of the temporal distortions along with spatial distortions and subsequently the support vector regression (SVR) model estimates them to predict the visual quality score of distorted videos. Experimental results on LIVE database demonstrate the superiority of the proposed VQA model over the other state-of-the-art methods.","PeriodicalId":183772,"journal":{"name":"2020 National Conference on Communications (NCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"No-Reference Video Quality Assessment (VQA) Using Novel Inter Sub-band 3-D DWT Features\",\"authors\":\"Anish Kumar Vishwakarma, K. Bhurchandi\",\"doi\":\"10.1109/NCC48643.2020.9056046\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Major goal of blind video quality assessment (VQA) is to predict visual quality of videos for enhancing the quality of service (QoS) without any reference. However, the conventional VQA model uses video as a two-dimensional image sequence and extracts the features on a frame to frame basis; which completely neglects temporal nature of the video data and the corresponding distortion content as well. In this work, we come up with a novel no-reference VQA model that describes and exploits the inter sub-band statistics of the three-dimensional discrete wavelet transform (3-D DWT) coefficients of video blocks. First, the 3-D DWT transform decomposes the video block of selected size into eight 3-D DWT sub-bands. Then we propose various novel statistical features using the sub-band coefficients. Inter sub-band statistics depicts the spread of the various frequency components and correlation between them. 3-D DWT features automatically take care of the temporal distortions along with spatial distortions and subsequently the support vector regression (SVR) model estimates them to predict the visual quality score of distorted videos. Experimental results on LIVE database demonstrate the superiority of the proposed VQA model over the other state-of-the-art methods.\",\"PeriodicalId\":183772,\"journal\":{\"name\":\"2020 National Conference on Communications (NCC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 National Conference on Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC48643.2020.9056046\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC48643.2020.9056046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

盲视频质量评估(VQA)的主要目标是在没有任何参考的情况下预测视频的视觉质量，以提高服务质量(QoS)。然而，传统的VQA模型将视频作为二维图像序列，逐帧提取特征;这完全忽略了视频数据的时效性以及相应的失真内容。在这项工作中，我们提出了一种新的无参考VQA模型，该模型描述并利用了视频块的三维离散小波变换(3-D DWT)系数的子带间统计。首先，对选定大小的视频块进行三维DWT变换，将其分解为8个三维DWT子带;然后利用子带系数提出了各种新的统计特征。子带间统计描述了各种频率分量的分布和它们之间的相关性。三维DWT特征自动处理时间和空间畸变，然后使用支持向量回归(SVR)模型对时间和空间畸变进行估计，从而预测失真视频的视觉质量分数。在LIVE数据库上的实验结果表明，所提出的VQA模型比其他先进的方法具有优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

No-Reference Video Quality Assessment (VQA) Using Novel Inter Sub-band 3-D DWT Features

Major goal of blind video quality assessment (VQA) is to predict visual quality of videos for enhancing the quality of service (QoS) without any reference. However, the conventional VQA model uses video as a two-dimensional image sequence and extracts the features on a frame to frame basis; which completely neglects temporal nature of the video data and the corresponding distortion content as well. In this work, we come up with a novel no-reference VQA model that describes and exploits the inter sub-band statistics of the three-dimensional discrete wavelet transform (3-D DWT) coefficients of video blocks. First, the 3-D DWT transform decomposes the video block of selected size into eight 3-D DWT sub-bands. Then we propose various novel statistical features using the sub-band coefficients. Inter sub-band statistics depicts the spread of the various frequency components and correlation between them. 3-D DWT features automatically take care of the temporal distortions along with spatial distortions and subsequently the support vector regression (SVR) model estimates them to predict the visual quality score of distorted videos. Experimental results on LIVE database demonstrate the superiority of the proposed VQA model over the other state-of-the-art methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 National Conference on Communications (NCC)

自引率

0.00%

发文量