{"title":"基于新型子带间三维DWT特征的无参考视频质量评估(VQA","authors":"Anish Kumar Vishwakarma, K. Bhurchandi","doi":"10.1109/NCC48643.2020.9056046","DOIUrl":null,"url":null,"abstract":"Major goal of blind video quality assessment (VQA) is to predict visual quality of videos for enhancing the quality of service (QoS) without any reference. However, the conventional VQA model uses video as a two-dimensional image sequence and extracts the features on a frame to frame basis; which completely neglects temporal nature of the video data and the corresponding distortion content as well. In this work, we come up with a novel no-reference VQA model that describes and exploits the inter sub-band statistics of the three-dimensional discrete wavelet transform (3-D DWT) coefficients of video blocks. First, the 3-D DWT transform decomposes the video block of selected size into eight 3-D DWT sub-bands. Then we propose various novel statistical features using the sub-band coefficients. Inter sub-band statistics depicts the spread of the various frequency components and correlation between them. 3-D DWT features automatically take care of the temporal distortions along with spatial distortions and subsequently the support vector regression (SVR) model estimates them to predict the visual quality score of distorted videos. Experimental results on LIVE database demonstrate the superiority of the proposed VQA model over the other state-of-the-art methods.","PeriodicalId":183772,"journal":{"name":"2020 National Conference on Communications (NCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"No-Reference Video Quality Assessment (VQA) Using Novel Inter Sub-band 3-D DWT Features\",\"authors\":\"Anish Kumar Vishwakarma, K. Bhurchandi\",\"doi\":\"10.1109/NCC48643.2020.9056046\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Major goal of blind video quality assessment (VQA) is to predict visual quality of videos for enhancing the quality of service (QoS) without any reference. However, the conventional VQA model uses video as a two-dimensional image sequence and extracts the features on a frame to frame basis; which completely neglects temporal nature of the video data and the corresponding distortion content as well. In this work, we come up with a novel no-reference VQA model that describes and exploits the inter sub-band statistics of the three-dimensional discrete wavelet transform (3-D DWT) coefficients of video blocks. First, the 3-D DWT transform decomposes the video block of selected size into eight 3-D DWT sub-bands. Then we propose various novel statistical features using the sub-band coefficients. Inter sub-band statistics depicts the spread of the various frequency components and correlation between them. 3-D DWT features automatically take care of the temporal distortions along with spatial distortions and subsequently the support vector regression (SVR) model estimates them to predict the visual quality score of distorted videos. Experimental results on LIVE database demonstrate the superiority of the proposed VQA model over the other state-of-the-art methods.\",\"PeriodicalId\":183772,\"journal\":{\"name\":\"2020 National Conference on Communications (NCC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 National Conference on Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC48643.2020.9056046\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC48643.2020.9056046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
No-Reference Video Quality Assessment (VQA) Using Novel Inter Sub-band 3-D DWT Features
Major goal of blind video quality assessment (VQA) is to predict visual quality of videos for enhancing the quality of service (QoS) without any reference. However, the conventional VQA model uses video as a two-dimensional image sequence and extracts the features on a frame to frame basis; which completely neglects temporal nature of the video data and the corresponding distortion content as well. In this work, we come up with a novel no-reference VQA model that describes and exploits the inter sub-band statistics of the three-dimensional discrete wavelet transform (3-D DWT) coefficients of video blocks. First, the 3-D DWT transform decomposes the video block of selected size into eight 3-D DWT sub-bands. Then we propose various novel statistical features using the sub-band coefficients. Inter sub-band statistics depicts the spread of the various frequency components and correlation between them. 3-D DWT features automatically take care of the temporal distortions along with spatial distortions and subsequently the support vector regression (SVR) model estimates them to predict the visual quality score of distorted videos. Experimental results on LIVE database demonstrate the superiority of the proposed VQA model over the other state-of-the-art methods.