No-reference video quality assessment using data dimensionality reduction and attention-based pooling

International Conference on Digital Image Processing Pub Date : 2022-10-12 DOI:10.1117/12.2643807

Zhiwei Wang, Linjing Lai

引用次数: 0

Abstract

This paper proposes a new end-to-end no-reference (NR) video quality assessment (VQA) algorithm that makes use of dimensionality reduction and attention-based pooling. Firstly, the dataset is expanded through data enhancement based on frame sampling. Secondly, the cropped video blocks are input into the trainable data dimensionality reduction module which adopts 3D convolution to reduce the dimension of the data. Then, the dimensionality reduced data is input into the backbone of the algorithm to extract spatial features. The extracted features are pooled through attention-based pooling. Finally, the pooled features are regressed to the quality score through the full connection layer. Experimental results show that the proposed algorithm has achieved competitive performance on the LIVE, LIVE Mobile and CVD2014 datasets, and has low complexity.

查看原文本刊更多论文

使用数据降维和基于注意力池的无参考视频质量评估

本文提出了一种基于降维和注意力池的端到端无参考视频质量评估算法。首先，通过基于帧采样的数据增强对数据集进行扩展;其次，将裁剪后的视频块输入可训练数据降维模块，该模块采用三维卷积对数据进行降维。然后，将降维后的数据输入到算法的主干中提取空间特征。提取的特征通过基于注意力的池化进行池化。最后，通过全连接层将混合特征回归到质量分数。实验结果表明，该算法在LIVE、LIVE Mobile和CVD2014数据集上取得了较好的性能，且具有较低的复杂度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference on Digital Image Processing

自引率

0.00%

发文量