VQ-ViCNet: Strengthen Unique Features Comparison Autoencoder with Embedding Space for Covid-19 Image Classification

2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS) Pub Date : 2022-11-26 DOI:10.1109/CCIS57298.2022.10016338

Qide Liu, Jielei Chu, Hua Yu, Xinlei Wang, Tianrui Li

引用次数: 0

Abstract

In this paper, we propose a new novel coronavirus pneumonia image classification model based on the combination of Transformer and convolutional network(VQ-ViCNet), and present a vector quantization feature enhancement module for the inconspicuous characteristics of lung medical image data. This model extracts the local latent layer features of the image through the convolutional network, and learns the deep global features of the image data through the Transformer’s multi-head self attention algorithm. After the calculation of convolution and attention, the features learned by the Transformer Encoder are enhanced by the vector quantization feature enhancement module and able to better complete the final downstream tasks. This model performs better than convolutional architectures, pure attention architectures and generative models on all 6 public datasets.

查看原文本刊更多论文

VQ-ViCNet:基于嵌入空间的Covid-19图像分类增强独特特征比较自编码器

本文提出了一种基于Transformer和卷积网络相结合的新型冠状病毒肺炎图像分类模型(VQ-ViCNet)，并针对肺部医学图像数据的不显著特征提出了矢量量化特征增强模块。该模型通过卷积网络提取图像的局部潜在层特征，并通过Transformer的多头自关注算法学习图像数据的深度全局特征。经过卷积计算和关注后，Transformer Encoder学习到的特征通过矢量量化特征增强模块进行增强，能够更好地完成最终的下游任务。该模型在所有6个公共数据集上的表现都优于卷积架构、纯注意力架构和生成模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)

自引率

0.00%

发文量