基于图卷积网络的图像分类标签分布学习

2022 International Conference on Cyber-Physical Social Intelligence (ICCSI) Pub Date : 2022-11-18 DOI:10.1109/ICCSI55536.2022.9970653

Changqing Gong, Shanshan Wang, Yiquan Wu, Chongwen Liu, M. El-Yacoubi, Huafeng Qin

{"title":"基于图卷积网络的图像分类标签分布学习","authors":"Changqing Gong, Shanshan Wang, Yiquan Wu, Chongwen Liu, M. El-Yacoubi, Huafeng Qin","doi":"10.1109/ICCSI55536.2022.9970653","DOIUrl":null,"url":null,"abstract":"The one-hot vector employed for true label representation has been widely applied for image classification. However, the one-hot representation assumes that a single label is only associated with one instance, which is not reasonable because labels are generally not completely independent and instances may relate to multiple labels for the real scenarios. Such one-hot representation may ignore the relevance among labels that provide more supervision information for model training. To capture and explore such significant relevance in image classification, we propose, in this paper, GCNLDL, a Graph Convolutional Network based Label Distribution Learning approach for classification. GCNLDL builds a directed graph over the images, where each node (sample) is represented by the embedding of an image, where GCN learns the relevance between an input image and multiple training images from different classes to obtain the label distribution vector. The resulting vector is further combined with the one-hot label to recover a realistic label distribution of the input image, which is employed to train the state-of-the-art classification models. Furthermore, a multilayer perception is proposed to learn an effective label correlation matrix to guide information propagation among the nodes in GCN. GCNLDL is capable of capturing the relevance among labels by representation learning graph structure among image samples during training process and produces a better label distribution to guide the training of the state-of-the-art image classification models, resulting in a performance improvement of image classification. Rigorous experimental results on four public image classification datasets show that GCNLDL outperforms other approaches and effectively improves the performance of deep learning classification based models.","PeriodicalId":421514,"journal":{"name":"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)","volume":"113 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Graph Convolutional Networks-based Label Distribution Learning for Image Classification\",\"authors\":\"Changqing Gong, Shanshan Wang, Yiquan Wu, Chongwen Liu, M. El-Yacoubi, Huafeng Qin\",\"doi\":\"10.1109/ICCSI55536.2022.9970653\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The one-hot vector employed for true label representation has been widely applied for image classification. However, the one-hot representation assumes that a single label is only associated with one instance, which is not reasonable because labels are generally not completely independent and instances may relate to multiple labels for the real scenarios. Such one-hot representation may ignore the relevance among labels that provide more supervision information for model training. To capture and explore such significant relevance in image classification, we propose, in this paper, GCNLDL, a Graph Convolutional Network based Label Distribution Learning approach for classification. GCNLDL builds a directed graph over the images, where each node (sample) is represented by the embedding of an image, where GCN learns the relevance between an input image and multiple training images from different classes to obtain the label distribution vector. The resulting vector is further combined with the one-hot label to recover a realistic label distribution of the input image, which is employed to train the state-of-the-art classification models. Furthermore, a multilayer perception is proposed to learn an effective label correlation matrix to guide information propagation among the nodes in GCN. GCNLDL is capable of capturing the relevance among labels by representation learning graph structure among image samples during training process and produces a better label distribution to guide the training of the state-of-the-art image classification models, resulting in a performance improvement of image classification. Rigorous experimental results on four public image classification datasets show that GCNLDL outperforms other approaches and effectively improves the performance of deep learning classification based models.\",\"PeriodicalId\":421514,\"journal\":{\"name\":\"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)\",\"volume\":\"113 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSI55536.2022.9970653\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSI55536.2022.9970653","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

用于真标签表示的单热向量在图像分类中得到了广泛应用。但是，one-hot表示假设单个标签只与一个实例相关联，这是不合理的，因为标签通常不是完全独立的，并且在实际场景中，实例可能与多个标签相关。这种单热表示可能会忽略标签之间的相关性，而标签之间的相关性为模型训练提供了更多的监督信息。为了捕捉和探索图像分类中这种重要的相关性，我们在本文中提出了GCNLDL，一种基于图卷积网络的标签分布学习方法。GCNLDL在图像上构建一个有向图，其中每个节点(样本)由图像的嵌入表示，GCN学习输入图像与来自不同类别的多个训练图像之间的相关性，以获得标签分布向量。得到的向量与单热标签进一步结合，恢复输入图像的真实标签分布，用于训练最先进的分类模型。在此基础上，提出了一种多层感知来学习有效的标签相关矩阵，以指导GCN中节点间的信息传播。GCNLDL能够在训练过程中通过表示学习图像样本之间的图结构来捕获标签之间的相关性，并产生更好的标签分布来指导最先进的图像分类模型的训练，从而提高图像分类的性能。在4个公共图像分类数据集上的严格实验结果表明，GCNLDL优于其他方法，有效地提高了基于深度学习的分类模型的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Graph Convolutional Networks-based Label Distribution Learning for Image Classification

The one-hot vector employed for true label representation has been widely applied for image classification. However, the one-hot representation assumes that a single label is only associated with one instance, which is not reasonable because labels are generally not completely independent and instances may relate to multiple labels for the real scenarios. Such one-hot representation may ignore the relevance among labels that provide more supervision information for model training. To capture and explore such significant relevance in image classification, we propose, in this paper, GCNLDL, a Graph Convolutional Network based Label Distribution Learning approach for classification. GCNLDL builds a directed graph over the images, where each node (sample) is represented by the embedding of an image, where GCN learns the relevance between an input image and multiple training images from different classes to obtain the label distribution vector. The resulting vector is further combined with the one-hot label to recover a realistic label distribution of the input image, which is employed to train the state-of-the-art classification models. Furthermore, a multilayer perception is proposed to learn an effective label correlation matrix to guide information propagation among the nodes in GCN. GCNLDL is capable of capturing the relevance among labels by representation learning graph structure among image samples during training process and produces a better label distribution to guide the training of the state-of-the-art image classification models, resulting in a performance improvement of image classification. Rigorous experimental results on four public image classification datasets show that GCNLDL outperforms other approaches and effectively improves the performance of deep learning classification based models.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)

自引率

0.00%

发文量