CSRP: Modeling class spatial relation with prototype network for novel class discovery

IF 3.5 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence Pub Date : 2024-12-23 DOI:10.1007/s10489-024-05946-5

Wei Jin, Nannan Li, Jiuqing Dong, Huiwen Guo, Wenmin Wang, Chuanchuan You

{"title":"CSRP: Modeling class spatial relation with prototype network for novel class discovery","authors":"Wei Jin, Nannan Li, Jiuqing Dong, Huiwen Guo, Wenmin Wang, Chuanchuan You","doi":"10.1007/s10489-024-05946-5","DOIUrl":null,"url":null,"abstract":"<div><p>Novel Class Discovery(NCD) is a learning paradigm within the open-world task, in which machine learning models leverage prior knowledge to guide unknown samples into semantic clusters in an unsupervised environment. Recent research notes that maintaining class relations can assist classifiers in better recognizing unknown classes. Inspired by this study, we propose Class-Spatial-Relation modeling with a Prototype network (CSRP). A prototype network is a machine learning model used to classify tasks. It performs by learning prototypes for each class and makes classification decisions based on the similarity between a given sample and these prototypes. It conducts complex class boundaries better than linear classification models, providing higher flexibility and accuracy for classification tasks. Specifically, the proposed prototype network enables spatial modeling based on the distance between samples and each prototype, which can better obtain class relation information to improve the model’s interpretability and robustness. In addition, we simultaneously perform knowledge distillation on known and unknown classes to balance the model’s classification performance for each class. To evaluate the effectiveness and generality of our method, we perform extensive experiments on the CIFAR-100 dataset and fine-grained datasets: Stanford Cars, CUB-200-2011, and FGVC-Aircraft, respectively. Our method results are comparable to existing state-of-the-art performance in the standard dataset CIFAF100, while outstanding performance on three fine-grained datasets surpassed the baseline by 3%-9%. In addition, our method creates more compact clusters in the latent space than in linear classification. The success demonstrates the effectiveness of our approach.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 3","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-024-05946-5","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Novel Class Discovery(NCD) is a learning paradigm within the open-world task, in which machine learning models leverage prior knowledge to guide unknown samples into semantic clusters in an unsupervised environment. Recent research notes that maintaining class relations can assist classifiers in better recognizing unknown classes. Inspired by this study, we propose Class-Spatial-Relation modeling with a Prototype network (CSRP). A prototype network is a machine learning model used to classify tasks. It performs by learning prototypes for each class and makes classification decisions based on the similarity between a given sample and these prototypes. It conducts complex class boundaries better than linear classification models, providing higher flexibility and accuracy for classification tasks. Specifically, the proposed prototype network enables spatial modeling based on the distance between samples and each prototype, which can better obtain class relation information to improve the model’s interpretability and robustness. In addition, we simultaneously perform knowledge distillation on known and unknown classes to balance the model’s classification performance for each class. To evaluate the effectiveness and generality of our method, we perform extensive experiments on the CIFAR-100 dataset and fine-grained datasets: Stanford Cars, CUB-200-2011, and FGVC-Aircraft, respectively. Our method results are comparable to existing state-of-the-art performance in the standard dataset CIFAF100, while outstanding performance on three fine-grained datasets surpassed the baseline by 3%-9%. In addition, our method creates more compact clusters in the latent space than in linear classification. The success demonstrates the effectiveness of our approach.

Abstract Image

查看原文本刊更多论文

基于原型网络的类空间关系建模研究

新颖类发现（NCD）是开放世界任务中的一种学习范式，其中机器学习模型利用先验知识将未知样本引导到无监督环境中的语义聚类中。最近的研究指出，维持类关系可以帮助分类器更好地识别未知的类。受此启发，我们提出了基于原型网络（CSRP）的类-空间关系建模方法。原型网络是一种用于分类任务的机器学习模型。它通过学习每个类的原型来执行，并根据给定样本与这些原型之间的相似性做出分类决策。它比线性分类模型更能处理复杂的类边界，为分类任务提供了更高的灵活性和准确性。具体而言，本文提出的原型网络基于样本与每个原型之间的距离进行空间建模，可以更好地获取类关系信息，提高模型的可解释性和鲁棒性。此外，我们同时对已知和未知类进行知识蒸馏，以平衡模型对每个类的分类性能。为了评估我们方法的有效性和通用性，我们分别在CIFAR-100数据集和细粒度数据集上进行了广泛的实验：斯坦福汽车、CUB-200-2011和FGVC-Aircraft。我们的方法结果与标准数据集CIFAF100中现有的最先进性能相当，而在三个细粒度数据集上的出色性能超过了基线3%-9%。此外，与线性分类相比，我们的方法在潜在空间中创建了更紧凑的聚类。这一成功证明了我们方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Applied Intelligence 工程技术-计算机：人工智能

CiteScore

6.60

自引率

20.80%

发文量

1361

审稿时长

5.9 months

期刊介绍： With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance. The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.