Selecting Central and Divergent Samples via Leading Tree Metric Space for Semisupervised Learning

IF 10.7 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Ji Xu;Gang Ren;Jianhang Tang;Weiping Ding;Guoyin Wang
{"title":"Selecting Central and Divergent Samples via Leading Tree Metric Space for Semisupervised Learning","authors":"Ji Xu;Gang Ren;Jianhang Tang;Weiping Ding;Guoyin Wang","doi":"10.1109/TFUZZ.2025.3528400","DOIUrl":null,"url":null,"abstract":"The distribution of the labeled data can greatly affect the performance of a semisupervised learning (SSL) model. Most existing SSL models select the labeled data randomly and equally allocate the labeling quota among the classes, leading to considerable unstableness and degeneration of performance. This study unsupervisedly constructs a leading forest that forms another metric space, based on which it is convenient to define the fuzzy membership function to characterize central and divergent samples and select both types with fuzzy Xor logic. The labeling quota can, thus, be allocated adaptively among different classes. The proposed determinate labeling strategy can generally improve the performance for most SSLs. Especially, when combined with the kernelized large margin component analysis, it produces a novel semisupervised classification model. In addition, the multimodal issue in SSL is effectively addressed by the multigranular structure of leading forest that readily facilitates multiple local metrics learning. Extensive experimental results demonstrate that the proposed method achieved competitive efficiency and encouraging accuracy when compared with the state-of-the-art methods.","PeriodicalId":13212,"journal":{"name":"IEEE Transactions on Fuzzy Systems","volume":"33 5","pages":"1578-1591"},"PeriodicalIF":10.7000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Fuzzy Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10839088/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

The distribution of the labeled data can greatly affect the performance of a semisupervised learning (SSL) model. Most existing SSL models select the labeled data randomly and equally allocate the labeling quota among the classes, leading to considerable unstableness and degeneration of performance. This study unsupervisedly constructs a leading forest that forms another metric space, based on which it is convenient to define the fuzzy membership function to characterize central and divergent samples and select both types with fuzzy Xor logic. The labeling quota can, thus, be allocated adaptively among different classes. The proposed determinate labeling strategy can generally improve the performance for most SSLs. Especially, when combined with the kernelized large margin component analysis, it produces a novel semisupervised classification model. In addition, the multimodal issue in SSL is effectively addressed by the multigranular structure of leading forest that readily facilitates multiple local metrics learning. Extensive experimental results demonstrate that the proposed method achieved competitive efficiency and encouraging accuracy when compared with the state-of-the-art methods.
基于领先树度量空间的半监督学习中心和发散样本选择
标记数据的分布会极大地影响半监督学习(SSL)模型的性能。现有的SSL模型大多随机选择标记数据,并在类之间平均分配标记配额,这导致了相当大的不稳定性和性能下降。本研究无监督地构造了一个先导森林,该先导森林形成了另一个度量空间,在此基础上可以方便地定义模糊隶属函数来表征中心样本和发散样本,并利用模糊异或逻辑选择这两种类型。因此,标签配额可以在不同的类别之间自适应地分配。所提出的确定性标记策略通常可以提高大多数ssl的性能。特别地,当与核化大余量成分分析相结合时,产生了一种新的半监督分类模型。此外,领先森林的多颗粒结构可以有效地解决SSL中的多模态问题,从而容易地促进多个局部度量学习。大量的实验结果表明,与现有的方法相比,该方法具有竞争力的效率和令人鼓舞的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Fuzzy Systems
IEEE Transactions on Fuzzy Systems 工程技术-工程:电子与电气
CiteScore
20.50
自引率
13.40%
发文量
517
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Fuzzy Systems is a scholarly journal that focuses on the theory, design, and application of fuzzy systems. It aims to publish high-quality technical papers that contribute significant technical knowledge and exploratory developments in the field of fuzzy systems. The journal particularly emphasizes engineering systems and scientific applications. In addition to research articles, the Transactions also includes a letters section featuring current information, comments, and rebuttals related to published papers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信