Ji Xu;Gang Ren;Jianhang Tang;Weiping Ding;Guoyin Wang
{"title":"Selecting Central and Divergent Samples via Leading Tree Metric Space for Semisupervised Learning","authors":"Ji Xu;Gang Ren;Jianhang Tang;Weiping Ding;Guoyin Wang","doi":"10.1109/TFUZZ.2025.3528400","DOIUrl":null,"url":null,"abstract":"The distribution of the labeled data can greatly affect the performance of a semisupervised learning (SSL) model. Most existing SSL models select the labeled data randomly and equally allocate the labeling quota among the classes, leading to considerable unstableness and degeneration of performance. This study unsupervisedly constructs a leading forest that forms another metric space, based on which it is convenient to define the fuzzy membership function to characterize central and divergent samples and select both types with fuzzy Xor logic. The labeling quota can, thus, be allocated adaptively among different classes. The proposed determinate labeling strategy can generally improve the performance for most SSLs. Especially, when combined with the kernelized large margin component analysis, it produces a novel semisupervised classification model. In addition, the multimodal issue in SSL is effectively addressed by the multigranular structure of leading forest that readily facilitates multiple local metrics learning. Extensive experimental results demonstrate that the proposed method achieved competitive efficiency and encouraging accuracy when compared with the state-of-the-art methods.","PeriodicalId":13212,"journal":{"name":"IEEE Transactions on Fuzzy Systems","volume":"33 5","pages":"1578-1591"},"PeriodicalIF":10.7000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Fuzzy Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10839088/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The distribution of the labeled data can greatly affect the performance of a semisupervised learning (SSL) model. Most existing SSL models select the labeled data randomly and equally allocate the labeling quota among the classes, leading to considerable unstableness and degeneration of performance. This study unsupervisedly constructs a leading forest that forms another metric space, based on which it is convenient to define the fuzzy membership function to characterize central and divergent samples and select both types with fuzzy Xor logic. The labeling quota can, thus, be allocated adaptively among different classes. The proposed determinate labeling strategy can generally improve the performance for most SSLs. Especially, when combined with the kernelized large margin component analysis, it produces a novel semisupervised classification model. In addition, the multimodal issue in SSL is effectively addressed by the multigranular structure of leading forest that readily facilitates multiple local metrics learning. Extensive experimental results demonstrate that the proposed method achieved competitive efficiency and encouraging accuracy when compared with the state-of-the-art methods.
期刊介绍:
The IEEE Transactions on Fuzzy Systems is a scholarly journal that focuses on the theory, design, and application of fuzzy systems. It aims to publish high-quality technical papers that contribute significant technical knowledge and exploratory developments in the field of fuzzy systems. The journal particularly emphasizes engineering systems and scientific applications. In addition to research articles, the Transactions also includes a letters section featuring current information, comments, and rebuttals related to published papers.