Zhiyong Li , Linqing Huang , Tianhao Gu , Qingkai Bu , Fuyu Qi , Jinfu Fan
{"title":"基于k -均值图转换器的部分多标签学习","authors":"Zhiyong Li , Linqing Huang , Tianhao Gu , Qingkai Bu , Fuyu Qi , Jinfu Fan","doi":"10.1016/j.knosys.2025.114017","DOIUrl":null,"url":null,"abstract":"<div><div>Partial Multi-Label Learning (PML) is a weakly supervised learning problem in which each instance is assigned a set of candidate labels, including relevant and irrelevant ones. The traditional label disambiguation strategies often rely on prior knowledge or auxiliary information and do not fully consider the complementary information between different label features. To tackle this challenge, we propose a K-means Graph Transformer for PML (PML-KGT), which introduces cluster centers and graph structure design to approximate label features. The goal of this process is to learn the features of different label classes through the complementary information from candidate labels of similar instances in the graph structure, while effective label features can accurately measure the relationship between instance and candidate labels, thus mitigating the influence of noisy labels. Additionally, we introduce a novel partial multi-label correction loss that determines candidate label weights based on evaluating the correlation between cluster centers and instances, thereby alleviating the effect of noise labels. As training progresses, the ground-truth labels are gradually recognized, and the improved cluster centers and labels contribute to enhancing the performance of the classifier. Comprehensive experiments on the real-world and synthetic PML datasets validate the advantage of the PML-KGT.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"325 ","pages":"Article 114017"},"PeriodicalIF":7.6000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Partial multi-label learning via K-means graph transformer\",\"authors\":\"Zhiyong Li , Linqing Huang , Tianhao Gu , Qingkai Bu , Fuyu Qi , Jinfu Fan\",\"doi\":\"10.1016/j.knosys.2025.114017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Partial Multi-Label Learning (PML) is a weakly supervised learning problem in which each instance is assigned a set of candidate labels, including relevant and irrelevant ones. The traditional label disambiguation strategies often rely on prior knowledge or auxiliary information and do not fully consider the complementary information between different label features. To tackle this challenge, we propose a K-means Graph Transformer for PML (PML-KGT), which introduces cluster centers and graph structure design to approximate label features. The goal of this process is to learn the features of different label classes through the complementary information from candidate labels of similar instances in the graph structure, while effective label features can accurately measure the relationship between instance and candidate labels, thus mitigating the influence of noisy labels. Additionally, we introduce a novel partial multi-label correction loss that determines candidate label weights based on evaluating the correlation between cluster centers and instances, thereby alleviating the effect of noise labels. As training progresses, the ground-truth labels are gradually recognized, and the improved cluster centers and labels contribute to enhancing the performance of the classifier. Comprehensive experiments on the real-world and synthetic PML datasets validate the advantage of the PML-KGT.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"325 \",\"pages\":\"Article 114017\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705125010627\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125010627","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Partial multi-label learning via K-means graph transformer
Partial Multi-Label Learning (PML) is a weakly supervised learning problem in which each instance is assigned a set of candidate labels, including relevant and irrelevant ones. The traditional label disambiguation strategies often rely on prior knowledge or auxiliary information and do not fully consider the complementary information between different label features. To tackle this challenge, we propose a K-means Graph Transformer for PML (PML-KGT), which introduces cluster centers and graph structure design to approximate label features. The goal of this process is to learn the features of different label classes through the complementary information from candidate labels of similar instances in the graph structure, while effective label features can accurately measure the relationship between instance and candidate labels, thus mitigating the influence of noisy labels. Additionally, we introduce a novel partial multi-label correction loss that determines candidate label weights based on evaluating the correlation between cluster centers and instances, thereby alleviating the effect of noise labels. As training progresses, the ground-truth labels are gradually recognized, and the improved cluster centers and labels contribute to enhancing the performance of the classifier. Comprehensive experiments on the real-world and synthetic PML datasets validate the advantage of the PML-KGT.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.