Lin Sun;Qifeng Zhang;Weiping Ding;Tianxiang Wang;Jiucheng Xu
{"title":"FCPFS: Fuzzy Granular Ball Clustering-Based Partial Multilabel Feature Selection With Fuzzy Mutual Information","authors":"Lin Sun;Qifeng Zhang;Weiping Ding;Tianxiang Wang;Jiucheng Xu","doi":"10.1109/TETCI.2024.3399665","DOIUrl":null,"url":null,"abstract":"In the partial multilabel learning, incorrect labels are annotated because of their low quality and poor recognition. To decrease secondary errors in partial multilabel classification, this paper proposes a novel fuzzy granular ball clustering-based partial multilabel feature selection scheme with fuzzy mutual information. First, to overcome the defect that the traditional granular ball model cannot be applied to partial multilabel classification and its splitting rules are anomalous and stochastic, an objective function is designed by the fuzzy membership degree, the splitting rules and termination conditions are redesigned, and a new fuzzy granular ball clustering method using fuzzy <italic>k</i>-means can be developed to preprocess partial multilabel data. Second, to reduce the impact of noise labels, the instance set of each granular ball is generated according to fuzzy granular ball clustering instead of neighborhood class, and the fuzzy similarity relationship between instances is constructed. Subsequently, granular ball-based fuzzy entropy measures and fuzzy mutual information and their properties are proposed in granular ball-based partial multilabel systems. Finally, the dependence and relevance between features and label sets are studied, the significance of features based on fuzzy mutual information is presented, and then a heuristic partial multilabel feature selection method is constructed to enhance the effect of partial multilabel data classification. Experiments on 18 partial multilabel datasets illustrate the availability of our method compared to other multilabel classification algorithms in its classification effect.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 1","pages":"590-606"},"PeriodicalIF":5.3000,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10545651/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In the partial multilabel learning, incorrect labels are annotated because of their low quality and poor recognition. To decrease secondary errors in partial multilabel classification, this paper proposes a novel fuzzy granular ball clustering-based partial multilabel feature selection scheme with fuzzy mutual information. First, to overcome the defect that the traditional granular ball model cannot be applied to partial multilabel classification and its splitting rules are anomalous and stochastic, an objective function is designed by the fuzzy membership degree, the splitting rules and termination conditions are redesigned, and a new fuzzy granular ball clustering method using fuzzy k-means can be developed to preprocess partial multilabel data. Second, to reduce the impact of noise labels, the instance set of each granular ball is generated according to fuzzy granular ball clustering instead of neighborhood class, and the fuzzy similarity relationship between instances is constructed. Subsequently, granular ball-based fuzzy entropy measures and fuzzy mutual information and their properties are proposed in granular ball-based partial multilabel systems. Finally, the dependence and relevance between features and label sets are studied, the significance of features based on fuzzy mutual information is presented, and then a heuristic partial multilabel feature selection method is constructed to enhance the effect of partial multilabel data classification. Experiments on 18 partial multilabel datasets illustrate the availability of our method compared to other multilabel classification algorithms in its classification effect.
期刊介绍:
The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys.
TETCI is an electronics only publication. TETCI publishes six issues per year.
Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.