{"title":"ECC-CS: a multi-label data classification algorithm for class imbalance based on cost-sensitive learning","authors":"Jicong Duan, Hualong Yu","doi":"10.1117/12.2685783","DOIUrl":null,"url":null,"abstract":"Multi-label learning has attracted much attention due to its numerous applications in the real world. Although many multilabel learning algorithms have been proposed, most of them neglect the class imbalance problem that exists in multi-label data. Even when some studies focus on this problem, they often fail to explore label correlations. To address these two issues simultaneously, a cost-sensitive-based class imbalance multi-label data classification method called ECC-CS (Ensemble classifier chains - Cost sensitive) is proposed. In brief, ECC-CS combines the popular Ensemble Classifier Chain (ECC) algorithm with cost-sensitive learning techniques. Therefore, the ECC-CS algorithm simultaneously inherits the merit of label correlation exploration held by ECC and the merit of addressing the class imbalance problem owned by cost-sensitive learning techniques. Specifically, in ECC-CS, the cost-sensitive learning technique runs on each branch of ECC. 12 benchmark multi-label datasets were used to compare the proposed ECC-CS algorithm with several traditional class imbalance multi-label learning algorithms. The experimental results have indicated the effectiveness and superiority of the proposed ECC-CS algorithm.","PeriodicalId":305812,"journal":{"name":"International Conference on Electronic Information Technology","volume":"164 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Electronic Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2685783","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Multi-label learning has attracted much attention due to its numerous applications in the real world. Although many multilabel learning algorithms have been proposed, most of them neglect the class imbalance problem that exists in multi-label data. Even when some studies focus on this problem, they often fail to explore label correlations. To address these two issues simultaneously, a cost-sensitive-based class imbalance multi-label data classification method called ECC-CS (Ensemble classifier chains - Cost sensitive) is proposed. In brief, ECC-CS combines the popular Ensemble Classifier Chain (ECC) algorithm with cost-sensitive learning techniques. Therefore, the ECC-CS algorithm simultaneously inherits the merit of label correlation exploration held by ECC and the merit of addressing the class imbalance problem owned by cost-sensitive learning techniques. Specifically, in ECC-CS, the cost-sensitive learning technique runs on each branch of ECC. 12 benchmark multi-label datasets were used to compare the proposed ECC-CS algorithm with several traditional class imbalance multi-label learning algorithms. The experimental results have indicated the effectiveness and superiority of the proposed ECC-CS algorithm.