Li Liu, Jinrui Guo, Ziqi Yin, Rui Chen, Guojun Huang
{"title":"一种新的基于三向距离的模糊大余量分配机用于不平衡分类","authors":"Li Liu, Jinrui Guo, Ziqi Yin, Rui Chen, Guojun Huang","doi":"10.1007/s40747-025-01797-w","DOIUrl":null,"url":null,"abstract":"<p>Class imbalance is a prevalent issue in practical applications, which poses significant challenges for classifiers. The large margin distribution machine (LDM) introduces the margin distribution of samples to replace the traditional minimum margin, resulting in extensively enhanced classification performance. However, the hyperplane of LDM tends to be skewed toward the minority class, due to the optimization property for margin means. Moreover, the absence of non-deterministic options and measurement of the confidence level of samples further restricts the capability to manage uncertainty in imbalanced classification tasks. To solve these problems, we propose a novel three-way distance-based fuzzy large margin distribution machine (3W-DBFLDM). Specifically, we introduce a distance-based factor to mitigate the impact of sample size imbalance on classification results by increasing the distance weights of the minority class. Additionally, three-way decision model is introduced to deal with uncertainty, and the model’s robustness is further enhanced by utilizing the fuzzy membership degree that reflects the importance level of each input point. Comparative experiments conducted on UCI datasets demonstrate that the 3W-DBFLDM model surpasses other models in classification accuracy, stability, and robustness. Furthermore, the cost comparison experiment validate that the 3W-DBFLDM model reduces the overall decision cost.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"12 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel three-way distance-based fuzzy large margin distribution machine for imbalance classification\",\"authors\":\"Li Liu, Jinrui Guo, Ziqi Yin, Rui Chen, Guojun Huang\",\"doi\":\"10.1007/s40747-025-01797-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Class imbalance is a prevalent issue in practical applications, which poses significant challenges for classifiers. The large margin distribution machine (LDM) introduces the margin distribution of samples to replace the traditional minimum margin, resulting in extensively enhanced classification performance. However, the hyperplane of LDM tends to be skewed toward the minority class, due to the optimization property for margin means. Moreover, the absence of non-deterministic options and measurement of the confidence level of samples further restricts the capability to manage uncertainty in imbalanced classification tasks. To solve these problems, we propose a novel three-way distance-based fuzzy large margin distribution machine (3W-DBFLDM). Specifically, we introduce a distance-based factor to mitigate the impact of sample size imbalance on classification results by increasing the distance weights of the minority class. Additionally, three-way decision model is introduced to deal with uncertainty, and the model’s robustness is further enhanced by utilizing the fuzzy membership degree that reflects the importance level of each input point. Comparative experiments conducted on UCI datasets demonstrate that the 3W-DBFLDM model surpasses other models in classification accuracy, stability, and robustness. Furthermore, the cost comparison experiment validate that the 3W-DBFLDM model reduces the overall decision cost.</p>\",\"PeriodicalId\":10524,\"journal\":{\"name\":\"Complex & Intelligent Systems\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-02-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Complex & Intelligent Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s40747-025-01797-w\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01797-w","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
类不平衡是实际应用中普遍存在的一个问题,它给分类器带来了巨大的挑战。大边际分布机(large margin distribution machine, LDM)引入样本边际分布来取代传统的最小边际,极大地提高了分类性能。然而,由于边际均值的最优性,LDM的超平面倾向于向少数类倾斜。此外,缺乏非确定性选项和样本置信水平的测量进一步限制了在不平衡分类任务中管理不确定性的能力。为了解决这些问题,我们提出了一种新的基于三向距离的模糊大余量分配机(3W-DBFLDM)。具体来说,我们引入了一个基于距离的因子,通过增加少数类的距离权重来减轻样本量不平衡对分类结果的影响。引入三向决策模型处理不确定性,利用反映各输入点重要程度的模糊隶属度进一步增强模型的鲁棒性。在UCI数据集上进行的对比实验表明,3W-DBFLDM模型在分类精度、稳定性和鲁棒性方面优于其他模型。此外,成本对比实验验证了3W-DBFLDM模型降低了整体决策成本。
A novel three-way distance-based fuzzy large margin distribution machine for imbalance classification
Class imbalance is a prevalent issue in practical applications, which poses significant challenges for classifiers. The large margin distribution machine (LDM) introduces the margin distribution of samples to replace the traditional minimum margin, resulting in extensively enhanced classification performance. However, the hyperplane of LDM tends to be skewed toward the minority class, due to the optimization property for margin means. Moreover, the absence of non-deterministic options and measurement of the confidence level of samples further restricts the capability to manage uncertainty in imbalanced classification tasks. To solve these problems, we propose a novel three-way distance-based fuzzy large margin distribution machine (3W-DBFLDM). Specifically, we introduce a distance-based factor to mitigate the impact of sample size imbalance on classification results by increasing the distance weights of the minority class. Additionally, three-way decision model is introduced to deal with uncertainty, and the model’s robustness is further enhanced by utilizing the fuzzy membership degree that reflects the importance level of each input point. Comparative experiments conducted on UCI datasets demonstrate that the 3W-DBFLDM model surpasses other models in classification accuracy, stability, and robustness. Furthermore, the cost comparison experiment validate that the 3W-DBFLDM model reduces the overall decision cost.
期刊介绍:
Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.