{"title":"An Improved Imbalanced Data Classification Algorithm Based on SVM","authors":"Ming Yan, Jun Wang, Dan-jiao Li, Jin Meng","doi":"10.1109/ICCSI55536.2022.9970637","DOIUrl":null,"url":null,"abstract":"When we deal with most real-world classification problems, the collected datasets are mostly imbalanced. Dataset imbalance means that the number of samples of a certain class greatly exceeds the number of samples of other classes in the dataset, but often a minority class is the main object of our research. When classifying imbalanced datasets, it is easy to misclassify the minority class samples with higher misclassification costs. Therefore, the classification of imbalanced datasets is one of the main difficulties in the field of data mining. In this paper, we propose a support vector machine (SVM) algorithm based on improved whale optimization algorithm, called SWOA-SVM. This algorithm introduces the social group optimization algorithm (SGO) to optimize the problem that the WOA algorithm is prone to premature maturity, and improves the optimization process of the WOA. The performance of SWOA-SVM has been evaluated with SVM and other improved algorithms on multiple commonly used imbalanced datasets, using AUC, Accuracy and G-mean as performance evaluation criteria. The experimental results show that the algorithm can effectively improve the recognition rate of positive samples when dealing with different experimental datasets, which verifies the effectiveness of the algorithm.","PeriodicalId":421514,"journal":{"name":"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSI55536.2022.9970637","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
When we deal with most real-world classification problems, the collected datasets are mostly imbalanced. Dataset imbalance means that the number of samples of a certain class greatly exceeds the number of samples of other classes in the dataset, but often a minority class is the main object of our research. When classifying imbalanced datasets, it is easy to misclassify the minority class samples with higher misclassification costs. Therefore, the classification of imbalanced datasets is one of the main difficulties in the field of data mining. In this paper, we propose a support vector machine (SVM) algorithm based on improved whale optimization algorithm, called SWOA-SVM. This algorithm introduces the social group optimization algorithm (SGO) to optimize the problem that the WOA algorithm is prone to premature maturity, and improves the optimization process of the WOA. The performance of SWOA-SVM has been evaluated with SVM and other improved algorithms on multiple commonly used imbalanced datasets, using AUC, Accuracy and G-mean as performance evaluation criteria. The experimental results show that the algorithm can effectively improve the recognition rate of positive samples when dealing with different experimental datasets, which verifies the effectiveness of the algorithm.