{"title":"A New Cost-sensitive SVM Algorithm for Imbalanced Dataset","authors":"Zheng Hengyu","doi":"10.1109/ICCECE51280.2021.9342072","DOIUrl":null,"url":null,"abstract":"Support Vector Machine(SVM) is a popular machine learning algorithm for its excellent generalization ability. However, similar to most of traditional algorithms, the proposal of SVM is based on an assumption that the dataset is nearly balanced, and when SVM is applied in imbalanced dataset, the result may be bias towards majority class which leads to poor performance. To solve this problem, a new cost-sensitive SVM algorithm based on samples density are proposed in this paper. In the proposed algorithm, samples’ weights are depended on sample density estimated from Kernel Density Estimation(KDE) method, and furthermore, the samples’ weights are modified to enlarge the weights of border samples and reduce the weights of noise samples based on Support Vector Data Description(SVDD) algorithm. The experiments result shows that the proposed algorithm could achieve satisfactory performance.","PeriodicalId":229425,"journal":{"name":"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCECE51280.2021.9342072","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Support Vector Machine(SVM) is a popular machine learning algorithm for its excellent generalization ability. However, similar to most of traditional algorithms, the proposal of SVM is based on an assumption that the dataset is nearly balanced, and when SVM is applied in imbalanced dataset, the result may be bias towards majority class which leads to poor performance. To solve this problem, a new cost-sensitive SVM algorithm based on samples density are proposed in this paper. In the proposed algorithm, samples’ weights are depended on sample density estimated from Kernel Density Estimation(KDE) method, and furthermore, the samples’ weights are modified to enlarge the weights of border samples and reduce the weights of noise samples based on Support Vector Data Description(SVDD) algorithm. The experiments result shows that the proposed algorithm could achieve satisfactory performance.