{"title":"Denoise-Based Over-Sampling for Imbalanced Data Classification","authors":"Wang Dan, L. Yian","doi":"10.1109/DCABES50732.2020.00078","DOIUrl":null,"url":null,"abstract":"Imbalanced data classification has always been a hot topic in traditional machine learning. The usual method is oversampling. Its main idea is to randomly synthesize the new minority samples between the minority samples and their neighboring samples, to put the data in a particular state of equilibrium. The existing improved methods have improved the classifier's performance to some extent, but most of the focus is on the minority sample. In this paper, a denoise-based oversampling method (DNOS) is proposed, which performs different denoise processes for the majority and minority samples. Then, it is combined with ADASYN to oversampling the data. Experimental results show that DNOS has a better classification effect than ADASYN.","PeriodicalId":351404,"journal":{"name":"2020 19th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES)","volume":"305 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 19th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DCABES50732.2020.00078","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Imbalanced data classification has always been a hot topic in traditional machine learning. The usual method is oversampling. Its main idea is to randomly synthesize the new minority samples between the minority samples and their neighboring samples, to put the data in a particular state of equilibrium. The existing improved methods have improved the classifier's performance to some extent, but most of the focus is on the minority sample. In this paper, a denoise-based oversampling method (DNOS) is proposed, which performs different denoise processes for the majority and minority samples. Then, it is combined with ADASYN to oversampling the data. Experimental results show that DNOS has a better classification effect than ADASYN.