{"title":"不平衡学习中基于数据集空间分布的过采样算法","authors":"Yiran Liu, Wanjiang Han, Xiaoxiang Wang, Qi Li","doi":"10.1109/ICCCS49078.2020.9118573","DOIUrl":null,"url":null,"abstract":"Imbalance problem is widespread in machine learning. Most learning algorithms can’t get satisfied performance when they are applied on imbalance data sets, because they can be deteriorated by this problem easily. This paper proposed SDSMOTE method which captures the spatial distribution of imbalance data sets, and changes the tendency of learning algorithm by over sampling by oversampling according to the recognition difficulty. Experiments on 5 UCI data sets validate the effectiveness of this oversampling algorithm.","PeriodicalId":105556,"journal":{"name":"2020 5th International Conference on Computer and Communication Systems (ICCCS)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Oversampling Algorithm Based on Spatial Distribution of Data Sets for Imbalance Learning\",\"authors\":\"Yiran Liu, Wanjiang Han, Xiaoxiang Wang, Qi Li\",\"doi\":\"10.1109/ICCCS49078.2020.9118573\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Imbalance problem is widespread in machine learning. Most learning algorithms can’t get satisfied performance when they are applied on imbalance data sets, because they can be deteriorated by this problem easily. This paper proposed SDSMOTE method which captures the spatial distribution of imbalance data sets, and changes the tendency of learning algorithm by over sampling by oversampling according to the recognition difficulty. Experiments on 5 UCI data sets validate the effectiveness of this oversampling algorithm.\",\"PeriodicalId\":105556,\"journal\":{\"name\":\"2020 5th International Conference on Computer and Communication Systems (ICCCS)\",\"volume\":\"65 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th International Conference on Computer and Communication Systems (ICCCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCS49078.2020.9118573\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Computer and Communication Systems (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCS49078.2020.9118573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Oversampling Algorithm Based on Spatial Distribution of Data Sets for Imbalance Learning
Imbalance problem is widespread in machine learning. Most learning algorithms can’t get satisfied performance when they are applied on imbalance data sets, because they can be deteriorated by this problem easily. This paper proposed SDSMOTE method which captures the spatial distribution of imbalance data sets, and changes the tendency of learning algorithm by over sampling by oversampling according to the recognition difficulty. Experiments on 5 UCI data sets validate the effectiveness of this oversampling algorithm.