Yoga Pristyanto, A. F. Nugraha, Akhmad Dahlan, Lucky Adhikrisna Wirasakti, Aditya Ahmad Zein, Irfan Pratama
{"title":"ADASYN过采样和堆叠算法的多类不平衡处理","authors":"Yoga Pristyanto, A. F. Nugraha, Akhmad Dahlan, Lucky Adhikrisna Wirasakti, Aditya Ahmad Zein, Irfan Pratama","doi":"10.1109/IMCOM53663.2022.9721632","DOIUrl":null,"url":null,"abstract":"Class imbalance conditions in datasets are common in real-world problems. Class imbalance is a condition where the number of classes in the dataset used in the classification process has a significant difference in number. In theory, most single classifiers have a weakness against class imbalance conditions in datasets, especially those with multiclass types, so their performance cannot be maximized. This study proposes two approaches to overcome the problem of multiclass imbalanced, namely the use of ADASYN (Adaptive Synthetic) Sampling and the Stacking Algorithm. As confirmed by testing on five multiclass datasets, the proposed method outperforms other methods in terms of accuracy values, sensitivity, specificity, and geometric mean values. As a result, the method proposed in this study can solve class imbalance problems in multiclass-type datasets. However, this study has limitations. Namely, the dataset used is a multiclass category with a maximum number of six classes. For this reason, further research will suggest testing using imbalanced class datasets in the category of multiclass datasets with more than six classes.","PeriodicalId":367038,"journal":{"name":"2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Multiclass Imbalanced Handling using ADASYN Oversampling and Stacking Algorithm\",\"authors\":\"Yoga Pristyanto, A. F. Nugraha, Akhmad Dahlan, Lucky Adhikrisna Wirasakti, Aditya Ahmad Zein, Irfan Pratama\",\"doi\":\"10.1109/IMCOM53663.2022.9721632\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Class imbalance conditions in datasets are common in real-world problems. Class imbalance is a condition where the number of classes in the dataset used in the classification process has a significant difference in number. In theory, most single classifiers have a weakness against class imbalance conditions in datasets, especially those with multiclass types, so their performance cannot be maximized. This study proposes two approaches to overcome the problem of multiclass imbalanced, namely the use of ADASYN (Adaptive Synthetic) Sampling and the Stacking Algorithm. As confirmed by testing on five multiclass datasets, the proposed method outperforms other methods in terms of accuracy values, sensitivity, specificity, and geometric mean values. As a result, the method proposed in this study can solve class imbalance problems in multiclass-type datasets. However, this study has limitations. Namely, the dataset used is a multiclass category with a maximum number of six classes. For this reason, further research will suggest testing using imbalanced class datasets in the category of multiclass datasets with more than six classes.\",\"PeriodicalId\":367038,\"journal\":{\"name\":\"2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM)\",\"volume\":\"98 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IMCOM53663.2022.9721632\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCOM53663.2022.9721632","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multiclass Imbalanced Handling using ADASYN Oversampling and Stacking Algorithm
Class imbalance conditions in datasets are common in real-world problems. Class imbalance is a condition where the number of classes in the dataset used in the classification process has a significant difference in number. In theory, most single classifiers have a weakness against class imbalance conditions in datasets, especially those with multiclass types, so their performance cannot be maximized. This study proposes two approaches to overcome the problem of multiclass imbalanced, namely the use of ADASYN (Adaptive Synthetic) Sampling and the Stacking Algorithm. As confirmed by testing on five multiclass datasets, the proposed method outperforms other methods in terms of accuracy values, sensitivity, specificity, and geometric mean values. As a result, the method proposed in this study can solve class imbalance problems in multiclass-type datasets. However, this study has limitations. Namely, the dataset used is a multiclass category with a maximum number of six classes. For this reason, further research will suggest testing using imbalanced class datasets in the category of multiclass datasets with more than six classes.