Yoga Pristyanto, A. F. Nugraha, Akhmad Dahlan, Lucky Adhikrisna Wirasakti, Aditya Ahmad Zein, Irfan Pratama
{"title":"Multiclass Imbalanced Handling using ADASYN Oversampling and Stacking Algorithm","authors":"Yoga Pristyanto, A. F. Nugraha, Akhmad Dahlan, Lucky Adhikrisna Wirasakti, Aditya Ahmad Zein, Irfan Pratama","doi":"10.1109/IMCOM53663.2022.9721632","DOIUrl":null,"url":null,"abstract":"Class imbalance conditions in datasets are common in real-world problems. Class imbalance is a condition where the number of classes in the dataset used in the classification process has a significant difference in number. In theory, most single classifiers have a weakness against class imbalance conditions in datasets, especially those with multiclass types, so their performance cannot be maximized. This study proposes two approaches to overcome the problem of multiclass imbalanced, namely the use of ADASYN (Adaptive Synthetic) Sampling and the Stacking Algorithm. As confirmed by testing on five multiclass datasets, the proposed method outperforms other methods in terms of accuracy values, sensitivity, specificity, and geometric mean values. As a result, the method proposed in this study can solve class imbalance problems in multiclass-type datasets. However, this study has limitations. Namely, the dataset used is a multiclass category with a maximum number of six classes. For this reason, further research will suggest testing using imbalanced class datasets in the category of multiclass datasets with more than six classes.","PeriodicalId":367038,"journal":{"name":"2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCOM53663.2022.9721632","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Class imbalance conditions in datasets are common in real-world problems. Class imbalance is a condition where the number of classes in the dataset used in the classification process has a significant difference in number. In theory, most single classifiers have a weakness against class imbalance conditions in datasets, especially those with multiclass types, so their performance cannot be maximized. This study proposes two approaches to overcome the problem of multiclass imbalanced, namely the use of ADASYN (Adaptive Synthetic) Sampling and the Stacking Algorithm. As confirmed by testing on five multiclass datasets, the proposed method outperforms other methods in terms of accuracy values, sensitivity, specificity, and geometric mean values. As a result, the method proposed in this study can solve class imbalance problems in multiclass-type datasets. However, this study has limitations. Namely, the dataset used is a multiclass category with a maximum number of six classes. For this reason, further research will suggest testing using imbalanced class datasets in the category of multiclass datasets with more than six classes.