{"title":"Penerapan XGBoost untuk Seleksi Atribut pada K-Means dalam Clustering Penerima KIP Kuliah","authors":"Amiruddin Bengnga, Rezqiwati Ishak","doi":"10.37905/jjeee.v5i2.20253","DOIUrl":null,"url":null,"abstract":"Pada proses clustering prioritas penerima bantuan Kartu Indonesia Pintar Kuliah dengan algoritma K-Means ada beberapa masalah yang muncul yaitu masalah seleksi atribut yang penting dan penentuan nilai K yang optimum sehingga membuat proses clustering tidak maksimal dan tidak ideal. Masalah pemilihan atribut yang penting akan diselesaikan dengan menggunakan algoritma XGBoost yang terbukti dapat digunakan untuk memecahkan masalah seperti pada proses clustering prioritas penerima bantuan KIP Kuliah. Hasil penelitian menunjukkan bahwa algoritma XGBoost dapat menentukan 3 (tiga) atribut yang paling penting yaitu Pekerjaan Ayah, Penghasilan Ibu dan Luas Bangunan dari 12 (dua belas) atribut yang ada yaitu Pekerjaan Ayah, Pekerjaan Ibu, Penghasilan Ayah, Penghasilan Ibu, Jumlah Tanggungan, Kepemilikan Rumah, Sumber Listrik, Luas Tanah, Luas Bangunan, Sumber Air, MCK, Prestasi dan metode Elbow terbukti dapat menentukan nilai K yang optimum yaitu nilai K=4. Berdasarkan penggunaan 3 (tiga) atribut terbaik dan nilai K=4 sebagai nilai K optimum berhasil didapatkan clustering yang paling maksimal dan ideal dengan nilai index terkecil yaitu 0.819 dengan menggunakan metode pengujian Davies-Bouldin Index.In the process of clustering the priority of the recipient Indonesian smart school cards with the K-Means algorithm, there are several problems that arise, namely the problem of selecting important attributes and determining the optimal value of K, so that the process is not maximum and is not ideal. Important attribute selection problems will be solved using proven XGBoost algorithm that can be used to solve problems such as in the process of clustering the priority of recipients of school KIP assistance. The results of the research showed that the XGBoost algorithm can determine the 3 (three) most important attributes, namely Father’s Work, Mother’s Production and Building Size from the 12 (twelve) attributs that exist: Father's Job, Mothers’ Work, Fathers’ Income, Mothers’ Revenue, Number of Dependants, Home Ownership, Electrical Resources, Land Area, Building Area, Water Resource, MCK, Performance and Elbow Method proved to determine the optimal K value of K=4. Based on the use of the 3 (three) best attributes and the value of K = 4 as the optimal K value, the maximum and ideal clustering with the smallest index value is 0.819 using the Davies-Bouldin Index test method.","PeriodicalId":292481,"journal":{"name":"Jambura Journal of Electrical and Electronics Engineering","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jambura Journal of Electrical and Electronics Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37905/jjeee.v5i2.20253","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Pada proses clustering prioritas penerima bantuan Kartu Indonesia Pintar Kuliah dengan algoritma K-Means ada beberapa masalah yang muncul yaitu masalah seleksi atribut yang penting dan penentuan nilai K yang optimum sehingga membuat proses clustering tidak maksimal dan tidak ideal. Masalah pemilihan atribut yang penting akan diselesaikan dengan menggunakan algoritma XGBoost yang terbukti dapat digunakan untuk memecahkan masalah seperti pada proses clustering prioritas penerima bantuan KIP Kuliah. Hasil penelitian menunjukkan bahwa algoritma XGBoost dapat menentukan 3 (tiga) atribut yang paling penting yaitu Pekerjaan Ayah, Penghasilan Ibu dan Luas Bangunan dari 12 (dua belas) atribut yang ada yaitu Pekerjaan Ayah, Pekerjaan Ibu, Penghasilan Ayah, Penghasilan Ibu, Jumlah Tanggungan, Kepemilikan Rumah, Sumber Listrik, Luas Tanah, Luas Bangunan, Sumber Air, MCK, Prestasi dan metode Elbow terbukti dapat menentukan nilai K yang optimum yaitu nilai K=4. Berdasarkan penggunaan 3 (tiga) atribut terbaik dan nilai K=4 sebagai nilai K optimum berhasil didapatkan clustering yang paling maksimal dan ideal dengan nilai index terkecil yaitu 0.819 dengan menggunakan metode pengujian Davies-Bouldin Index.In the process of clustering the priority of the recipient Indonesian smart school cards with the K-Means algorithm, there are several problems that arise, namely the problem of selecting important attributes and determining the optimal value of K, so that the process is not maximum and is not ideal. Important attribute selection problems will be solved using proven XGBoost algorithm that can be used to solve problems such as in the process of clustering the priority of recipients of school KIP assistance. The results of the research showed that the XGBoost algorithm can determine the 3 (three) most important attributes, namely Father’s Work, Mother’s Production and Building Size from the 12 (twelve) attributs that exist: Father's Job, Mothers’ Work, Fathers’ Income, Mothers’ Revenue, Number of Dependants, Home Ownership, Electrical Resources, Land Area, Building Area, Water Resource, MCK, Performance and Elbow Method proved to determine the optimal K value of K=4. Based on the use of the 3 (three) best attributes and the value of K = 4 as the optimal K value, the maximum and ideal clustering with the smallest index value is 0.819 using the Davies-Bouldin Index test method.