Wishnu Dwi Herlambang, K. A. Laksitowening, I. Asror
{"title":"利用Naïve贝叶斯算法和主成分分析(PCA)对时间序列数据的毕业预测","authors":"Wishnu Dwi Herlambang, K. A. Laksitowening, I. Asror","doi":"10.1109/ICoICT52021.2021.9527443","DOIUrl":null,"url":null,"abstract":"The percentage of students who graduated on time can be predicted with data mining methods. This research aims to provide earlier information regarding students who are at risk of not graduating on time. Thus, the study program can take appropriate action before it is too late. Several classification methods can be used for prediction. Our research combines Naïve Bayes with Principal Component Analysis (PCA). PCA is used to simplify complex academic data. The PCA result has a more straightforward structure to be processed using Naive Bayes classification. This research uses four batches of student academic performance data in Informatics Study Program, Telkom University. The dataset is partitioned by academic year to obtain time-series data of each student. The combination of PCA and Naïve Bayes algorithms obtained better results than classification using Naïve Bayes only, with 6.04% higher accuracy on average.","PeriodicalId":191671,"journal":{"name":"2021 9th International Conference on Information and Communication Technology (ICoICT)","volume":"1021 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Prediction of Graduation with Naïve Bayes Algorithm and Principal Component Analysis (PCA) on Time Series Data\",\"authors\":\"Wishnu Dwi Herlambang, K. A. Laksitowening, I. Asror\",\"doi\":\"10.1109/ICoICT52021.2021.9527443\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The percentage of students who graduated on time can be predicted with data mining methods. This research aims to provide earlier information regarding students who are at risk of not graduating on time. Thus, the study program can take appropriate action before it is too late. Several classification methods can be used for prediction. Our research combines Naïve Bayes with Principal Component Analysis (PCA). PCA is used to simplify complex academic data. The PCA result has a more straightforward structure to be processed using Naive Bayes classification. This research uses four batches of student academic performance data in Informatics Study Program, Telkom University. The dataset is partitioned by academic year to obtain time-series data of each student. The combination of PCA and Naïve Bayes algorithms obtained better results than classification using Naïve Bayes only, with 6.04% higher accuracy on average.\",\"PeriodicalId\":191671,\"journal\":{\"name\":\"2021 9th International Conference on Information and Communication Technology (ICoICT)\",\"volume\":\"1021 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 9th International Conference on Information and Communication Technology (ICoICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICoICT52021.2021.9527443\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 9th International Conference on Information and Communication Technology (ICoICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICoICT52021.2021.9527443","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Prediction of Graduation with Naïve Bayes Algorithm and Principal Component Analysis (PCA) on Time Series Data
The percentage of students who graduated on time can be predicted with data mining methods. This research aims to provide earlier information regarding students who are at risk of not graduating on time. Thus, the study program can take appropriate action before it is too late. Several classification methods can be used for prediction. Our research combines Naïve Bayes with Principal Component Analysis (PCA). PCA is used to simplify complex academic data. The PCA result has a more straightforward structure to be processed using Naive Bayes classification. This research uses four batches of student academic performance data in Informatics Study Program, Telkom University. The dataset is partitioned by academic year to obtain time-series data of each student. The combination of PCA and Naïve Bayes algorithms obtained better results than classification using Naïve Bayes only, with 6.04% higher accuracy on average.