Pedro David Netto Silveira, D. Cury, C. S. Menezes, Otávio Lube dos Santos
{"title":"Analysis of classifiers in a predictive model of academic success or failure for institutional and trace data","authors":"Pedro David Netto Silveira, D. Cury, C. S. Menezes, Otávio Lube dos Santos","doi":"10.1109/FIE43999.2019.9028618","DOIUrl":null,"url":null,"abstract":"This Research Full Paper presents recent research on the Educational Data Mining (EDM) field. In recent years, EDM has contributed significantly to the prevention of various challenges in academia. This paper presents an analysis of classifiers for a comparative study of EDM impact, using institutional data and trace data generated by a virtual learning environment to predict academic success/failure. For this, a model of educational data mining using logistic regression, support vector machine, naive bayes and J48 as classifiers and cross validation as a test method, was elaborated and was used to compare the prediction accuracy and the execution time of each classifier. The model was applied on a public dataset with 32,593 students, distributed among seven courses. The results on accuracy and execution time of each classifier allowed us to make recommendations on the suitability of using them. The results also revealed that it is better to separate the trace data from the institutional data in the model application regardless of the classifier. (Abstract)","PeriodicalId":6700,"journal":{"name":"2019 IEEE Frontiers in Education Conference (FIE)","volume":"31 1","pages":"1-8"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Frontiers in Education Conference (FIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FIE43999.2019.9028618","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
This Research Full Paper presents recent research on the Educational Data Mining (EDM) field. In recent years, EDM has contributed significantly to the prevention of various challenges in academia. This paper presents an analysis of classifiers for a comparative study of EDM impact, using institutional data and trace data generated by a virtual learning environment to predict academic success/failure. For this, a model of educational data mining using logistic regression, support vector machine, naive bayes and J48 as classifiers and cross validation as a test method, was elaborated and was used to compare the prediction accuracy and the execution time of each classifier. The model was applied on a public dataset with 32,593 students, distributed among seven courses. The results on accuracy and execution time of each classifier allowed us to make recommendations on the suitability of using them. The results also revealed that it is better to separate the trace data from the institutional data in the model application regardless of the classifier. (Abstract)