Pedro David Netto Silveira, D. Cury, Crediné Silva de Menezes, Otávio Lube dos Santos
{"title":"A Predictive Model of Academic Failure or Success for Institutional and Trace Data","authors":"Pedro David Netto Silveira, D. Cury, Crediné Silva de Menezes, Otávio Lube dos Santos","doi":"10.1109/LACLO49268.2019.00037","DOIUrl":null,"url":null,"abstract":"In recent years, Educational Data Mining (EDM) has contributed significantly to the prevention of various challenges in many sectors including the academia. This paper brings a comparative study of EDM impact, using institutional data and trace data generated by a virtual learning environment to predict academic failure or success. For this, we created a model of educational data mining using logistic regression as a classifier and cross validation as a test method. Logistic Regression is widely used as a classifier (or predictor) in educational data mining. The developed model was applied to a public data set of 32,593 students, distributed among seven courses. The results show that the application of the proposed model in the trace data favors excellent predictions, on average 42% better, in relation to using just the institutional data. The results also revealed that it is better to separate the trace data from the institutional data in the model application, since the gain in the prediction is negligible and the computational time spent is around four times greater if the both data sets are combined. In addition to contributing to the findings, we also believe that our methodology can be re-applied by researchers, educators and education managers.","PeriodicalId":229069,"journal":{"name":"2019 XIV Latin American Conference on Learning Technologies (LACLO)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 XIV Latin American Conference on Learning Technologies (LACLO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LACLO49268.2019.00037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In recent years, Educational Data Mining (EDM) has contributed significantly to the prevention of various challenges in many sectors including the academia. This paper brings a comparative study of EDM impact, using institutional data and trace data generated by a virtual learning environment to predict academic failure or success. For this, we created a model of educational data mining using logistic regression as a classifier and cross validation as a test method. Logistic Regression is widely used as a classifier (or predictor) in educational data mining. The developed model was applied to a public data set of 32,593 students, distributed among seven courses. The results show that the application of the proposed model in the trace data favors excellent predictions, on average 42% better, in relation to using just the institutional data. The results also revealed that it is better to separate the trace data from the institutional data in the model application, since the gain in the prediction is negligible and the computational time spent is around four times greater if the both data sets are combined. In addition to contributing to the findings, we also believe that our methodology can be re-applied by researchers, educators and education managers.