{"title":"关联数据,数据挖掘和外部开放数据,以更好地预测有风险的学生","authors":"F. Sarker, T. Tiropanis, H. Davis","doi":"10.1109/CoDIT.2014.6996973","DOIUrl":null,"url":null,"abstract":"Research in student retention is traditionally survey-based, where researchers use questionnaires to collect student data to analyse and to develop student predictive model. The major issues with survey-based study are the potentially low response rates, time consuming and costly. Nevertheless, a large number of datasets that could inform the questions that students are explicitly asked in surveys is commonly available in the external open datasets. This paper describes a new student predictive model that uses commonly available external open data instead of traditional questionnaires/surveys to spot `at-risk' students. Considering the promising behavior of neural networks led us to develop student predictive models to predict `at-risk' students. The results of empirical study for undergraduate students in their first year of study shows that this model can perform as well as or even out-perform traditional survey-based ones. The prediction performance of this study was also compared with that of logistic regression approach. The results shows that neural network slightly improved the overall model accuracy however, according to the model sensitivity, it is suggested that logistic regression performs better for identifying `at-risk' students in their programme of study.","PeriodicalId":161703,"journal":{"name":"2014 International Conference on Control, Decision and Information Technologies (CoDIT)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"24","resultStr":"{\"title\":\"Linked data, data mining and external open data for better prediction of at-risk students\",\"authors\":\"F. Sarker, T. Tiropanis, H. Davis\",\"doi\":\"10.1109/CoDIT.2014.6996973\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Research in student retention is traditionally survey-based, where researchers use questionnaires to collect student data to analyse and to develop student predictive model. The major issues with survey-based study are the potentially low response rates, time consuming and costly. Nevertheless, a large number of datasets that could inform the questions that students are explicitly asked in surveys is commonly available in the external open datasets. This paper describes a new student predictive model that uses commonly available external open data instead of traditional questionnaires/surveys to spot `at-risk' students. Considering the promising behavior of neural networks led us to develop student predictive models to predict `at-risk' students. The results of empirical study for undergraduate students in their first year of study shows that this model can perform as well as or even out-perform traditional survey-based ones. The prediction performance of this study was also compared with that of logistic regression approach. The results shows that neural network slightly improved the overall model accuracy however, according to the model sensitivity, it is suggested that logistic regression performs better for identifying `at-risk' students in their programme of study.\",\"PeriodicalId\":161703,\"journal\":{\"name\":\"2014 International Conference on Control, Decision and Information Technologies (CoDIT)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"24\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Control, Decision and Information Technologies (CoDIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CoDIT.2014.6996973\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Control, Decision and Information Technologies (CoDIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CoDIT.2014.6996973","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Linked data, data mining and external open data for better prediction of at-risk students
Research in student retention is traditionally survey-based, where researchers use questionnaires to collect student data to analyse and to develop student predictive model. The major issues with survey-based study are the potentially low response rates, time consuming and costly. Nevertheless, a large number of datasets that could inform the questions that students are explicitly asked in surveys is commonly available in the external open datasets. This paper describes a new student predictive model that uses commonly available external open data instead of traditional questionnaires/surveys to spot `at-risk' students. Considering the promising behavior of neural networks led us to develop student predictive models to predict `at-risk' students. The results of empirical study for undergraduate students in their first year of study shows that this model can perform as well as or even out-perform traditional survey-based ones. The prediction performance of this study was also compared with that of logistic regression approach. The results shows that neural network slightly improved the overall model accuracy however, according to the model sensitivity, it is suggested that logistic regression performs better for identifying `at-risk' students in their programme of study.