{"title":"基于多特征选择方法改进学生成绩预测","authors":"Cheng Ma, Baofeng Yao, Fang Ge, Yurong Pan, Youqiang Guo","doi":"10.1145/3141151.3141160","DOIUrl":null,"url":null,"abstract":"Recently, to provide the better education for students, there are a lot of researchers that discover the latent characteristics of students for predicting their performance. However, few existing work has explored the problem of extracting information from E-learning to get more precise and interpretable analysis. Based on the edX open data, we first predict whether the student will obtain the certificate, and take it as the criterion of student perfomance. Next, according to the requirement of the research, student features on dataset can be classified into three categories primarily, and some characters which seem umimportant intuitively have been removed already. Then, we adopt several kinds of feature selection approaches to extract important influencing student feature of the rest characters. Finally, though a few of existing classical machine learning methods, we build the model and predict student performance. The extensive experiments on the dataset of edX open platform we have conducted validated the effectiveness of predicting student performance.","PeriodicalId":165786,"journal":{"name":"Proceedings of the 2017 1st International Conference on E-Education, E-Business and E-Technology","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Improving Prediction of Student Performance based on Multiple Feature Selection Approaches\",\"authors\":\"Cheng Ma, Baofeng Yao, Fang Ge, Yurong Pan, Youqiang Guo\",\"doi\":\"10.1145/3141151.3141160\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, to provide the better education for students, there are a lot of researchers that discover the latent characteristics of students for predicting their performance. However, few existing work has explored the problem of extracting information from E-learning to get more precise and interpretable analysis. Based on the edX open data, we first predict whether the student will obtain the certificate, and take it as the criterion of student perfomance. Next, according to the requirement of the research, student features on dataset can be classified into three categories primarily, and some characters which seem umimportant intuitively have been removed already. Then, we adopt several kinds of feature selection approaches to extract important influencing student feature of the rest characters. Finally, though a few of existing classical machine learning methods, we build the model and predict student performance. The extensive experiments on the dataset of edX open platform we have conducted validated the effectiveness of predicting student performance.\",\"PeriodicalId\":165786,\"journal\":{\"name\":\"Proceedings of the 2017 1st International Conference on E-Education, E-Business and E-Technology\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2017 1st International Conference on E-Education, E-Business and E-Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3141151.3141160\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 1st International Conference on E-Education, E-Business and E-Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3141151.3141160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving Prediction of Student Performance based on Multiple Feature Selection Approaches
Recently, to provide the better education for students, there are a lot of researchers that discover the latent characteristics of students for predicting their performance. However, few existing work has explored the problem of extracting information from E-learning to get more precise and interpretable analysis. Based on the edX open data, we first predict whether the student will obtain the certificate, and take it as the criterion of student perfomance. Next, according to the requirement of the research, student features on dataset can be classified into three categories primarily, and some characters which seem umimportant intuitively have been removed already. Then, we adopt several kinds of feature selection approaches to extract important influencing student feature of the rest characters. Finally, though a few of existing classical machine learning methods, we build the model and predict student performance. The extensive experiments on the dataset of edX open platform we have conducted validated the effectiveness of predicting student performance.