{"title":"Relevant factors and classification of student alcohol consumption","authors":"Auth Pisutaporn, Burit Chonvirachkul, D. Sutivong","doi":"10.1109/ICIRD.2018.8376297","DOIUrl":null,"url":null,"abstract":"Educational data mining is the process of applying data mining tools and techniques to analyze data for educational purpose. This paper carries out educational data mining to study the student alcohol consumption through a public dataset which includes student attributes and their grades. The decision tree algorithm and the random forest algorithm are applied to perform classification and to analyze the variable importance. The regression model is then employed to illustrate the relationship between alcohol consumption level and the students' final grades. Our analysis provides knowledge on the relationship between student characteristics and alcohol consumption. The study also compares performance of the decision tree algorithm and the random forest algorithm.","PeriodicalId":397098,"journal":{"name":"2018 IEEE International Conference on Innovative Research and Development (ICIRD)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Innovative Research and Development (ICIRD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIRD.2018.8376297","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Educational data mining is the process of applying data mining tools and techniques to analyze data for educational purpose. This paper carries out educational data mining to study the student alcohol consumption through a public dataset which includes student attributes and their grades. The decision tree algorithm and the random forest algorithm are applied to perform classification and to analyze the variable importance. The regression model is then employed to illustrate the relationship between alcohol consumption level and the students' final grades. Our analysis provides knowledge on the relationship between student characteristics and alcohol consumption. The study also compares performance of the decision tree algorithm and the random forest algorithm.