{"title":"Research on Real-time Learning Prediction Method Based on Spark","authors":"Shao-lin Gong, X. Qin","doi":"10.1109/ICSESS47205.2019.9040787","DOIUrl":null,"url":null,"abstract":"Based on the research of real-time prediction and big data processing platform, an effective solution is proposed to solve the shortcomings of current real-time learning prediction in engineering application. By analyzing learners’ learning behaviors related to a certain course, learners’ learning behaviors can be divided into three categories in terms of time and space: online learning behaviors, offline learning behaviors and performance of relevant basic courses. Based on the parallel computation and binary logistic regression algorithm in Spark framework, the off-line learning prediction model is created. In the real-time environment, large scale real-time learning prediction can be realized based on Spark Streaming and kafka. With the increase of learning behaviors data, the scalability problem of prediction scheme can be solved by expanding Spark cluster nodes. The advantages of the proposed scheme have been verified in the practical application of smart campus.","PeriodicalId":203944,"journal":{"name":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS47205.2019.9040787","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Based on the research of real-time prediction and big data processing platform, an effective solution is proposed to solve the shortcomings of current real-time learning prediction in engineering application. By analyzing learners’ learning behaviors related to a certain course, learners’ learning behaviors can be divided into three categories in terms of time and space: online learning behaviors, offline learning behaviors and performance of relevant basic courses. Based on the parallel computation and binary logistic regression algorithm in Spark framework, the off-line learning prediction model is created. In the real-time environment, large scale real-time learning prediction can be realized based on Spark Streaming and kafka. With the increase of learning behaviors data, the scalability problem of prediction scheme can be solved by expanding Spark cluster nodes. The advantages of the proposed scheme have been verified in the practical application of smart campus.