{"title":"Comparative analysis of string similarity and corpus-based similarity for automatic essay scoring system on e-learning gamification","authors":"Eko Sakti Pramukantoro, M. Fauzi","doi":"10.1109/ICACSIS.2016.7872785","DOIUrl":null,"url":null,"abstract":"Essay assessment within e-learning need to be conducted manually by human expert. This process takes time and costly. Hence, automatic essay scoring is needed. Since the scoring system will be integrated to the e-learning, we need a computationally lightweight method that still does not rule out the accuracy of the assessment. In this paper, we propose an automatic scoring system for essay examination using unsupervised approaches. We compare and analyze two similarity measure methods, cosine similarity and latent semantic analysis. The parameters that was used to measure the performance of the methods are the computational complexity — measured by the amount of CPU and memory usage, and page load time — and accuracy — measured by Pearson Correlation and Mean Absolute Error. The results showed that both algorithm consumed same amount of memory. For CPU usage, LSA consumption is 0.13% and cosine's is 0.06%. For page load time, cosine similarity is faster than LSA which is 0.2 second and 0.5 second consecutively. Based on the correlation measure with Pearson, LSA is more superior to the cosine similarity by 0.59 to 0.49. LSA also has less MAE than cosine similarity which is 5.69 compared to 5.33. From that result, LSA and Cosine Similarity has a very competitive result in accuracy. However, Cosine has a better server performance so that preferred to be implemented in e-learning automatic essay scoring system.","PeriodicalId":267924,"journal":{"name":"2016 International Conference on Advanced Computer Science and Information Systems (ICACSIS)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Advanced Computer Science and Information Systems (ICACSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACSIS.2016.7872785","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 25
Abstract
Essay assessment within e-learning need to be conducted manually by human expert. This process takes time and costly. Hence, automatic essay scoring is needed. Since the scoring system will be integrated to the e-learning, we need a computationally lightweight method that still does not rule out the accuracy of the assessment. In this paper, we propose an automatic scoring system for essay examination using unsupervised approaches. We compare and analyze two similarity measure methods, cosine similarity and latent semantic analysis. The parameters that was used to measure the performance of the methods are the computational complexity — measured by the amount of CPU and memory usage, and page load time — and accuracy — measured by Pearson Correlation and Mean Absolute Error. The results showed that both algorithm consumed same amount of memory. For CPU usage, LSA consumption is 0.13% and cosine's is 0.06%. For page load time, cosine similarity is faster than LSA which is 0.2 second and 0.5 second consecutively. Based on the correlation measure with Pearson, LSA is more superior to the cosine similarity by 0.59 to 0.49. LSA also has less MAE than cosine similarity which is 5.69 compared to 5.33. From that result, LSA and Cosine Similarity has a very competitive result in accuracy. However, Cosine has a better server performance so that preferred to be implemented in e-learning automatic essay scoring system.