Clustering students based on grammatical errors for on-line education

M. Macedo, Elliackin M. N. Figueiredo, Fabiana Soares, H. Siqueira, A. M. A. Maciel, A. Gokhale, C. Bastos-Filho
{"title":"Clustering students based on grammatical errors for on-line education","authors":"M. Macedo, Elliackin M. N. Figueiredo, Fabiana Soares, H. Siqueira, A. M. A. Maciel, A. Gokhale, C. Bastos-Filho","doi":"10.21528/lnlm-vol16-no1-art2","DOIUrl":null,"url":null,"abstract":"Learning Management System (LMS) is an educational solution created for people who need flexibility regarding time and place. The problem of this kind of tool primarily concerns the difficulty in identifying which students have learned the content correctly. This paper aims to analyze the performance of a group of distance learning students regarding grammar errors in two different terms of an undergraduate course. Our hypothesis relies on the existence of different characteristics that emerge from subgroups of students with similar difficulties. This division can help tutors in educational platforms to develop specific recommendations tasks for each group of students. A previous work applied the well-known K-means algorithm to cluster the groups, but in that paper, we fixed the number of clusters. Therefore, we carried out a methodology to find the best number of clusters to be used in K-means for this problem. Moreover, we also applied the Fuzzy C-means to tackle the clustering problem and analyzed the results obtained by both algorithms using the well-known metrics in the literature (Gap Statistic and Davies-Bouldin) to assess the quality of the obtained groups. The experimental results showed that Fuzzy C-means approach outperforms the K-means algorithm. Moreover, the application of the Spearman Correlation on each group expose several differences, relations and similarities between groups and inside each one.","PeriodicalId":386768,"journal":{"name":"Learning and Nonlinear Models","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Learning and Nonlinear Models","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21528/lnlm-vol16-no1-art2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Learning Management System (LMS) is an educational solution created for people who need flexibility regarding time and place. The problem of this kind of tool primarily concerns the difficulty in identifying which students have learned the content correctly. This paper aims to analyze the performance of a group of distance learning students regarding grammar errors in two different terms of an undergraduate course. Our hypothesis relies on the existence of different characteristics that emerge from subgroups of students with similar difficulties. This division can help tutors in educational platforms to develop specific recommendations tasks for each group of students. A previous work applied the well-known K-means algorithm to cluster the groups, but in that paper, we fixed the number of clusters. Therefore, we carried out a methodology to find the best number of clusters to be used in K-means for this problem. Moreover, we also applied the Fuzzy C-means to tackle the clustering problem and analyzed the results obtained by both algorithms using the well-known metrics in the literature (Gap Statistic and Davies-Bouldin) to assess the quality of the obtained groups. The experimental results showed that Fuzzy C-means approach outperforms the K-means algorithm. Moreover, the application of the Spearman Correlation on each group expose several differences, relations and similarities between groups and inside each one.
基于语法错误的在线教学学生聚类
学习管理系统(LMS)是为需要灵活安排时间和地点的人而设计的一种教育解决方案。这种工具的问题主要在于难以确定哪些学生正确地学习了内容。本文旨在分析一组远程学习学生对本科课程中两个不同术语的语法错误的表现。我们的假设依赖于具有相似困难的学生子群体中出现的不同特征的存在。这种划分可以帮助教育平台的导师为每组学生制定具体的推荐任务。之前的工作应用了著名的K-means算法来聚类,但在那篇论文中,我们固定了聚类的数量。因此,我们执行了一种方法来找到K-means中用于此问题的最佳簇数。此外,我们还应用模糊C-means来解决聚类问题,并使用文献中众所周知的度量(Gap Statistic和Davies-Bouldin)来分析两种算法获得的结果,以评估获得的组的质量。实验结果表明,模糊c均值方法优于k均值算法。此外,Spearman相关在每个群体上的应用揭示了群体之间和群体内部的一些差异、关系和相似之处。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信