Clustering students based on grammatical errors for on-line education

Learning and Nonlinear Models Pub Date : 2018-06-30 DOI:10.21528/lnlm-vol16-no1-art2

M. Macedo, Elliackin M. N. Figueiredo, Fabiana Soares, H. Siqueira, A. M. A. Maciel, A. Gokhale, C. Bastos-Filho

{"title":"Clustering students based on grammatical errors for on-line education","authors":"M. Macedo, Elliackin M. N. Figueiredo, Fabiana Soares, H. Siqueira, A. M. A. Maciel, A. Gokhale, C. Bastos-Filho","doi":"10.21528/lnlm-vol16-no1-art2","DOIUrl":null,"url":null,"abstract":"Learning Management System (LMS) is an educational solution created for people who need flexibility regarding time and place. The problem of this kind of tool primarily concerns the difficulty in identifying which students have learned the content correctly. This paper aims to analyze the performance of a group of distance learning students regarding grammar errors in two different terms of an undergraduate course. Our hypothesis relies on the existence of different characteristics that emerge from subgroups of students with similar difficulties. This division can help tutors in educational platforms to develop specific recommendations tasks for each group of students. A previous work applied the well-known K-means algorithm to cluster the groups, but in that paper, we fixed the number of clusters. Therefore, we carried out a methodology to find the best number of clusters to be used in K-means for this problem. Moreover, we also applied the Fuzzy C-means to tackle the clustering problem and analyzed the results obtained by both algorithms using the well-known metrics in the literature (Gap Statistic and Davies-Bouldin) to assess the quality of the obtained groups. The experimental results showed that Fuzzy C-means approach outperforms the K-means algorithm. Moreover, the application of the Spearman Correlation on each group expose several differences, relations and similarities between groups and inside each one.","PeriodicalId":386768,"journal":{"name":"Learning and Nonlinear Models","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Learning and Nonlinear Models","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21528/lnlm-vol16-no1-art2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Learning Management System (LMS) is an educational solution created for people who need flexibility regarding time and place. The problem of this kind of tool primarily concerns the difficulty in identifying which students have learned the content correctly. This paper aims to analyze the performance of a group of distance learning students regarding grammar errors in two different terms of an undergraduate course. Our hypothesis relies on the existence of different characteristics that emerge from subgroups of students with similar difficulties. This division can help tutors in educational platforms to develop specific recommendations tasks for each group of students. A previous work applied the well-known K-means algorithm to cluster the groups, but in that paper, we fixed the number of clusters. Therefore, we carried out a methodology to find the best number of clusters to be used in K-means for this problem. Moreover, we also applied the Fuzzy C-means to tackle the clustering problem and analyzed the results obtained by both algorithms using the well-known metrics in the literature (Gap Statistic and Davies-Bouldin) to assess the quality of the obtained groups. The experimental results showed that Fuzzy C-means approach outperforms the K-means algorithm. Moreover, the application of the Spearman Correlation on each group expose several differences, relations and similarities between groups and inside each one.

查看原文本刊更多论文

基于语法错误的在线教学学生聚类

学习管理系统(LMS)是为需要灵活安排时间和地点的人而设计的一种教育解决方案。这种工具的问题主要在于难以确定哪些学生正确地学习了内容。本文旨在分析一组远程学习学生对本科课程中两个不同术语的语法错误的表现。我们的假设依赖于具有相似困难的学生子群体中出现的不同特征的存在。这种划分可以帮助教育平台的导师为每组学生制定具体的推荐任务。之前的工作应用了著名的K-means算法来聚类，但在那篇论文中，我们固定了聚类的数量。因此，我们执行了一种方法来找到K-means中用于此问题的最佳簇数。此外，我们还应用模糊C-means来解决聚类问题，并使用文献中众所周知的度量(Gap Statistic和Davies-Bouldin)来分析两种算法获得的结果，以评估获得的组的质量。实验结果表明，模糊c均值方法优于k均值算法。此外，Spearman相关在每个群体上的应用揭示了群体之间和群体内部的一些差异、关系和相似之处。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Learning and Nonlinear Models

自引率

0.00%

发文量