{"title":"学习分析与教育数据集聚类算法的比较","authors":"Álvaro Martínez Navarro, P. Moreno-Ger","doi":"10.9781/ijimai.2018.02.003","DOIUrl":null,"url":null,"abstract":"Learning Analytics is becoming a key tool for the analysis and improvement of digital education processes, and \nits potential benefit grows with the size of the student cohorts generating data. In the context of Open Education, \nthe potentially massive student cohorts and the global audience represent a great opportunity for significant \nanalyses and breakthroughs in the field of learning analytics. However, these potentially huge datasets require \nproper analysis techniques, and different algorithms, tools and approaches may perform better in this specific \ncontext. In this work, we compare different clustering algorithms using an educational dataset. We start by \nidentifying the most relevant algorithms in Learning Analytics and benchmark them to determine, according to \ninternal validation and stability measurements, which algorithms perform better. We analyzed seven algorithms, \nand determined that K-means and PAM were the best performers among partition algorithms, and DIANA was \nthe best performer among hierarchical algorithms","PeriodicalId":143152,"journal":{"name":"Int. J. Interact. Multim. Artif. Intell.","volume":"89 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"32","resultStr":"{\"title\":\"Comparison of Clustering Algorithms for Learning Analytics with Educational Datasets\",\"authors\":\"Álvaro Martínez Navarro, P. Moreno-Ger\",\"doi\":\"10.9781/ijimai.2018.02.003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Learning Analytics is becoming a key tool for the analysis and improvement of digital education processes, and \\nits potential benefit grows with the size of the student cohorts generating data. In the context of Open Education, \\nthe potentially massive student cohorts and the global audience represent a great opportunity for significant \\nanalyses and breakthroughs in the field of learning analytics. However, these potentially huge datasets require \\nproper analysis techniques, and different algorithms, tools and approaches may perform better in this specific \\ncontext. In this work, we compare different clustering algorithms using an educational dataset. We start by \\nidentifying the most relevant algorithms in Learning Analytics and benchmark them to determine, according to \\ninternal validation and stability measurements, which algorithms perform better. We analyzed seven algorithms, \\nand determined that K-means and PAM were the best performers among partition algorithms, and DIANA was \\nthe best performer among hierarchical algorithms\",\"PeriodicalId\":143152,\"journal\":{\"name\":\"Int. J. Interact. Multim. Artif. Intell.\",\"volume\":\"89 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"32\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Interact. Multim. Artif. Intell.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.9781/ijimai.2018.02.003\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Interact. Multim. Artif. Intell.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9781/ijimai.2018.02.003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparison of Clustering Algorithms for Learning Analytics with Educational Datasets
Learning Analytics is becoming a key tool for the analysis and improvement of digital education processes, and
its potential benefit grows with the size of the student cohorts generating data. In the context of Open Education,
the potentially massive student cohorts and the global audience represent a great opportunity for significant
analyses and breakthroughs in the field of learning analytics. However, these potentially huge datasets require
proper analysis techniques, and different algorithms, tools and approaches may perform better in this specific
context. In this work, we compare different clustering algorithms using an educational dataset. We start by
identifying the most relevant algorithms in Learning Analytics and benchmark them to determine, according to
internal validation and stability measurements, which algorithms perform better. We analyzed seven algorithms,
and determined that K-means and PAM were the best performers among partition algorithms, and DIANA was
the best performer among hierarchical algorithms