{"title":"Applicability of K-medoids and K-means algorithms for segmenting students based on their scholastic performance","authors":"Usha Badhera, Apoorva Verma, P. Nahar","doi":"10.1080/09720510.2022.2130566","journal":{"name":"Journal of Statistics and Management Systems","volume":"14 1","pages":"0"},"publicationDate":"2022-10-03","publicationTypes":"Journal Article","isOpenAccess":false,"citationCount":"4","platform":"Semanticscholar","ListUrlMain":"https://doi.org/10.1080/09720510.2022.2130566"}
Applicability of K-medoids and K-means algorithms for segmenting students based on their scholastic performance
Abstract: In this paper, the literature was surveyed to identify the clustering techniques that researchers have recently used to predict academic performance. We observed that the K-means algorithm is particularly popular among researchers because of its simplicity and scalability, while other studies selected the K-medoids algorithm because it is less affected by outliers. On the basis of these observations, the two clustering algorithms were implemented in Python on a dataset of undergraduate students from a higher education institute. Two different clusters were obtained, which segment students based on their academic performance in the previous two exams. The clusters obtained have a high accuracy score. The K-medoids cluster centers take exact values of marks obtained by students, whereas each K-means centroid value is a rounded-off figure. The K-means clustering is also affected by the presence of outliers in the student dataset.
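The contrast the abstract draws between the two kinds of cluster center can be illustrated with a minimal sketch. It is not the authors' implementation: the marks below are hypothetical, the function names (`centroid`, `medoid`, `cluster`) are made up for illustration, the initial centers are fixed by hand, and the K-medoids update uses a simple alternating scheme rather than the swap-based PAM procedure. Only the Python standard library is assumed.

```python
from math import dist  # Euclidean distance, Python 3.8+


def centroid(points):
    """K-means center: the coordinate-wise mean of the cluster."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def medoid(points):
    """K-medoids center: the member with the smallest total distance to the rest."""
    return min(points, key=lambda c: sum(dist(c, p) for p in points))


def cluster(points, centers, update, max_iter=20):
    """Alternate between assigning points to the nearest center and updating centers."""
    for _ in range(max_iter):
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda j: dist(p, centers[j]))
            groups[nearest].append(p)
        # Keep the previous center if a group ends up empty.
        new_centers = [update(g) if g else c for g, c in zip(groups, centers)]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, groups


# Hypothetical (exam 1, exam 2) marks; NOT the dataset used in the paper.
low = [(35, 40), (40, 38), (42, 45), (38, 42)]
high = [(80, 85), (85, 82), (78, 88), (90, 86)]
outlier = (0, 0)  # assumed outlier, e.g. a student absent from both exams
start = [low[0], high[-1]]  # hand-picked initial centers, for reproducibility

clean = low + high
noisy = clean + [outlier]

kmeans_clean, _ = cluster(clean, start, centroid)
kmeans_noisy, _ = cluster(noisy, start, centroid)
kmedoids_clean, _ = cluster(clean, start, medoid)
kmedoids_noisy, _ = cluster(noisy, start, medoid)

print("K-means   low-cluster center:", kmeans_clean[0], "->", kmeans_noisy[0])
print("K-medoids low-cluster center:", kmedoids_clean[0], "->", kmedoids_noisy[0])
```

On this toy data, the K-means center of the lower-scoring cluster moves from (38.75, 41.25) to (31.0, 33.0) once the outlier is added, and neither value matches any student's marks. The K-medoids center stays at (38, 42), which is an actual student's pair of marks in both runs.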