L. Zahrotun, Nila Hutami Putri, Arfiani Nur Khusna
{"title":"The Implementation of K-Means Clustering Method in Classifying Undergraduate Thesis Titles","authors":"L. Zahrotun, Nila Hutami Putri, Arfiani Nur Khusna","doi":"10.1109/TSSA.2018.8708817","DOIUrl":null,"url":null,"abstract":"One of graduation requirements at university is completing undergraduate thesis. At Industrial Engineering Universitas Ahmad Dahlan, undergraduate thesis titles are documented by thesis coordinator. The problem is that students are less knowledgeable on thesis topics, so they do not really know the previous students’ thesis topics. Based on the problem, this research aims at developing a program to classify thesis title so the knowledge on the trend of thesis title topic can be got The method used in this research was K-Means clustering, while range measurement method used was cosine similarity. The testing used Silhouette Coefficient method. The phases from text mining were tokenizing, filtering, stemming, similarity, classifying, testing. The result of this research is a program that can process the title data into trend group pattern of thesis title topic.From 138 data obtained, there are three clusters arranged based on the field on Industrial Engineering study program. Silhouette Coefficient testing shows score of 0.5674 that shows the clustering result is classified low. It occurs since the textual data of the thesis title is too widely distributed, so the title has relatively low similarity score","PeriodicalId":159795,"journal":{"name":"2018 12th International Conference on Telecommunication Systems, Services, and Applications (TSSA)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 12th International Conference on Telecommunication Systems, Services, and Applications (TSSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSSA.2018.8708817","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
One of graduation requirements at university is completing undergraduate thesis. At Industrial Engineering Universitas Ahmad Dahlan, undergraduate thesis titles are documented by thesis coordinator. The problem is that students are less knowledgeable on thesis topics, so they do not really know the previous students’ thesis topics. Based on the problem, this research aims at developing a program to classify thesis title so the knowledge on the trend of thesis title topic can be got The method used in this research was K-Means clustering, while range measurement method used was cosine similarity. The testing used Silhouette Coefficient method. The phases from text mining were tokenizing, filtering, stemming, similarity, classifying, testing. The result of this research is a program that can process the title data into trend group pattern of thesis title topic.From 138 data obtained, there are three clusters arranged based on the field on Industrial Engineering study program. Silhouette Coefficient testing shows score of 0.5674 that shows the clustering result is classified low. It occurs since the textual data of the thesis title is too widely distributed, so the title has relatively low similarity score