{"title":"An Optimized K-Means Algorithm Based on FSTVM","authors":"Yanqiu Chen, Peili Sun","doi":"10.1109/ICVRIS.2018.00095","DOIUrl":null,"url":null,"abstract":"Aiming at the text document similarity and initial center point problems of K-means algorithm, a new model is proposed, in which used a new method of Frequency-Sorted Term Vector Model(FSTVM) to represent a document, and for reducing the dimension of a document designed an automatic method to filter commonly used words, and redesigned the similarity calculation formula and optimization algorithm for the initial center. Compared with the traditional algorithms, the new proposed algorithm can get initial centers with higher quality and steadier cluster results, Experimental results prove this clustering algorithm is simple and effective.","PeriodicalId":152317,"journal":{"name":"2018 International Conference on Virtual Reality and Intelligent Systems (ICVRIS)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Virtual Reality and Intelligent Systems (ICVRIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICVRIS.2018.00095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Aiming at the text document similarity and initial center point problems of K-means algorithm, a new model is proposed, in which used a new method of Frequency-Sorted Term Vector Model(FSTVM) to represent a document, and for reducing the dimension of a document designed an automatic method to filter commonly used words, and redesigned the similarity calculation formula and optimization algorithm for the initial center. Compared with the traditional algorithms, the new proposed algorithm can get initial centers with higher quality and steadier cluster results, Experimental results prove this clustering algorithm is simple and effective.