{"title":"基于k-均值的改进科技文献FAKM聚类方法","authors":"Baosheng Yin, Meishu Zhao","doi":"10.1117/12.2692027","DOIUrl":null,"url":null,"abstract":"Research on rapid clustering technology based on bibliographic information of scientific and technical literature aims to efficiently realize the correlation analysis of scientific and technical literature, laying the foundation for discovering hot spots and trends in the research field, conducting interdisciplinary and cross-border research, and accurately recommending scientific and technical literature. Focusing on the analysis of clustering algorithms, we proposed an improved k-meansbased Firefly Algorithm k-means (FAKM) clustering method, which effectively solved the problem of randomly selecting the initial center points of class cluster when using k-means algorithm for clustering in the clustering stage, which leads to local optimum, low accuracy and large gap between the division of class clusters and the real situation of clustering results. The use of FAKM clustering algorithm resulted in better clustering performance, high accuracy, and fewer iterations. The experimental results showed that the method achieved a silhouette coefficient of 0.54 and adjust mutual information of 0.69 on the same scientific and technical literature data set, which proved the good performance of the method.","PeriodicalId":361127,"journal":{"name":"International Conference on Images, Signals, and Computing","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved k-means-based FAKM clustering method for scientific and technical literature\",\"authors\":\"Baosheng Yin, Meishu Zhao\",\"doi\":\"10.1117/12.2692027\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Research on rapid clustering technology based on bibliographic information of scientific and technical literature aims to efficiently realize the correlation analysis of scientific and technical literature, laying the foundation for discovering hot spots and trends in the research field, conducting interdisciplinary and cross-border research, and accurately recommending scientific and technical literature. Focusing on the analysis of clustering algorithms, we proposed an improved k-meansbased Firefly Algorithm k-means (FAKM) clustering method, which effectively solved the problem of randomly selecting the initial center points of class cluster when using k-means algorithm for clustering in the clustering stage, which leads to local optimum, low accuracy and large gap between the division of class clusters and the real situation of clustering results. The use of FAKM clustering algorithm resulted in better clustering performance, high accuracy, and fewer iterations. The experimental results showed that the method achieved a silhouette coefficient of 0.54 and adjust mutual information of 0.69 on the same scientific and technical literature data set, which proved the good performance of the method.\",\"PeriodicalId\":361127,\"journal\":{\"name\":\"International Conference on Images, Signals, and Computing\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Images, Signals, and Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2692027\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Images, Signals, and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2692027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved k-means-based FAKM clustering method for scientific and technical literature
Research on rapid clustering technology based on bibliographic information of scientific and technical literature aims to efficiently realize the correlation analysis of scientific and technical literature, laying the foundation for discovering hot spots and trends in the research field, conducting interdisciplinary and cross-border research, and accurately recommending scientific and technical literature. Focusing on the analysis of clustering algorithms, we proposed an improved k-meansbased Firefly Algorithm k-means (FAKM) clustering method, which effectively solved the problem of randomly selecting the initial center points of class cluster when using k-means algorithm for clustering in the clustering stage, which leads to local optimum, low accuracy and large gap between the division of class clusters and the real situation of clustering results. The use of FAKM clustering algorithm resulted in better clustering performance, high accuracy, and fewer iterations. The experimental results showed that the method achieved a silhouette coefficient of 0.54 and adjust mutual information of 0.69 on the same scientific and technical literature data set, which proved the good performance of the method.