基于k-均值的改进科技文献FAKM聚类方法

Baosheng Yin, Meishu Zhao
{"title":"基于k-均值的改进科技文献FAKM聚类方法","authors":"Baosheng Yin, Meishu Zhao","doi":"10.1117/12.2692027","DOIUrl":null,"url":null,"abstract":"Research on rapid clustering technology based on bibliographic information of scientific and technical literature aims to efficiently realize the correlation analysis of scientific and technical literature, laying the foundation for discovering hot spots and trends in the research field, conducting interdisciplinary and cross-border research, and accurately recommending scientific and technical literature. Focusing on the analysis of clustering algorithms, we proposed an improved k-meansbased Firefly Algorithm k-means (FAKM) clustering method, which effectively solved the problem of randomly selecting the initial center points of class cluster when using k-means algorithm for clustering in the clustering stage, which leads to local optimum, low accuracy and large gap between the division of class clusters and the real situation of clustering results. The use of FAKM clustering algorithm resulted in better clustering performance, high accuracy, and fewer iterations. The experimental results showed that the method achieved a silhouette coefficient of 0.54 and adjust mutual information of 0.69 on the same scientific and technical literature data set, which proved the good performance of the method.","PeriodicalId":361127,"journal":{"name":"International Conference on Images, Signals, and Computing","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved k-means-based FAKM clustering method for scientific and technical literature\",\"authors\":\"Baosheng Yin, Meishu Zhao\",\"doi\":\"10.1117/12.2692027\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Research on rapid clustering technology based on bibliographic information of scientific and technical literature aims to efficiently realize the correlation analysis of scientific and technical literature, laying the foundation for discovering hot spots and trends in the research field, conducting interdisciplinary and cross-border research, and accurately recommending scientific and technical literature. Focusing on the analysis of clustering algorithms, we proposed an improved k-meansbased Firefly Algorithm k-means (FAKM) clustering method, which effectively solved the problem of randomly selecting the initial center points of class cluster when using k-means algorithm for clustering in the clustering stage, which leads to local optimum, low accuracy and large gap between the division of class clusters and the real situation of clustering results. The use of FAKM clustering algorithm resulted in better clustering performance, high accuracy, and fewer iterations. The experimental results showed that the method achieved a silhouette coefficient of 0.54 and adjust mutual information of 0.69 on the same scientific and technical literature data set, which proved the good performance of the method.\",\"PeriodicalId\":361127,\"journal\":{\"name\":\"International Conference on Images, Signals, and Computing\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Images, Signals, and Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2692027\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Images, Signals, and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2692027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

基于科技文献书目信息的快速聚类技术研究旨在高效实现科技文献的相关性分析,为发现研究领域的热点和趋势、开展跨学科和跨界研究、准确推荐科技文献奠定基础。在对聚类算法进行分析的基础上,提出了一种改进的基于k-means的萤火虫算法k-means (FAKM)聚类方法,有效解决了在聚类阶段使用k-means算法进行聚类时随机选取类簇初始中心点的问题,导致类簇的划分存在局部最优、准确率低、与实际聚类结果差距大的问题。采用FAKM聚类算法,聚类性能更好,准确率高,迭代次数少。实验结果表明,该方法在同一科技文献数据集上的剪影系数为0.54,调整互信息为0.69,证明了该方法的良好性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Improved k-means-based FAKM clustering method for scientific and technical literature
Research on rapid clustering technology based on bibliographic information of scientific and technical literature aims to efficiently realize the correlation analysis of scientific and technical literature, laying the foundation for discovering hot spots and trends in the research field, conducting interdisciplinary and cross-border research, and accurately recommending scientific and technical literature. Focusing on the analysis of clustering algorithms, we proposed an improved k-meansbased Firefly Algorithm k-means (FAKM) clustering method, which effectively solved the problem of randomly selecting the initial center points of class cluster when using k-means algorithm for clustering in the clustering stage, which leads to local optimum, low accuracy and large gap between the division of class clusters and the real situation of clustering results. The use of FAKM clustering algorithm resulted in better clustering performance, high accuracy, and fewer iterations. The experimental results showed that the method achieved a silhouette coefficient of 0.54 and adjust mutual information of 0.69 on the same scientific and technical literature data set, which proved the good performance of the method.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信