使用k -均值和k -近邻计算组新闻文档

Sitti Arni, Syaharulla Disa
{"title":"使用k -均值和k -近邻计算组新闻文档","authors":"Sitti Arni, Syaharulla Disa","doi":"10.4108/EAI.2-5-2019.2284616","DOIUrl":null,"url":null,"abstract":"The aim of the study was to reduce the computational time of the document search by using the K-Means and K-Nearest Neighbor algorithm. The search result through search engines with display the documents according to the keywords entered. The number of documents that were displayed will make the user difficult to fine the required documents and it a long time for the display process of the documents. K-Means algorithm is used for the clustering documents obtained through online with search engine whereas algorithm K-Nearst Neighbor is used for grouping the clustering result document that are done with offline. The clustering with K-Means can reduce computational time on the news grouping by using K-Nearst Neighbor. The combination of two methods result an average time of 0.5011 seconds. Whereas the grouping that uses pure K-Nearst Neighbor requires the computational time 2.4841 seconds. The test result indicated that the combination of the two algorithms resulted accurate classification with the faster computation time.","PeriodicalId":355290,"journal":{"name":"Proceedings of the 1st International Conference on Science and Technology, ICOST 2019, 2-3 May, Makassar, Indonesia","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Computing Group News Documents Using K-Means and K-Nearest Neighbor\",\"authors\":\"Sitti Arni, Syaharulla Disa\",\"doi\":\"10.4108/EAI.2-5-2019.2284616\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The aim of the study was to reduce the computational time of the document search by using the K-Means and K-Nearest Neighbor algorithm. The search result through search engines with display the documents according to the keywords entered. The number of documents that were displayed will make the user difficult to fine the required documents and it a long time for the display process of the documents. K-Means algorithm is used for the clustering documents obtained through online with search engine whereas algorithm K-Nearst Neighbor is used for grouping the clustering result document that are done with offline. The clustering with K-Means can reduce computational time on the news grouping by using K-Nearst Neighbor. The combination of two methods result an average time of 0.5011 seconds. Whereas the grouping that uses pure K-Nearst Neighbor requires the computational time 2.4841 seconds. The test result indicated that the combination of the two algorithms resulted accurate classification with the faster computation time.\",\"PeriodicalId\":355290,\"journal\":{\"name\":\"Proceedings of the 1st International Conference on Science and Technology, ICOST 2019, 2-3 May, Makassar, Indonesia\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 1st International Conference on Science and Technology, ICOST 2019, 2-3 May, Makassar, Indonesia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4108/EAI.2-5-2019.2284616\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st International Conference on Science and Technology, ICOST 2019, 2-3 May, Makassar, Indonesia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4108/EAI.2-5-2019.2284616","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

研究的目的是利用k -均值和k -最近邻算法来减少文档搜索的计算时间。通过搜索引擎的搜索结果会根据输入的关键字显示文档。显示的文档数量多,用户难以找到所需的文档,并且文档的显示过程耗时长。K-Means算法用于搜索引擎在线获得的聚类文档,k - nearest Neighbor算法用于离线完成的聚类结果文档的分组。基于K-Means的聚类可以利用k -近邻来减少新闻分组的计算时间。两种方法的组合得到的平均时间为0.5011秒。而使用纯k近邻的分组则需要2.4841秒的计算时间。实验结果表明,两种算法结合使用,分类准确,计算速度快。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Computing Group News Documents Using K-Means and K-Nearest Neighbor
The aim of the study was to reduce the computational time of the document search by using the K-Means and K-Nearest Neighbor algorithm. The search result through search engines with display the documents according to the keywords entered. The number of documents that were displayed will make the user difficult to fine the required documents and it a long time for the display process of the documents. K-Means algorithm is used for the clustering documents obtained through online with search engine whereas algorithm K-Nearst Neighbor is used for grouping the clustering result document that are done with offline. The clustering with K-Means can reduce computational time on the news grouping by using K-Nearst Neighbor. The combination of two methods result an average time of 0.5011 seconds. Whereas the grouping that uses pure K-Nearst Neighbor requires the computational time 2.4841 seconds. The test result indicated that the combination of the two algorithms resulted accurate classification with the faster computation time.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信