使用k -均值和k -近邻计算组新闻文档

Proceedings of the 1st International Conference on Science and Technology, ICOST 2019, 2-3 May, Makassar, Indonesia Pub Date : 2019-06-14 DOI:10.4108/EAI.2-5-2019.2284616

Sitti Arni, Syaharulla Disa

{"title":"使用k -均值和k -近邻计算组新闻文档","authors":"Sitti Arni, Syaharulla Disa","doi":"10.4108/EAI.2-5-2019.2284616","DOIUrl":null,"url":null,"abstract":"The aim of the study was to reduce the computational time of the document search by using the K-Means and K-Nearest Neighbor algorithm. The search result through search engines with display the documents according to the keywords entered. The number of documents that were displayed will make the user difficult to fine the required documents and it a long time for the display process of the documents. K-Means algorithm is used for the clustering documents obtained through online with search engine whereas algorithm K-Nearst Neighbor is used for grouping the clustering result document that are done with offline. The clustering with K-Means can reduce computational time on the news grouping by using K-Nearst Neighbor. The combination of two methods result an average time of 0.5011 seconds. Whereas the grouping that uses pure K-Nearst Neighbor requires the computational time 2.4841 seconds. The test result indicated that the combination of the two algorithms resulted accurate classification with the faster computation time.","PeriodicalId":355290,"journal":{"name":"Proceedings of the 1st International Conference on Science and Technology, ICOST 2019, 2-3 May, Makassar, Indonesia","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Computing Group News Documents Using K-Means and K-Nearest Neighbor\",\"authors\":\"Sitti Arni, Syaharulla Disa\",\"doi\":\"10.4108/EAI.2-5-2019.2284616\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The aim of the study was to reduce the computational time of the document search by using the K-Means and K-Nearest Neighbor algorithm. The search result through search engines with display the documents according to the keywords entered. The number of documents that were displayed will make the user difficult to fine the required documents and it a long time for the display process of the documents. K-Means algorithm is used for the clustering documents obtained through online with search engine whereas algorithm K-Nearst Neighbor is used for grouping the clustering result document that are done with offline. The clustering with K-Means can reduce computational time on the news grouping by using K-Nearst Neighbor. The combination of two methods result an average time of 0.5011 seconds. Whereas the grouping that uses pure K-Nearst Neighbor requires the computational time 2.4841 seconds. The test result indicated that the combination of the two algorithms resulted accurate classification with the faster computation time.\",\"PeriodicalId\":355290,\"journal\":{\"name\":\"Proceedings of the 1st International Conference on Science and Technology, ICOST 2019, 2-3 May, Makassar, Indonesia\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 1st International Conference on Science and Technology, ICOST 2019, 2-3 May, Makassar, Indonesia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4108/EAI.2-5-2019.2284616\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st International Conference on Science and Technology, ICOST 2019, 2-3 May, Makassar, Indonesia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4108/EAI.2-5-2019.2284616","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

研究的目的是利用k -均值和k -最近邻算法来减少文档搜索的计算时间。通过搜索引擎的搜索结果会根据输入的关键字显示文档。显示的文档数量多，用户难以找到所需的文档，并且文档的显示过程耗时长。K-Means算法用于搜索引擎在线获得的聚类文档，k - nearest Neighbor算法用于离线完成的聚类结果文档的分组。基于K-Means的聚类可以利用k -近邻来减少新闻分组的计算时间。两种方法的组合得到的平均时间为0.5011秒。而使用纯k近邻的分组则需要2.4841秒的计算时间。实验结果表明，两种算法结合使用，分类准确，计算速度快。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Computing Group News Documents Using K-Means and K-Nearest Neighbor

The aim of the study was to reduce the computational time of the document search by using the K-Means and K-Nearest Neighbor algorithm. The search result through search engines with display the documents according to the keywords entered. The number of documents that were displayed will make the user difficult to fine the required documents and it a long time for the display process of the documents. K-Means algorithm is used for the clustering documents obtained through online with search engine whereas algorithm K-Nearst Neighbor is used for grouping the clustering result document that are done with offline. The clustering with K-Means can reduce computational time on the news grouping by using K-Nearst Neighbor. The combination of two methods result an average time of 0.5011 seconds. Whereas the grouping that uses pure K-Nearst Neighbor requires the computational time 2.4841 seconds. The test result indicated that the combination of the two algorithms resulted accurate classification with the faster computation time.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 1st International Conference on Science and Technology, ICOST 2019, 2-3 May, Makassar, Indonesia

自引率

0.00%

发文量