Processing All k-Nearest Neighbor Query on Large Multidimensional Data

Huu Vu Lam Cao, T. Phan, Q. Minh, Thanh Luan Hong, M. S. Q. Truong
{"title":"Processing All k-Nearest Neighbor Query on Large Multidimensional Data","authors":"Huu Vu Lam Cao, T. Phan, Q. Minh, Thanh Luan Hong, M. S. Q. Truong","doi":"10.1109/ACOMP.2016.012","DOIUrl":null,"url":null,"abstract":"All k nearest neighbor (AkNN) query processing is a data processing problem which is important in many fields such as computer architecture, searching user information by coordinates, and city planning. Nowadays amount of data tends to grow in size and becomes huge. It is a major challenge that we need to face. Therefore, many traditional methods are no longer effective when dealing with the problem. Meanwhile, the method that processes distributed and parallel AkNN problem on MapReduce model and uses equal-cell-dividing technique is effective on multidimensional large dataset. However, when data is not equally distributed, the method becomes inefficient and even cannot be implemented. In this paper, we improve this method by applying a new cell-dividing technique. Instead of dividing the target space into cells which have the same size, we aim to divide it to cells in which the number of points are balanced, and there is not a cell that contains a large number of points. We also conduct experiments and compare the results produced by the old method and our method. Experimental results show that our method is more efficient and more stable.","PeriodicalId":133451,"journal":{"name":"2016 International Conference on Advanced Computing and Applications (ACOMP)","volume":"202 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Advanced Computing and Applications (ACOMP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACOMP.2016.012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

All k nearest neighbor (AkNN) query processing is a data processing problem which is important in many fields such as computer architecture, searching user information by coordinates, and city planning. Nowadays amount of data tends to grow in size and becomes huge. It is a major challenge that we need to face. Therefore, many traditional methods are no longer effective when dealing with the problem. Meanwhile, the method that processes distributed and parallel AkNN problem on MapReduce model and uses equal-cell-dividing technique is effective on multidimensional large dataset. However, when data is not equally distributed, the method becomes inefficient and even cannot be implemented. In this paper, we improve this method by applying a new cell-dividing technique. Instead of dividing the target space into cells which have the same size, we aim to divide it to cells in which the number of points are balanced, and there is not a cell that contains a large number of points. We also conduct experiments and compare the results produced by the old method and our method. Experimental results show that our method is more efficient and more stable.
大型多维数据的全k近邻查询处理
全k近邻查询处理(AkNN)是一种数据处理问题,在计算机体系结构、根据坐标搜索用户信息、城市规划等领域具有重要意义。如今,数据量趋于增长,变得巨大。这是我们需要面对的重大挑战。因此,许多传统的方法在处理这个问题时不再有效。同时,在MapReduce模型上处理分布式并行AkNN问题并使用等单元划分技术的方法在多维大数据集上是有效的。然而,当数据分布不均匀时,该方法效率低下,甚至无法实现。在本文中,我们采用一种新的细胞分裂技术来改进这种方法。我们的目标不是将目标空间划分为大小相同的单元,而是将其划分为点数平衡的单元,并且不存在包含大量点数的单元。我们还进行了实验,比较了旧方法和我们的方法得到的结果。实验结果表明,该方法具有更高的效率和稳定性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信