NV-Tree: nearest neighbors at the billion scale

Proceedings of the 1st ACM International Conference on Multimedia Retrieval Pub Date : 2011-04-18 DOI:10.1145/1991996.1992050

Herwig Lejsek, B. Jónsson, L. Amsaleg

引用次数: 47

Abstract

This paper presents the NV-Tree (Nearest Vector Tree). It addresses the specific, yet important, problem of efficiently and effectively finding the approximate k-nearest neighbors within a collection of a few billion high-dimensional data points. The NV-Tree is a very compact index, as only six bytes are kept in the index for each high-dimensional descriptor. It thus scales extremely well when indexing large collections of high-dimensional descriptors. The NV-Tree efficiently produces results of good quality, even at such a large scale that the indices cannot be kept entirely in main memory any more. We demonstrate this with extensive experiments using a collection of 2.5 billion SIFT (Scale Invariant Feature Transform) descriptors.

查看原文本刊更多论文

nv树:十亿尺度上的近邻

本文提出了最近向量树(NV-Tree)。它解决了在数十亿个高维数据点的集合中有效地找到近似k个最近的邻居的具体但重要的问题。NV-Tree是一个非常紧凑的索引，因为每个高维描述符的索引中只保留了6个字节。因此，在索引大型高维描述符集合时，它的伸缩性非常好。NV-Tree有效地产生高质量的结果，即使在如此大的规模下，索引不能完全保存在主存中。我们通过使用25亿个SIFT(尺度不变特征变换)描述符的集合进行了广泛的实验来证明这一点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 1st ACM International Conference on Multimedia Retrieval

自引率

0.00%

发文量