RobustiQ: A Robust ANN Search Method for Billion-scale Similarity Search on GPUs

Proceedings of the 2019 on International Conference on Multimedia Retrieval Pub Date : 2019-06-05 DOI:10.1145/3323873.3325018

Wei Chen, Jincai Chen, F. Zou, Yuan-Fang Li, Ping Lu, Wei Zhao

{"title":"RobustiQ: A Robust ANN Search Method for Billion-scale Similarity Search on GPUs","authors":"Wei Chen, Jincai Chen, F. Zou, Yuan-Fang Li, Ping Lu, Wei Zhao","doi":"10.1145/3323873.3325018","DOIUrl":null,"url":null,"abstract":"GPU-based methods represent state-of-the-art in approximate nearest neighbor (ANN) search, as they are scalable (billion-scale), accurate (high recall) as well as efficient (sub-millisecond query speed). Faiss, the representative GPU-based ANN system, achieves considerably faster query speed than the representative CPU-based systems. The query accuracy of Faiss critically depends on the number of indexing regions, which in turn is dependent on the amount of available memory. At the same time, query speed deteriorates dramatically with the increase in the number of partition regions. Thus, it can be observed that Faiss suffers from a lack of robustness, that the fine-grained partitioning of datasets is achieved at the expense of search speed, and vice versa. In this paper, we introduce a new GPU-based ANN search method, Robust Quantization (RobustiQ), that addresses the robustness limitations of existing GPU-based methods in a holistic way. We design a novel hierarchical indexing structure using vector and bilayer line quantization. This indexing structure, together with our indexing and encoding methods, allows RobustiQ to avoid the need for maintaining a large lookup table, hence reduces not only memory consumption but also query complexity. Our extensive evaluation on two public billion-scale benchmark datasets, SIFT1B and DEEP1B, shows that RobustiQ consistently obtains 2-3 × speedup over Faiss while achieving better query accuracy for different codebook sizes. Compared to the best CPU-based ANN systems, RobustiQ achieves even more pronounced average speedups of 51.8 × and 11 × respectively.","PeriodicalId":149041,"journal":{"name":"Proceedings of the 2019 on International Conference on Multimedia Retrieval","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 on International Conference on Multimedia Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3323873.3325018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

Abstract

GPU-based methods represent state-of-the-art in approximate nearest neighbor (ANN) search, as they are scalable (billion-scale), accurate (high recall) as well as efficient (sub-millisecond query speed). Faiss, the representative GPU-based ANN system, achieves considerably faster query speed than the representative CPU-based systems. The query accuracy of Faiss critically depends on the number of indexing regions, which in turn is dependent on the amount of available memory. At the same time, query speed deteriorates dramatically with the increase in the number of partition regions. Thus, it can be observed that Faiss suffers from a lack of robustness, that the fine-grained partitioning of datasets is achieved at the expense of search speed, and vice versa. In this paper, we introduce a new GPU-based ANN search method, Robust Quantization (RobustiQ), that addresses the robustness limitations of existing GPU-based methods in a holistic way. We design a novel hierarchical indexing structure using vector and bilayer line quantization. This indexing structure, together with our indexing and encoding methods, allows RobustiQ to avoid the need for maintaining a large lookup table, hence reduces not only memory consumption but also query complexity. Our extensive evaluation on two public billion-scale benchmark datasets, SIFT1B and DEEP1B, shows that RobustiQ consistently obtains 2-3 × speedup over Faiss while achieving better query accuracy for different codebook sizes. Compared to the best CPU-based ANN systems, RobustiQ achieves even more pronounced average speedups of 51.8 × and 11 × respectively.

查看原文本刊更多论文

鲁棒性神经网络搜索方法在gpu上的十亿级相似度搜索

基于gpu的方法代表了最先进的近似最近邻(ANN)搜索，因为它们具有可扩展性(十亿规模)、准确性(高召回率)和效率(亚毫秒级查询速度)。Faiss是具有代表性的基于gpu的人工神经网络系统，其查询速度明显快于具有代表性的基于cpu的系统。Faiss的查询准确性主要取决于索引区域的数量，而索引区域的数量又取决于可用内存的数量。同时，查询速度随着分区数量的增加而急剧下降。因此，可以观察到Faiss缺乏鲁棒性，数据集的细粒度分区是以牺牲搜索速度为代价实现的，反之亦然。在本文中，我们引入了一种新的基于gpu的人工神经网络搜索方法——鲁棒量化(Robust Quantization, RobustiQ)，它从整体上解决了现有基于gpu的方法的鲁棒性限制。利用向量和双层线量化设计了一种新的分层索引结构。这种索引结构，加上我们的索引和编码方法，使RobustiQ避免了维护大型查找表的需要，因此不仅减少了内存消耗，还降低了查询复杂性。我们对两个公开的十亿规模基准数据集SIFT1B和DEEP1B进行了广泛的评估，结果表明，在不同码本大小的情况下，roubustiq始终比Faiss获得2-3倍的加速，同时获得更好的查询精度。与最好的基于cpu的人工神经网络系统相比，roubustiq的平均速度分别达到了51.8倍和11倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2019 on International Conference on Multimedia Retrieval

自引率

0.00%

发文量