Spatial-aware data partition for distributed memory parallelization of ANN search in multimedia retrieval

IF 2.1 4区计算机科学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Parallel Computing Pub Date : 2023-02-01 DOI:10.1016/j.parco.2022.102992

Guilherme Andrade, Renato Ferreira, George Teodoro

{"title":"Spatial-aware data partition for distributed memory parallelization of ANN search in multimedia retrieval","authors":"Guilherme Andrade, Renato Ferreira, George Teodoro","doi":"10.1016/j.parco.2022.102992","DOIUrl":null,"url":null,"abstract":"<div><p>Content-based multimedia retrieval (CBMR) applications are becoming very popular in several online services which handles large volumes of data and are submitted to high query rates. While these applications may be complex, finding the nearest neighboring objects (multimedia descriptors) is typically their most time consuming operation. In order to address this problem, several recent works have proposed distributed memory parallelization of approximate nearest neighbors (ANN) search. These solutions employ a variety of ANN algorithms and different parallelization strategies. In this paper, we have identified the currently used parallelization strategies (Data Equal Split (DES) and Bucket Equal Split (BES)) and systematically evaluated their performance. We have also developed a framework to simplify the deployment of ANN algorithms in distributed memory machines with customized parallelization or data partition strategies. We further proposed a novel class of data partition/parallelization strategies that takes into account the data spatial proximity. Our approaches (SABES and SABES++) improves data locality and the system efficiency as compared to DES and BES. For instance, SABES++ achieved speedups of 4.2<span><math><mo>×</mo></math></span> and 1.8<span><math><mo>×</mo></math></span> on top of DES and BES, respectively, in the baseline case (40 nodes). Further, SABES and SABES++ also attained higher multi-node scalability and the gains vs DES and BES increase a larger number of nodes. SABES++ is 14.5<span><math><mo>×</mo></math></span> faster than DES when 160 nodes are used.</p></div>","PeriodicalId":54642,"journal":{"name":"Parallel Computing","volume":"115 ","pages":"Article 102992"},"PeriodicalIF":2.1000,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Parallel Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167819122000813","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}

引用次数: 3

Abstract

Content-based multimedia retrieval (CBMR) applications are becoming very popular in several online services which handles large volumes of data and are submitted to high query rates. While these applications may be complex, finding the nearest neighboring objects (multimedia descriptors) is typically their most time consuming operation. In order to address this problem, several recent works have proposed distributed memory parallelization of approximate nearest neighbors (ANN) search. These solutions employ a variety of ANN algorithms and different parallelization strategies. In this paper, we have identified the currently used parallelization strategies (Data Equal Split (DES) and Bucket Equal Split (BES)) and systematically evaluated their performance. We have also developed a framework to simplify the deployment of ANN algorithms in distributed memory machines with customized parallelization or data partition strategies. We further proposed a novel class of data partition/parallelization strategies that takes into account the data spatial proximity. Our approaches (SABES and SABES++) improves data locality and the system efficiency as compared to DES and BES. For instance, SABES++ achieved speedups of 4.2 $\times$ and 1.8 $\times$ on top of DES and BES, respectively, in the baseline case (40 nodes). Further, SABES and SABES++ also attained higher multi-node scalability and the gains vs DES and BES increase a larger number of nodes. SABES++ is 14.5 $\times$ faster than DES when 160 nodes are used.

查看原文本刊更多论文

多媒体检索中神经网络搜索分布式内存并行化的空间感知数据分区

基于内容的多媒体检索（CBMR）应用程序在处理大量数据并提交高查询率的几种在线服务中变得非常流行。虽然这些应用程序可能很复杂，但查找最近的相邻对象（多媒体描述符）通常是它们最耗时的操作。为了解决这个问题，最近的几项工作提出了近似最近邻（ANN）搜索的分布式存储器并行化。这些解决方案采用了各种ANN算法和不同的并行化策略。在本文中，我们确定了目前使用的并行化策略（数据相等分割（DES）和桶相等分割（BES）），并系统地评估了它们的性能。我们还开发了一个框架，通过定制的并行化或数据分割策略来简化分布式存储机中ANN算法的部署。我们进一步提出了一类新的数据划分/并行化策略，该策略考虑了数据的空间邻近性。与DES和BES相比，我们的方法（SABES和SABES++）提高了数据的局部性和系统效率。例如，在基线情况下（40个节点），SABES++在DES和BES之上分别实现了4.2倍和1.8倍的加速。此外，SABES和SABES++还获得了更高的多节点可扩展性，并且与DES和BES相比的增益增加了更多的节点数量。当使用160个节点时，SABES++比DES快14.5倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Parallel Computing 工程技术-计算机：理论方法

CiteScore

3.50

自引率

7.10%

发文量

审稿时长

4.5 months

期刊介绍： Parallel Computing is an international journal presenting the practical use of parallel computer systems, including high performance architecture, system software, programming systems and tools, and applications. Within this context the journal covers all aspects of high-end parallel computing from single homogeneous or heterogenous computing nodes to large-scale multi-node systems. Parallel Computing features original research work and review articles as well as novel or illustrative accounts of application experience with (and techniques for) the use of parallel computers. We also welcome studies reproducing prior publications that either confirm or disprove prior published results. Particular technical areas of interest include, but are not limited to: -System software for parallel computer systems including programming languages (new languages as well as compilation techniques), operating systems (including middleware), and resource management (scheduling and load-balancing). -Enabling software including debuggers, performance tools, and system and numeric libraries. -General hardware (architecture) concepts, new technologies enabling the realization of such new concepts, and details of commercially available systems -Software engineering and productivity as it relates to parallel computing -Applications (including scientific computing, deep learning, machine learning) or tool case studies demonstrating novel ways to achieve parallelism -Performance measurement results on state-of-the-art systems -Approaches to effectively utilize large-scale parallel computing including new algorithms or algorithm analysis with demonstrated relevance to real applications using existing or next generation parallel computer architectures. -Parallel I/O systems both hardware and software -Networking technology for support of high-speed computing demonstrating the impact of high-speed computation on parallel applications