{"title":"An efficient multiprocessor merge algorithm","authors":"P. Varman, B. Iyer, D. Haderle","doi":"10.1109/PARBSE.1990.77152","DOIUrl":null,"url":null,"abstract":"An efficient algorithm for merging two memory-resident sorted lists is described. The algorithm is based on a novel low-cost partitioning algorithm that is used to split the two lists among an arbitrary number of processors in a way that ensures load balance during the merge. The algorithm has direct applications in memory-resident databases, as well as for handling record pointers in disk-resident databases. It may be used for parallel sorting, table access using multiple indexes, and parallel sort-merge joins. A feature of the partitioning algorithm is that it may itself be parallelized efficiently; the parallel implementation reduces partitioning time, which may become significant if the number of processors gets large. If p is the number of processors and N is the total number of elements in both runs combined, the serial and parallel versions of the partitioning algorithm require time O(p log (N/p)), and O(log p+log(N/p)), respectively.<<ETX>>","PeriodicalId":389644,"journal":{"name":"Proceedings. PARBASE-90: International Conference on Databases, Parallel Architectures, and Their Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1990-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. PARBASE-90: International Conference on Databases, Parallel Architectures, and Their Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PARBSE.1990.77152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
An efficient algorithm for merging two memory-resident sorted lists is described. The algorithm is based on a novel low-cost partitioning algorithm that is used to split the two lists among an arbitrary number of processors in a way that ensures load balance during the merge. The algorithm has direct applications in memory-resident databases, as well as for handling record pointers in disk-resident databases. It may be used for parallel sorting, table access using multiple indexes, and parallel sort-merge joins. A feature of the partitioning algorithm is that it may itself be parallelized efficiently; the parallel implementation reduces partitioning time, which may become significant if the number of processors gets large. If p is the number of processors and N is the total number of elements in both runs combined, the serial and parallel versions of the partitioning algorithm require time O(p log (N/p)), and O(log p+log(N/p)), respectively.<>