{"title":"Hybrid algorithms for list ranking and graph connected components","authors":"D. Banerjee, Kishore Kothapalli","doi":"10.1109/HiPC.2011.6152655","DOIUrl":null,"url":null,"abstract":"The advent of multicore and many-core architectures saw them being deployed to speed-up computations across several disciplines and application areas. Prominent examples include semi-numerical algorithms such as sorting, graph algorithms, image processing, scientific computations, and the like. In particular, using GPUs for general purpose computations has attracted a lot of attention given that GPUs can deliver more than one TFLOP of computing power at very low prices. In this work, we use a new model of multicore computing called hybrid multicore computing where the computation is performed simultaneously a control device, such as a CPU, and an accelerator such as a GPU. To this end, we use two case studies to explore the algorithmic and analytical issues in hybrid multicore computing. Our case studies involve two different ways of designing hybrid multicore algorithms. The main contribution of this paper is to address the issues related to the design of hybrid solutions. We show our hybrid algorithm for list ranking is faster by 50% compared to the best known implementation [Z. Wei, J. JaJa; IPDPS 2010]. Similarly, our hybrid algorithm for graph connected components is faster by 25% compared to the best known GPU implementation [26].","PeriodicalId":122468,"journal":{"name":"2011 18th International Conference on High Performance Computing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 18th International Conference on High Performance Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HiPC.2011.6152655","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 22
Abstract
The advent of multicore and many-core architectures saw them being deployed to speed-up computations across several disciplines and application areas. Prominent examples include semi-numerical algorithms such as sorting, graph algorithms, image processing, scientific computations, and the like. In particular, using GPUs for general purpose computations has attracted a lot of attention given that GPUs can deliver more than one TFLOP of computing power at very low prices. In this work, we use a new model of multicore computing called hybrid multicore computing where the computation is performed simultaneously a control device, such as a CPU, and an accelerator such as a GPU. To this end, we use two case studies to explore the algorithmic and analytical issues in hybrid multicore computing. Our case studies involve two different ways of designing hybrid multicore algorithms. The main contribution of this paper is to address the issues related to the design of hybrid solutions. We show our hybrid algorithm for list ranking is faster by 50% compared to the best known implementation [Z. Wei, J. JaJa; IPDPS 2010]. Similarly, our hybrid algorithm for graph connected components is faster by 25% compared to the best known GPU implementation [26].
随着多核和多核架构的出现,它们被部署到多个学科和应用领域来加速计算。突出的例子包括半数值算法,如排序、图算法、图像处理、科学计算等。特别是,使用gpu进行通用计算已经引起了很多关注,因为gpu可以以非常低的价格提供超过1 TFLOP的计算能力。在这项工作中,我们使用了一种新的多核计算模型,称为混合多核计算,其中计算同时由控制设备(如CPU)和加速器(如GPU)执行。为此,我们使用两个案例研究来探讨混合多核计算中的算法和分析问题。我们的案例研究涉及设计混合多核算法的两种不同方式。本文的主要贡献是解决与混合解决方案设计相关的问题。我们展示了我们的混合列表排序算法比最著名的实现[Z]快50%。Wei, J. JaJa;IPDPS 2010]。同样,我们的图连接组件混合算法比最著名的GPU实现[26]快25%。