A Parallel Rectangle Intersection Algorithm on GPU+CPU

2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing Pub Date : 2011-05-23 DOI:10.1109/CCGrid.2011.13

Shih-Hsiang Lo, Che-Rung Lee, Yeh-Ching Chung, I. Chung

{"title":"A Parallel Rectangle Intersection Algorithm on GPU+CPU","authors":"Shih-Hsiang Lo, Che-Rung Lee, Yeh-Ching Chung, I. Chung","doi":"10.1109/CCGrid.2011.13","DOIUrl":null,"url":null,"abstract":"In this paper, we investigate efficient algorithms and implementations using GPU plus CPU to solve the rectangle intersection problem on a plane. The problem is to report all intersecting pairs of iso-oriented rectangles, whose parallelization on GPUs poses two major computational challenges: data partition and the massive output. The algorithm we presented is called PRI-GC, Parallel Rectangle Intersection algorithm on GPU+CPU, which consists of two phases: mapping and intersection-checking. In the mapping phase, rectangles are hashed into different subspaces (called cells) to reduce the unnecessary intersection checking for far-apart rectangles. In the intersection-checking phase, pairs of rectangles within the same cell are examined in parallel, and the intersecting pairs of rectangles are reported. Several optimization techniques, including rectangles re-ordering, output data compressing/encoding, and the execution overlapping of GPU and CPU, are applied to enhance the performance. We had evaluated the performance of PRI-GC and the result shows over 30x speedup against two well-implemented sequential algorithms on single CPU. The effectiveness of each optimization technique for this problem was evaluated as well. Several parameters, including different degrees of rectangle coverage, different block sizes, and different cell sizes, were also experimented to explore their influences on the performance of PRI-GC.","PeriodicalId":376385,"journal":{"name":"2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCGrid.2011.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

In this paper, we investigate efficient algorithms and implementations using GPU plus CPU to solve the rectangle intersection problem on a plane. The problem is to report all intersecting pairs of iso-oriented rectangles, whose parallelization on GPUs poses two major computational challenges: data partition and the massive output. The algorithm we presented is called PRI-GC, Parallel Rectangle Intersection algorithm on GPU+CPU, which consists of two phases: mapping and intersection-checking. In the mapping phase, rectangles are hashed into different subspaces (called cells) to reduce the unnecessary intersection checking for far-apart rectangles. In the intersection-checking phase, pairs of rectangles within the same cell are examined in parallel, and the intersecting pairs of rectangles are reported. Several optimization techniques, including rectangles re-ordering, output data compressing/encoding, and the execution overlapping of GPU and CPU, are applied to enhance the performance. We had evaluated the performance of PRI-GC and the result shows over 30x speedup against two well-implemented sequential algorithms on single CPU. The effectiveness of each optimization technique for this problem was evaluated as well. Several parameters, including different degrees of rectangle coverage, different block sizes, and different cell sizes, were also experimented to explore their influences on the performance of PRI-GC.

查看原文本刊更多论文

基于GPU+CPU的并行矩形交点算法

在本文中，我们研究了使用GPU + CPU来解决平面上矩形相交问题的有效算法和实现。问题是报告所有相交的等向矩形对，其在gpu上的并行化提出了两个主要的计算挑战:数据分区和大量输出。本文提出的算法被称为PRI-GC，即GPU+CPU上的并行矩形相交算法，该算法包括两个阶段:映射和相交检查。在映射阶段，将矩形散列到不同的子空间(称为单元格)中，以减少对相距很远的矩形进行不必要的交叉检查。在相交检查阶段，对同一单元内的矩形对进行并行检查，并报告相交的矩形对。采用矩形重排序、输出数据压缩/编码、GPU和CPU执行重叠等优化技术来提高性能。我们已经对PRI-GC的性能进行了评估，结果显示，在单个CPU上，与两种实现良好的顺序算法相比，它的速度提高了30倍以上。对该问题的各种优化技术的有效性进行了评估。实验还探讨了不同矩形覆盖度、不同块大小和不同单元大小等参数对PRI-GC性能的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing

自引率

0.00%

发文量