Distanceless label propagation: An efficient direct connected component labeling algorithm for GPUs

2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA) Pub Date : 2017-11-28 DOI:10.1109/IPTA.2017.8310147

Laurent Cabaret, L. Lacassagne, D. Etiemble

{"title":"Distanceless label propagation: An efficient direct connected component labeling algorithm for GPUs","authors":"Laurent Cabaret, L. Lacassagne, D. Etiemble","doi":"10.1109/IPTA.2017.8310147","DOIUrl":null,"url":null,"abstract":"Modern computer architectures are mainly composed of multi-core processors and GPUs. Consequently, solely providing a sequential implementation of algorithms or comparing algorithm performance without regard to architecture is no longer pertinent. Today, algorithms have to address parallelism, multithreading and memory topology (private/shared memory, cache or scratchpad, …). Most Connected Component Labeling (CCL) algorithms are sequential, direct and optimized for processors. Few were designed specifically for GPU architectures and none were designed to be adapted to different architectures. The most efficient GPU implementations are iterative; in order to manage synchronizations between processing units, but the number of iterations depends on the image shape and density. This paper describes the DLP (Distanceless Label Propagation) algorithms, an adaptable set of algorithms usable both on GPU and multi-core architectures, and DLP-GPU, an efficient direct CCL algorithm for GPU based on DLP mechanisms.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPTA.2017.8310147","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Modern computer architectures are mainly composed of multi-core processors and GPUs. Consequently, solely providing a sequential implementation of algorithms or comparing algorithm performance without regard to architecture is no longer pertinent. Today, algorithms have to address parallelism, multithreading and memory topology (private/shared memory, cache or scratchpad, …). Most Connected Component Labeling (CCL) algorithms are sequential, direct and optimized for processors. Few were designed specifically for GPU architectures and none were designed to be adapted to different architectures. The most efficient GPU implementations are iterative; in order to manage synchronizations between processing units, but the number of iterations depends on the image shape and density. This paper describes the DLP (Distanceless Label Propagation) algorithms, an adaptable set of algorithms usable both on GPU and multi-core architectures, and DLP-GPU, an efficient direct CCL algorithm for GPU based on DLP mechanisms.

查看原文本刊更多论文

无距离标签传播:一种高效的gpu直连组件标签算法

现代计算机体系结构主要由多核处理器和图形处理器组成。因此，仅仅提供算法的顺序实现或不考虑体系结构而比较算法性能不再相关。今天，算法必须解决并行性、多线程和内存拓扑(私有/共享内存、缓存或刮擦板等)。大多数连接组件标记(CCL)算法是顺序的，直接的和优化的处理器。很少有专门为GPU架构设计的，没有一个是为了适应不同的架构而设计的。最有效的GPU实现是迭代的;为了管理处理单元之间的同步，但迭代的次数取决于图像的形状和密度。本文介绍了DLP (distance - eless Label Propagation)算法和基于DLP机制的高效GPU直接CCL算法DLP-GPU。DLP算法是一套可用于GPU和多核架构的自适应算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)

自引率

0.00%

发文量