Scheduling Irregular Dataflow Pipelines on SIMD Architectures

Proceedings of the 2020 Sixth Workshop on Programming Models for SIMD/Vector Processing Pub Date : 2020-02-22 DOI:10.1145/3380479.3380480

Tom Plano, J. Buhler

{"title":"Scheduling Irregular Dataflow Pipelines on SIMD Architectures","authors":"Tom Plano, J. Buhler","doi":"10.1145/3380479.3380480","DOIUrl":null,"url":null,"abstract":"Streaming computations often exhibit substantial data parallelism that makes them well-suited to SIMD architectures. However, many such computations also exhibit irregularity, in the form of data-dependent, dynamic data rates, that makes efficient SIMD execution challenging. One aspect of this challenge is the need to schedule execution of a computation realized as a pipeline of stages connected by finite queues. A scheduler must both ensure high SIMD occupancy by gathering queued items into vectors and minimize costs associated with switching execution between stages. In this work, we present the AFIE (Active Full, Inactive Empty) scheduling policy for irregular streaming applications on SIMD processors. AFIE provably groups inputs to each stage of a pipeline into a minimal number of SIMD vectors while incurring a bounded number of switches relative to the best possible policy. These results apply even though irregularity forbids a priori knowledge of how many outputs will be generated from each input to each stage. We have implemented AFIE as an extension to the MERCATOR system [6] for building irregular streaming applications on NVIDIA GPUs. We describe how the AFIE scheduler simplifies MERCATOR's runtime code and empirically measure the new scheduler's improved performance on irregular streaming applications.","PeriodicalId":164160,"journal":{"name":"Proceedings of the 2020 Sixth Workshop on Programming Models for SIMD/Vector Processing","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 Sixth Workshop on Programming Models for SIMD/Vector Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3380479.3380480","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Streaming computations often exhibit substantial data parallelism that makes them well-suited to SIMD architectures. However, many such computations also exhibit irregularity, in the form of data-dependent, dynamic data rates, that makes efficient SIMD execution challenging. One aspect of this challenge is the need to schedule execution of a computation realized as a pipeline of stages connected by finite queues. A scheduler must both ensure high SIMD occupancy by gathering queued items into vectors and minimize costs associated with switching execution between stages. In this work, we present the AFIE (Active Full, Inactive Empty) scheduling policy for irregular streaming applications on SIMD processors. AFIE provably groups inputs to each stage of a pipeline into a minimal number of SIMD vectors while incurring a bounded number of switches relative to the best possible policy. These results apply even though irregularity forbids a priori knowledge of how many outputs will be generated from each input to each stage. We have implemented AFIE as an extension to the MERCATOR system [6] for building irregular streaming applications on NVIDIA GPUs. We describe how the AFIE scheduler simplifies MERCATOR's runtime code and empirically measure the new scheduler's improved performance on irregular streaming applications.

查看原文本刊更多论文

SIMD架构下的不规则数据流管道调度

流计算通常表现出大量的数据并行性，这使得它们非常适合SIMD体系结构。然而，许多这样的计算也表现出不规则性，以数据依赖的动态数据速率的形式，这使得有效的SIMD执行变得困难。这一挑战的一个方面是需要调度计算的执行，计算是由有限队列连接的阶段管道实现的。调度器必须通过将排队项收集到向量中来确保较高的SIMD占用率，并最小化与在阶段之间切换执行相关的成本。在这项工作中，我们提出了针对SIMD处理器上的不规则流应用程序的AFIE (Active Full, Inactive Empty)调度策略。可以证明，AFIE将管道每个阶段的输入分组为最小数量的SIMD向量，同时相对于最佳策略产生有限数量的开关。这些结果适用于即使不规则性禁止从每个输入到每个阶段将产生多少输出的先验知识。我们已经实现了AFIE作为MERCATOR系统的扩展[6]，用于在NVIDIA gpu上构建不规则流媒体应用程序。我们描述了AFIE调度器如何简化MERCATOR的运行时代码，并经验地测量了新调度器在不规则流应用程序上的改进性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2020 Sixth Workshop on Programming Models for SIMD/Vector Processing

自引率

0.00%

发文量