A Latency, Throughput, and Programmability Perspective of GrPPI for Streaming on Multi-cores

2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) Pub Date : 2023-03-01 DOI:10.1109/PDP59025.2023.00033

A. Garcia, Dalvan Griebler, C. Schepke, André Sacilotto Santos, José Daniel García Sánchez, Javier Fernández Muñoz, L. G. Fernandes

{"title":"A Latency, Throughput, and Programmability Perspective of GrPPI for Streaming on Multi-cores","authors":"A. Garcia, Dalvan Griebler, C. Schepke, André Sacilotto Santos, José Daniel García Sánchez, Javier Fernández Muñoz, L. G. Fernandes","doi":"10.1109/PDP59025.2023.00033","DOIUrl":null,"url":null,"abstract":"Several solutions aim to simplify the burdening task of parallel programming. The GrPPI library is one of them. It allows users to implement parallel code for multiple backends through a unified, abstract, and generic layer while promising minimal overhead on performance. An outspread evaluation of GrPPI regarding stream parallelism with representative metrics for this domain, such as throughput and latency, was not yet done. In this work, we evaluate GrPPI focused on stream processing. We evaluate performance, memory usage, and programming effort and compare them against handwritten parallel code. For this, we use the benchmarking framework SPBench to build custom GrPPI benchmarks. The basis of the benchmarks is real applications, such as Lane Detection, Bzip2, Face Recognizer, and Ferret. Experiments show that while performance is competitive with handwritten code in some cases, in other cases, the infeasibility of fine-tuning GrPPI is a crucial drawback. Despite this, programmability experiments estimate that GrPPI has the potential to reduce by about three times the development time of parallel applications.","PeriodicalId":153500,"journal":{"name":"2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PDP59025.2023.00033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Several solutions aim to simplify the burdening task of parallel programming. The GrPPI library is one of them. It allows users to implement parallel code for multiple backends through a unified, abstract, and generic layer while promising minimal overhead on performance. An outspread evaluation of GrPPI regarding stream parallelism with representative metrics for this domain, such as throughput and latency, was not yet done. In this work, we evaluate GrPPI focused on stream processing. We evaluate performance, memory usage, and programming effort and compare them against handwritten parallel code. For this, we use the benchmarking framework SPBench to build custom GrPPI benchmarks. The basis of the benchmarks is real applications, such as Lane Detection, Bzip2, Face Recognizer, and Ferret. Experiments show that while performance is competitive with handwritten code in some cases, in other cases, the infeasibility of fine-tuning GrPPI is a crucial drawback. Despite this, programmability experiments estimate that GrPPI has the potential to reduce by about three times the development time of parallel applications.

查看原文本刊更多论文

从延迟、吞吐量和可编程性的角度看GrPPI在多核流上的应用

有几个解决方案旨在简化并行编程的繁重任务。GrPPI库就是其中之一。它允许用户通过一个统一的、抽象的、通用的层为多个后端实现并行代码，同时保证最小的性能开销。关于该领域的代表性指标(如吞吐量和延迟)的流并行性的GrPPI扩展评估尚未完成。在这项工作中，我们评估了侧重于流处理的GrPPI。我们评估性能、内存使用和编程工作，并将它们与手写并行代码进行比较。为此，我们使用基准测试框架SPBench来构建自定义GrPPI基准。基准测试的基础是真实的应用程序，例如Lane Detection、Bzip2、Face Recognizer和Ferret。实验表明，虽然在某些情况下性能可以与手写代码竞争，但在其他情况下，微调GrPPI的不可行性是一个关键的缺点。尽管如此，可编程性实验估计GrPPI有可能将并行应用程序的开发时间减少约三倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)

自引率

0.00%

发文量