V. G. Pinto, Lucas Leandro Nesi, M. Miletto, L. Schnorr
{"title":"Providing In-depth Performance Analysis for Heterogeneous Task-based Applications with StarVZ","authors":"V. G. Pinto, Lucas Leandro Nesi, M. Miletto, L. Schnorr","doi":"10.1109/IPDPSW52791.2021.00013","DOIUrl":null,"url":null,"abstract":"Task-based parallelism has adequately addressed the coding complexity required to fully exploit the processing power offered by omnipresent hybrid CPU/GPU supercomputers. However, its performance highly depends on the proper runtime system setup. Analyzing and tuning the performance of task-based applications running on hybrid platforms is challenging since they present unstructured communication and computation overlap, with finer granularity, dynamic scheduling, and inherent irregularity. This paper discusses the StarVZ approach to enable a comprehensive performance analysis in such a heterogeneous context. StarVZ is built on top of modern data analysis tools and is publicly available as an R package. We collect traces from five diverse task-based applications running on top of the StarPU runtime system on a set of multi-node platforms enhanced with GPUs. We demonstrate how it can highlight disturbances that are particularly hard to identify or explain with traditional analysis tools. Additionally, we provide a detailed performance evaluation of StarVZ with different workloads and setups.","PeriodicalId":170832,"journal":{"name":"2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW52791.2021.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Task-based parallelism has adequately addressed the coding complexity required to fully exploit the processing power offered by omnipresent hybrid CPU/GPU supercomputers. However, its performance highly depends on the proper runtime system setup. Analyzing and tuning the performance of task-based applications running on hybrid platforms is challenging since they present unstructured communication and computation overlap, with finer granularity, dynamic scheduling, and inherent irregularity. This paper discusses the StarVZ approach to enable a comprehensive performance analysis in such a heterogeneous context. StarVZ is built on top of modern data analysis tools and is publicly available as an R package. We collect traces from five diverse task-based applications running on top of the StarPU runtime system on a set of multi-node platforms enhanced with GPUs. We demonstrate how it can highlight disturbances that are particularly hard to identify or explain with traditional analysis tools. Additionally, we provide a detailed performance evaluation of StarVZ with different workloads and setups.