The Evaluation of an Effective Out-of-Core Run-Time System in the Context of Parallel Mesh Generation

2011 IEEE International Parallel & Distributed Processing Symposium Pub Date : 2011-05-16 DOI:10.1109/IPDPS.2011.25

A. Kot, Andrey N. Chernikov, N. Chrisochoides

{"title":"The Evaluation of an Effective Out-of-Core Run-Time System in the Context of Parallel Mesh Generation","authors":"A. Kot, Andrey N. Chernikov, N. Chrisochoides","doi":"10.1109/IPDPS.2011.25","DOIUrl":null,"url":null,"abstract":"We present an out-of-core run-time system that supports effective parallel computation of large irregular and adaptive problems, in particular unstructured mesh generation (PUMG). PUMG is a highly challenging application due to intensive memory accesses, unpredictable communication patterns, and variable and irregular data dependencies reflecting the unstructured spatial connectivity of mesh elements. Our runtime system allows to transform the footprint of parallel applications from wide and shallow into narrow and deep by extending the memory utilization to the out-of-core level. It simplifies and streamlines the development of otherwise highly time consuming out-of-core applications as well as the converting of existing applications. It utilizes disk, network and memory hierarchy to achieve high utilization of computing resources without sacrificing performance with PUMG. The runtime system combines different programming paradigms: multi-threading within the nodes using industrial strength software framework, one-sided active messages among the nodes, and an out-of-core subsystem for managing large datasets. We performed an evaluation on traditional parallel platforms to stress test all layers of the run-time system using three different PUMG methods with significantly varying communication and synchronization patterns. We demonstrated high overlap in computation, communication, and disk I/O which results in good performance when computing large out-of-core problems. The runtime system adds very small overhead~(up to 18\\% on most configurations) when computing in-core which means performance is not compromised.","PeriodicalId":355100,"journal":{"name":"2011 IEEE International Parallel & Distributed Processing Symposium","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Parallel & Distributed Processing Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPS.2011.25","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

We present an out-of-core run-time system that supports effective parallel computation of large irregular and adaptive problems, in particular unstructured mesh generation (PUMG). PUMG is a highly challenging application due to intensive memory accesses, unpredictable communication patterns, and variable and irregular data dependencies reflecting the unstructured spatial connectivity of mesh elements. Our runtime system allows to transform the footprint of parallel applications from wide and shallow into narrow and deep by extending the memory utilization to the out-of-core level. It simplifies and streamlines the development of otherwise highly time consuming out-of-core applications as well as the converting of existing applications. It utilizes disk, network and memory hierarchy to achieve high utilization of computing resources without sacrificing performance with PUMG. The runtime system combines different programming paradigms: multi-threading within the nodes using industrial strength software framework, one-sided active messages among the nodes, and an out-of-core subsystem for managing large datasets. We performed an evaluation on traditional parallel platforms to stress test all layers of the run-time system using three different PUMG methods with significantly varying communication and synchronization patterns. We demonstrated high overlap in computation, communication, and disk I/O which results in good performance when computing large out-of-core problems. The runtime system adds very small overhead~(up to 18\% on most configurations) when computing in-core which means performance is not compromised.

查看原文本刊更多论文

并行网格生成环境下有效的离核运行时系统评价

我们提出了一个核外运行时系统，支持大型不规则和自适应问题的有效并行计算，特别是非结构化网格生成(PUMG)。由于密集的内存访问、不可预测的通信模式以及反映网格元素非结构化空间连通性的可变和不规则数据依赖关系，PUMG是一个极具挑战性的应用。我们的运行时系统允许通过将内存利用率扩展到核外级别，将并行应用程序的内存占用从宽而浅转换为窄而深。它简化和流线化了原本非常耗时的核心外应用程序的开发，以及现有应用程序的转换。它利用磁盘、网络和内存层次结构，在不牺牲PUMG性能的情况下实现计算资源的高利用率。运行时系统结合了不同的编程范例:使用工业强度软件框架的节点内多线程，节点之间的单侧活动消息，以及用于管理大型数据集的核心外子系统。我们在传统的并行平台上进行了评估，使用三种不同的PUMG方法对运行时系统的所有层进行了压力测试，这些方法具有显著不同的通信和同步模式。我们展示了在计算、通信和磁盘I/O方面的高度重叠，这在计算大型核外问题时可以带来良好的性能。当内核计算时，运行时系统增加了非常小的开销(在大多数配置中最多为18%)，这意味着性能不会受到损害。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 IEEE International Parallel & Distributed Processing Symposium

自引率

0.00%

发文量