{"title":"OpenSHMEM Non-blocking Data Movement Operations with MVAPICH2-X: Early Experiences","authors":"Khaled Hamidouche, Jie Zhang, D. Panda, K. Tomko","doi":"10.1109/PAW.2016.7","DOIUrl":"https://doi.org/10.1109/PAW.2016.7","url":null,"abstract":"PGAS models, with their lightweight synchronization and shared memory abstraction, are seen as a good alternative to the Message Passing model for irregular communication patterns. OpenSHMEM is a library-based PGAS model. OpenSHMEM 1.3 introduced Non-Blocking data movement operations to provide better asynchronous progress and overlap. In this paper, we present our experiences in designing Non-Blocking Put and Get operations on InfiniBand systems. Using the MVAPICH2-X runtime, we present alternative designs for intra-node and inter-node operations. We also present a set of new benchmarks to analyze the latency, message rate performance, and communication/computation overlap benefits. The performance evaluation shows a 7X improvement in the message rate. Furthermore, using a 3D-Stencil based application kernel, we assess the benefits of the OpenSHMEM Non-Blocking extensions. We show 50% and 28% improvement on 27 and 64 processes, respectively.","PeriodicalId":383847,"journal":{"name":"2016 PGAS Applications Workshop (PAW)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127556647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Application of PGAS Programming to Power Grid Simulation","authors":"B. Palmer","doi":"10.1109/PAW.2016.10","DOIUrl":"https://doi.org/10.1109/PAW.2016.10","url":null,"abstract":"This paper will describe the application of the PGAS Global Arrays (GA) library to power grid simulations. The GridPACK™ framework has been designed to enable power grid engineers to develop parallel simulations of the power grid by providing a set of templates and libraries that encapsulate most of the details of parallel programming in higher level abstractions. The communication portions of the framework are implemented using a combination of message-passing (MPI) and one-sided communication (GA). This paper will provide a brief overview of GA and describe in detail the implementation of collective hash tables, which are used in many power grid applications to match data with a previously distributed network.","PeriodicalId":383847,"journal":{"name":"2016 PGAS Applications Workshop (PAW)","volume":"279 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114107368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimizing PGAS Overhead in a Multi-locale Chapel Implementation of CoMD","authors":"Riyaz Haque, D. Richards","doi":"10.1109/PAW.2016.9","DOIUrl":"https://doi.org/10.1109/PAW.2016.9","url":null,"abstract":"Chapel supports distributed computing with an underlying PGAS memory address space. While it provides abstractions for writing simple and elegant distributed code, the type system currently lacks a notion of locality, i.e., a description of an object's access behavior in relation to its actual location. This often necessitates programmer intervention to avoid redundant non-local data access. Moreover, due to insufficient locality information, the compiler ends up using “wide” pointers—that can point to non-local data—for objects referenced in an otherwise completely local manner, adding to the runtime overhead. In this work we describe CoMD-Chapel, our distributed Chapel implementation of the CoMD benchmark. We demonstrate that optimizing data access through replication and localization is crucial for achieving performance comparable to the reference implementation. We discuss limitations of existing scope-based locality optimizations and argue instead for a more general (and robust) type-based approach. Lastly, we evaluate code performance and scaling characteristics. The fully optimized version of CoMD-Chapel performs within 62%–87% of the reference implementation.","PeriodicalId":383847,"journal":{"name":"2016 PGAS Applications Workshop (PAW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115797313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-scale CAFE Framework for Simulating Fracture in Heterogeneous Materials Implemented in Fortran Co-arrays and MPI","authors":"A. Shterenlikht, L. Margetts, J. D. Arregui-Mena, L. Cebamanos","doi":"10.1109/PAW.2016.6","DOIUrl":"https://doi.org/10.1109/PAW.2016.6","url":null,"abstract":"Fortran coarrays have been used as an extension to the standard for over 20 years, mostly on Cray systems. Their appeal to users increased substantially when they were standardised in 2010. In this work we show that coarrays offer simple and intuitive data structures for 3D cellular automata (CA) modelling of material microstructures. We show how coarrays can be used together with an MPI finite element (FE) library to create a two-way concurrent hierarchical and scalable multi-scale CAFE deformation and fracture framework. Design of a coarray cellular automata microstructure evolution library, CGPACK, is described. A highly portable MPI FE library, ParaFEM, was used in this work. We show that, independently, CGPACK and ParaFEM programs can scale up well into tens of thousands of cores. Strong scaling of a hybrid ParaFEM/CGPACK MPI/coarray multi-scale framework was measured on a practical solid mechanics example: fracture of a steel round bar under tension. That program did not scale beyond 7 thousand cores. Excessive synchronisation might be one contributing factor to the relatively poor scaling. Therefore we conclude with a comparative analysis of synchronisation requirements in MPI and coarray programs. Specific challenges of synchronising a coarray library are discussed.","PeriodicalId":383847,"journal":{"name":"2016 PGAS Applications Workshop (PAW)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116525317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Experiences of Applying One-Sided Communication to Nearest-Neighbor Communication","authors":"H. Shan, Samuel Williams, Yili Zheng, Weiqun Zhang, Bei Wang, S. Ethier, Zhengji Zhao","doi":"10.1109/PAW.2016.8","DOIUrl":"https://doi.org/10.1109/PAW.2016.8","url":null,"abstract":"Nearest-neighbor communication is one of the most important communication patterns appearing in many scientific applications. In this paper, we discuss the results of applying UPC++, a library-based partitioned global address space (PGAS) programming extension to C++, to an adaptive mesh framework (BoxLib) and a full scientific application, GTC-P, whose communications are dominated by nearest-neighbor communication. The results on a Cray XC40 system show that, compared with the highly-tuned MPI two-sided implementations, UPC++ improves the communication performance by up to 60% and 90% for BoxLib and GTC-P, respectively. We also implement the nearest-neighbor communication using MPI one-sided messages. The performance comparison demonstrates that the MPI one-sided implementation can also improve the communication performance over the two-sided version, though not as significantly as UPC++ does.","PeriodicalId":383847,"journal":{"name":"2016 PGAS Applications Workshop (PAW)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133537935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}