Latest publications: International Conference on Partitioned Global Address Space Programming Models

Development and performance analysis of a UPC Particle-in-Cell code
Pub Date: 2010-10-12 | DOI: 10.1145/2020373.2020383
S. Markidis, G. Lapenta
{"title":"Development and performance analysis of a UPC Particle-in-Cell code","authors":"S. Markidis, G. Lapenta","doi":"10.1145/2020373.2020383","DOIUrl":"https://doi.org/10.1145/2020373.2020383","url":null,"abstract":"The development and the implementation of a Particle-in-Cell code written in the Unified Parallel C (UPC) language for plasma simulations with application to astrophysics and fusion nuclear energy machines are presented. A simple one dimensional electrostatic Particle-in-Cell code has been developed first to investigate the implementation details in the UPC language, and second to study the UPC performance on parallel computers. The initial simulations of plasmas with the UPC Particle-in-Cell code and a study of parallel speed-up of the UPC code up to 128 cores are shown.","PeriodicalId":245693,"journal":{"name":"International Conference on Partitioned Global Address Space Programming Models","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133635863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
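
The abstract above outlines the structure of a PIC time step: particles gather the field at their grid cell, then velocities and positions are advanced. The following is a minimal, hypothetical UPC sketch of that push phase, not the authors' code; the array sizes, physical constants, and nearest-grid-point interpolation are illustrative, and the charge-deposition (scatter) step is only noted in a comment.

/* pic_sketch.upc -- compile with a UPC compiler, e.g. `upcc pic_sketch.upc` */
#include <upc.h>
#include <stdio.h>

#define NP_PER_THREAD 4096   /* particles per thread (illustrative)   */
#define NG_PER_THREAD 256    /* grid points per thread (illustrative) */
#define NP (NP_PER_THREAD * THREADS)
#define NG (NG_PER_THREAD * THREADS)

/* Shared (PGAS) arrays: default cyclic layout across UPC threads. */
shared double x[NP];   /* particle positions  */
shared double v[NP];   /* particle velocities */
shared double E[NG];   /* grid electric field */

int main(void) {
    const double dt = 0.1, dx = 1.0, qm = -1.0;  /* illustrative constants */
    const double L = NG * dx;                    /* domain length          */
    int p;

    /* Initialize only the particles this thread has affinity to. */
    upc_forall (p = 0; p < NP; p++; &x[p]) {
        x[p] = (double)p * L / NP;
        v[p] = 0.0;
    }
    upc_barrier;

    /* Particle push: gather E at the nearest grid point, advance v and x.
       E[cell] may live on another thread; the UPC runtime fetches it. */
    upc_forall (p = 0; p < NP; p++; &x[p]) {
        int cell = (int)(x[p] / dx) % NG;
        v[p] += qm * E[cell] * dt;
        x[p] += v[p] * dt;
        if (x[p] < 0.0)      x[p] += L;   /* periodic boundary */
        else if (x[p] >= L)  x[p] -= L;
    }
    upc_barrier;

    /* The charge-deposition (scatter) step is omitted here; in a real PIC
       code it needs care to avoid races when threads update a shared rho[]. */
    if (MYTHREAD == 0)
        printf("pushed %d particles on %d UPC threads\n", NP, THREADS);
    return 0;
}
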
Unifying UPC and MPI runtimes: experience with MVAPICH
Pub Date: 2010-10-12 | DOI: 10.1145/2020373.2020378
Jithin Jose, Miao Luo, S. Sur, D. Panda
{"title":"Unifying UPC and MPI runtimes: experience with MVAPICH","authors":"Jithin Jose, Miao Luo, S. Sur, D. Panda","doi":"10.1145/2020373.2020378","DOIUrl":"https://doi.org/10.1145/2020373.2020378","url":null,"abstract":"Unified Parallel C (UPC) is an emerging parallel programming language that is based on a shared memory paradigm. MPI has been a widely ported and dominant parallel programming model for the past couple of decades. Real-life scientific applications require a lot of investment by domain scientists. Many scientists choose the MPI programming model as it is considered low-risk. It is unlikely that entire applications will be re-written using the emerging UPC language (or PGAS paradigm) in the near future. It is more likely that parts of these applications will be converted to newer models. This requires that underlying implementation of system software be able to support both UPC and MPI simultaneously. Unfortunately, the current state-of-the-art of UPC and MPI interoperability leaves much to be desired both in terms of performance and ease-of-use.\u0000 In this paper, we propose \"Integrated Native Communication Runtime\" (INCR) for MPI and UPC communications on InfiniBand clusters. Our library is capable of supporting both UPC and MPI communications simultaneously. This runtime is based on the widely used MVAPICH (MPI over InfiniBand) Aptus runtime, which is known to scale to tens-of-thousands of cores. Our evaluation reveals that INCR is able to deliver equal or better performance compared to the existing UPC runtime - GASNet on InfiniBand verbs. We observe that with UPC NAS benchmarks CG and MG (class B) at 128 processes, we outperform current GASNet implementation by 10% and 23%, respectively.","PeriodicalId":245693,"journal":{"name":"International Conference on Partitioned Global Address Space Programming Models","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128627250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 55
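
To make the interoperability scenario concrete, here is a hypothetical hybrid kernel in which an MPI application has one phase ported to UPC shared arrays. It assumes a runtime such as the paper's INCR (or another UPC-with-MPI interoperability setup) that lets both models coexist in one executable with one MPI rank per UPC thread; the array size and the reduction are illustrative, not taken from the paper.

/* hybrid_sketch.upc -- hypothetical UPC+MPI hybrid; requires a unified or
 * interoperable runtime so that both libraries can be active at once. */
#include <upc.h>
#include <mpi.h>
#include <stdio.h>

#define N_PER_THREAD 1024

shared double data[N_PER_THREAD * THREADS];  /* UPC shared array */

int main(int argc, char **argv) {
    int rank, i;
    double local_sum = 0.0, global_sum = 0.0;

    MPI_Init(&argc, &argv);                 /* legacy MPI portion */
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    /* Newly ported UPC portion: each thread touches only the elements
       it has affinity to and accumulates a private partial sum. */
    upc_forall (i = 0; i < N_PER_THREAD * THREADS; i++; &data[i]) {
        data[i] = (double)i;
        local_sum += data[i];
    }
    upc_barrier;

    /* Remaining MPI portion: combine the per-process partial sums. */
    MPI_Allreduce(&local_sum, &global_sum, 1, MPI_DOUBLE, MPI_SUM,
                  MPI_COMM_WORLD);

    if (rank == 0)
        printf("global sum = %f\n", global_sum);

    MPI_Finalize();
    return 0;
}
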
Introducing OpenSHMEM: SHMEM for the PGAS community
Pub Date: 2010-10-12 | DOI: 10.1145/2020373.2020375
B. Chapman, Tony Curtis, S. Pophale, S. Poole, J. Kuehn, C. Koelbel, Lauren Smith
{"title":"Introducing OpenSHMEM: SHMEM for the PGAS community","authors":"B. Chapman, Tony Curtis, S. Pophale, S. Poole, J. Kuehn, C. Koelbel, Lauren Smith","doi":"10.1145/2020373.2020375","DOIUrl":"https://doi.org/10.1145/2020373.2020375","url":null,"abstract":"The OpenSHMEM community would like to announce a new effort to standardize SHMEM, a communications library that uses one-sided communication and utilizes a partitioned global address space.\u0000 OpenSHMEM is an effort to bring together a variety of SHMEM and SHMEM-like implementations into an open standard using a community-driven model. By creating an open-source specification and reference implementation of OpenSHMEM, there will be a wider availability of a PGAS library model on current and future architectures. In addition, the availability of an OpenSHMEM model will enable the development of performance and validation tools.\u0000 We propose an OpenSHMEM specification to help tie together a number of divergent implementations of SHMEM that are currently available.\u0000 To support an existing and growing user community, we will develop the OpenSHMEM web presence, including a community wiki and training material, and face-to-face interaction, including workshops and conference participation.","PeriodicalId":245693,"journal":{"name":"International Conference on Partitioned Global Address Space Programming Models","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117158256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 219
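
As background for the library being standardized, the sketch below shows the one-sided, symmetric-memory style of programming that SHMEM supports: every processing element (PE) allocates a remotely accessible buffer and writes into a neighbor's copy with a put, with no matching receive on the target. Function names follow the later OpenSHMEM specification and are used here only for illustration.

/* shmem_sketch.c -- compile with an OpenSHMEM wrapper, e.g. `oshcc shmem_sketch.c` */
#include <shmem.h>
#include <stdio.h>

int main(void) {
    shmem_init();
    int me   = shmem_my_pe();   /* this processing element */
    int npes = shmem_n_pes();   /* total number of PEs     */

    /* Symmetric allocation: every PE gets a remotely accessible buffer. */
    int *dest = (int *)shmem_malloc(sizeof(int));
    *dest = -1;
    shmem_barrier_all();

    /* One-sided put: write my PE number into the next PE's buffer. */
    int src = me;
    shmem_int_put(dest, &src, 1, (me + 1) % npes);
    shmem_barrier_all();

    printf("PE %d of %d received %d\n", me, npes, *dest);

    shmem_free(dest);
    shmem_finalize();
    return 0;
}
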
Numerical Python for scalable architectures
Pub Date: 2010-10-12 | DOI: 10.1145/2020373.2020388
M. R. B. Kristensen, B. Vinter
{"title":"Numerical Python for scalable architectures","authors":"M. R. B. Kristensen, B. Vinter","doi":"10.1145/2020373.2020388","DOIUrl":"https://doi.org/10.1145/2020373.2020388","url":null,"abstract":"In this paper, we introduce DistNumPy, a library for doing numerical computation in Python that targets scalable distributed memory architectures. DistNumPy extends the NumPy module[15], which is popular for scientific programming. Replacing NumPy with Dist-NumPy enables the user to write sequential Python programs that seamlessly utilize distributed memory architectures. This feature is obtained by introducing a new backend for NumPy arrays, which distribute data amongst the nodes in a distributed memory multi-processor. All operations on this new array will seek to utilize all available processors. The array itself is distributed between multiple processors in order to support larger arrays than a single node can hold in memory.\u0000 We perform three experiments of sequential Python programs running on an Ethernet based cluster of SMP-nodes with a total of 64 CPU-cores. The results show an 88% CPU utilization when running a Monte Carlo simulation, 63% CPU utilization on an N-body simulation and a more modest 50% on a Jacobi solver. The primary limitation in CPU utilization is identified as SMP limitations and not the distribution aspect. Based on the experiments we find that it is possible to obtain significant speedup from using our new array-backend without changing the original Python code.","PeriodicalId":245693,"journal":{"name":"International Conference on Partitioned Global Address Space Programming Models","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130525278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Introducing mNUMA: an extended PGAS architecture
Pub Date: 2010-10-12 | DOI: 10.1145/2020373.2020379
Megan Vance, P. Kogge
{"title":"Introducing mNUMA: an extended PGAS architecture","authors":"Megan Vance, P. Kogge","doi":"10.1145/2020373.2020379","DOIUrl":"https://doi.org/10.1145/2020373.2020379","url":null,"abstract":"We describe design details of a Light Weight Processing migration-NUMA architecture, a novel high performance system design that provides hardware support for a partitioned global address space, migrating subjects, and word level synchronization primitives. Using the architectural definition, combinations of structures are shown to work together to carry out basic actions such as address translation, migration, in-memory synchronization, and work management. We present results from simulation of microkernels showing that LWP-mNUMA compensates for latency with far greater memory access concurrency than possible on a conventional systems. In particular, several microkernels model tough, irregular access patterns that have limited speedups -- in certain problem areas -- to dozens of conventional processors. On these, results show speedup increasing up to 1024 multicore mNUMA processing nodes, running over 1 million threadlets.","PeriodicalId":245693,"journal":{"name":"International Conference on Partitioned Global Address Space Programming Models","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115889535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Predicting remote reuse distance patterns in UPC applications
Pub Date: 2010-10-12 | DOI: 10.1145/2020373.2020374
Steven Vormwald, Wei Wang, S. Carr, S. Seidel, Z. Wang
{"title":"Predicting remote reuse distance patterns in UPC applications","authors":"Steven Vormwald, Wei Wang, S. Carr, S. Seidel, Z. Wang","doi":"10.1145/2020373.2020374","DOIUrl":"https://doi.org/10.1145/2020373.2020374","url":null,"abstract":"Current work in high productivity parallel computing has focused attention on the class of partitioned global address space (PGAS) parallel programming languages because they promise to reduce the effort required to develop parallel application codes. An important aspect in achieving good performance in PGAS languages is effective handling of remote memory references. We extend a single-threaded reuse distance model to predict memory behavior for multi-threaded UPC applications. Our model handles changes in per-thread data size as well as changes in thread mapping due to problem size increases. Our results indicate the model provides good predictions of remote memory behavior by accurately predicting changes in remote memory reuse distance as a function of the problem size and the number of threads.","PeriodicalId":245693,"journal":{"name":"International Conference on Partitioned Global Address Space Programming Models","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131604101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
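
For readers unfamiliar with the metric, reuse distance (LRU stack distance) is the number of distinct memory blocks touched between two references to the same block. The sketch below measures it for a toy reference trace; it is plain single-trace bookkeeping, not the paper's prediction model, and the trace values and stack capacity are made up.

/* reuse_distance_sketch.c -- basic LRU stack-distance measurement. */
#include <stdio.h>

#define MAX_BLOCKS 1024   /* capacity of the toy LRU stack (assumed) */

static long stack_[MAX_BLOCKS];  /* most recently used block at index 0 */
static int  depth = 0;

/* Return the reuse distance of block `b` (-1 for a cold miss) and move it
   to the top of the LRU stack. O(depth) per reference; real tools use
   trees or sampling, but the definition is the same. */
static int reuse_distance(long b) {
    int i, found = -1;
    for (i = 0; i < depth; i++)
        if (stack_[i] == b) { found = i; break; }

    if (found < 0) {                         /* cold miss: push on top */
        if (depth < MAX_BLOCKS) depth++;
        found = depth - 1;
        for (i = found; i > 0; i--) stack_[i] = stack_[i - 1];
        stack_[0] = b;
        return -1;
    }
    for (i = found; i > 0; i--) stack_[i] = stack_[i - 1];
    stack_[0] = b;
    return found;      /* # distinct blocks referenced since last use of b */
}

int main(void) {
    /* A toy trace of block addresses (e.g. remote references by one thread). */
    long trace[] = { 1, 2, 3, 1, 2, 1, 4, 3 };
    int n = (int)(sizeof trace / sizeof trace[0]);

    for (int i = 0; i < n; i++)
        printf("block %ld -> reuse distance %d\n",
               trace[i], reuse_distance(trace[i]));
    return 0;
}
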
Extensible PGAS semantics for C++
Pub Date: 2010-10-12 | DOI: 10.1145/2020373.2020385
N. Edmonds, Douglas P. Gregor, A. Lumsdaine
{"title":"Extensible PGAS semantics for C++","authors":"N. Edmonds, Douglas P. Gregor, A. Lumsdaine","doi":"10.1145/2020373.2020385","DOIUrl":"https://doi.org/10.1145/2020373.2020385","url":null,"abstract":"The Partitioned Global Address Space model combines the expression of data locality in SPMD applications, which is crucial to achieving good parallel performance, with the relative simplicity of the Distributed Shared Memory model. C++ currently lacks language support for PGAS semantics; however, C++ is an excellent host language for implementing Domain-Specific Embedded Languages (DSELs). Leveraging these capabilities of C++, we have implemented the Partitioned Global Property Map, a DSEL library supporting PGAS semantics, polymorphic partitioned global data structures, and a number of useful extensions. The Partitioned Global Property Map library utilizes template meta-programming to allow direct mapping at compile-time of high-level semantics to efficient underlying implementations. It combines flexible/extensible semantics, high performance, and portability across different low-level communication interfaces to allow PGAS programs to be expressed in C++.","PeriodicalId":245693,"journal":{"name":"International Conference on Partitioned Global Address Space Programming Models","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130601693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
X10-enabled MapReduce
Pub Date: 2010-10-12 | DOI: 10.1145/2020373.2020382
H. Dong, Shujia Zhou, D. Grove
{"title":"X10-enabled MapReduce","authors":"H. Dong, Shujia Zhou, D. Grove","doi":"10.1145/2020373.2020382","DOIUrl":"https://doi.org/10.1145/2020373.2020382","url":null,"abstract":"The MapReduce framework has become a popular and powerful tool to process large datasets in parallel over a cluster of computing nodes [1]. Currently, there are many flavors of implementations of MapReduce, among which the most popular is the Hadoop implementation in Java [5]. However, these implementations either rely on third-party file systems for across-computer-node communication or are difficult to implement with socket programming or communication libraries such as MPI. To address these challenges, we investigated utilizing the X10 language to implement MapReduce and tested it with the word-count use case. The key performance factor in implementing MapReduce is data moving across different computer nodes. Since X10 has built-in functions for across-node communication such as distributed arrays [2], a major challenge with MapReduce implementations is easily solved. We tested two main implementations: the first utilizes the HashMap data structure and the second a Rail with elements consisting of a string and integer pair. The performance of these two implementations are analyzed and discussed.","PeriodicalId":245693,"journal":{"name":"International Conference on Partitioned Global Address Space Programming Models","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127795765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
XcalableMP implementation and performance of NAS Parallel Benchmarks
Pub Date: 2010-10-12 | DOI: 10.1145/2020373.2020384
M. Nakao, Jinpil Lee, T. Boku, M. Sato
{"title":"XcalableMP implementation and performance of NAS Parallel Benchmarks","authors":"M. Nakao, Jinpil Lee, T. Boku, M. Sato","doi":"10.1145/2020373.2020384","DOIUrl":"https://doi.org/10.1145/2020373.2020384","url":null,"abstract":"XcalableMP is a parallel extension of existing languages, such as C and Fortran, that was proposed as a new programming model to facilitate program parallel applications for distributed memory systems. In order to investigate the performance of parallel programs written in XcalableMP, we have implemented NAS Parallel Benchmarks, specifically, the Embarrassingly Parallel (EP) benchmark, the Integer Sort (IS) benchmark, and the Conjugate Gradient (CG) benchmark, using XcalableMP. The results show that the performance of XcalableMP is comparable to that of MPI. In particular, the performances of IS with a histogram and CG with two-dimensional parallelization achieve almost the same performance. The results also demonstrate that XcalableMP allows a programmer to write efficient parallel applications at a lower programming cost.","PeriodicalId":245693,"journal":{"name":"International Conference on Partitioned Global Address Space Programming Models","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125775790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
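
To illustrate the directive-based, global-view style the abstract refers to, here is a small hypothetical XcalableMP example in C: a template is block-distributed over the executing nodes, an array is aligned with it, and a work-sharing loop with a reduction sums the local blocks. The array size and node layout are illustrative, and a compiler without XMP support would simply ignore the pragmas and run the loop serially.

/* xmp_sketch.c -- a minimal global-view XcalableMP sketch (directives on C). */
#include <stdio.h>

#define N 1024

#pragma xmp nodes p(*)                  /* all executing nodes             */
#pragma xmp template t(0:N-1)           /* a template of N index points    */
#pragma xmp distribute t(block) onto p  /* block-distribute the template   */

double a[N];
#pragma xmp align a[i] with t(i)        /* align the array with it         */

int main(void) {
    int i;
    double sum = 0.0;

    /* Each node initializes and sums only the block of `a` it owns;
       the reduction clause combines the partial sums across nodes. */
#pragma xmp loop on t(i) reduction(+:sum)
    for (i = 0; i < N; i++) {
        a[i] = (double)i;
        sum += a[i];
    }

    printf("sum = %f\n", sum);  /* after the reduction, every node holds it */
    return 0;
}
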