2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)最新文献

筛选
英文 中文
[Title page] (标题页)
2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM) Pub Date : 2018-11-01 DOI: 10.1109/paw-atm.2018.00001
{"title":"[Title page]","authors":"","doi":"10.1109/paw-atm.2018.00001","DOIUrl":"https://doi.org/10.1109/paw-atm.2018.00001","url":null,"abstract":"","PeriodicalId":368346,"journal":{"name":"2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133275352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chapel Aggregation Library (CAL) 教堂集合图书馆
2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM) Pub Date : 2018-11-01 DOI: 10.1109/PAW-ATM.2018.00009
Louis Jenkins, Marcin Zalewski, Michael P. Ferguson
{"title":"Chapel Aggregation Library (CAL)","authors":"Louis Jenkins, Marcin Zalewski, Michael P. Ferguson","doi":"10.1109/PAW-ATM.2018.00009","DOIUrl":"https://doi.org/10.1109/PAW-ATM.2018.00009","url":null,"abstract":"Fine-grained communication is a fundamental prin- ciple of the Partitioned Global Address Space (PGAS), which serves to simplify creating and reasoning about programs in the distributed context. However, per-message overheads of communication rapidly accumulate in programs that generate a high volume of small messages, limiting the effective bandwidth and potentially increasing latency if the messages are generated at a much higher rate than the effective network bandwidth. One way to reduce such fine-grained communication is by coarsening the granularity by aggregating data, or by buffering the smaller communications together in a way that they can be handled in bulk. Once these communications are buffered, the multiple units of the aggregated data can be combined into fewer units in an optimization called coalescing. The Chapel Aggregation Library (CAL) provides a straight- forward approach to handling both aggregation and coalescing of data in Chapel and aims to be as generic and minimal as possible to maximize code reuse and minimize its increase in com- plexity on user applications. CAL provides a high-performance, distributed, and parallel-safe solution that is entirely written as a Chapel module. In addition to being easy to use, CAL improves the performance of some benchmarks by one to two orders of magnitude over naive implementations at 32 compute-nodes on a Cray XC50.","PeriodicalId":368346,"journal":{"name":"2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129382369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Comparison of the HPC and Big Data Java Libraries Spark, PCJ and APGAS HPC和大数据Java库Spark、PCJ和APGAS的比较
2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM) Pub Date : 2018-11-01 DOI: 10.1109/PAW-ATM.2018.00007
Jonas Posner, Lukas Reitz, Claudia Fohry
{"title":"Comparison of the HPC and Big Data Java Libraries Spark, PCJ and APGAS","authors":"Jonas Posner, Lukas Reitz, Claudia Fohry","doi":"10.1109/PAW-ATM.2018.00007","DOIUrl":"https://doi.org/10.1109/PAW-ATM.2018.00007","url":null,"abstract":"Although Java is rarely used in HPC, there are a few notable libraries. Use of Java may help to bridge the gap between HPC and big data processing. This paper compares the big data library Spark, and the HPC libraries PCJ and APGAS, regarding productivity and performance. We refer to Java versions of all libraries. For APGAS, we include both the original version and an own extension by locality-flexible tasks. We consider three benchmarks: Calculation of π from HPC, Unbalanced Tree Search (UTS) from HPC, and WordCount from the big data domain. In performance measurements with up to 144 workers, the extended APGAS library was the clear winner. With 144 workers, APGAS programs were up to a factor of more than two faster than Spark programs, and up to about 30% faster than PCJ programs. Regarding productivity, the extended APGAS programs consistently needed the lowest number of different library constructs. Spark ranged second in productivity, and PCJ third.","PeriodicalId":368346,"journal":{"name":"2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)","volume":"40 12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129101477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Distributed L-shaped Algorithms in Julia Julia中的分布式l形算法
2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM) Pub Date : 2018-11-01 DOI: 10.1109/PAW-ATM.2018.00011
Martin Biel, M. Johansson
{"title":"Distributed L-shaped Algorithms in Julia","authors":"Martin Biel, M. Johansson","doi":"10.1109/PAW-ATM.2018.00011","DOIUrl":"https://doi.org/10.1109/PAW-ATM.2018.00011","url":null,"abstract":"We present LShapedSolvers.jl, a suite of scalable stochastic programming solvers implemented in the Julia programming language. The solvers, which are based on the L-shaped algorithm, run efficiently in parallel, exploit problem structure, and operate on distributed data. The implementation introduces several flexible high-level abstractions that result in a modular design and simplify the development of algorithm variants. In addition, we demonstrate how the abstractions available in the Julia module for distributed computing are exploited to simplify the implementation of the parallel algorithms. The performance of the solvers is evaluated on large-scale problems for finding optimal orders on the Nordic day-ahead electricity market. With 16 worker cores, the fastest algorithm solves a distributed problem with 2.5 million variables and 1.5 million linear constraints about 19 times faster than Gurobi is able to solve the extended form directly.","PeriodicalId":368346,"journal":{"name":"2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122475782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations 异步飓风风暴潮模拟的半静态和动态负载平衡
2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM) Pub Date : 2018-11-01 DOI: 10.1109/PAW-ATM.2018.00010
Maximilian H. Bremer, J. Bachan, Cy P. Chan
{"title":"Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations","authors":"Maximilian H. Bremer, J. Bachan, Cy P. Chan","doi":"10.1109/PAW-ATM.2018.00010","DOIUrl":"https://doi.org/10.1109/PAW-ATM.2018.00010","url":null,"abstract":"The performance of hurricane storm surge simulations is critical to forecast and mitigate the deadly effects of hurricane landfall. Supercomputers play a key role to run these simulations quickly; however, disruptive changes in future computer architectures will require adapting simulators to maintain high performance, such as increasing asynchrony and improving load balance. We introduce two new multi-constraint, fully asynchronous load balancers and a new discrete-event simulator (DGSim) that is able to natively model the execution of task-based hurricane simulations based on efficient one-sided, active message-based communication protocols. We calibrate and validate DGSim, use it to compare the algorithms' load balancing capabilities and task migration costs under many parameterizations, saving of over 5,000x core-hours compared to running the application code directly. Our load balancing algorithms achieve a performance improvement of up to 56 percent over the original static balancer and up to 97 percent of the optimal speed-up.","PeriodicalId":368346,"journal":{"name":"2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128898991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows 共享窗口中具有通知通信的集体操作的高效算法
2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM) Pub Date : 2018-11-01 DOI: 10.1109/PAW-ATM.2018.00006
Muhammed Abdullah Al Ahad, C. Simmendinger, R. Iakymchuk, E. Laure, S. Markidis
{"title":"Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows","authors":"Muhammed Abdullah Al Ahad, C. Simmendinger, R. Iakymchuk, E. Laure, S. Markidis","doi":"10.1109/PAW-ATM.2018.00006","DOIUrl":"https://doi.org/10.1109/PAW-ATM.2018.00006","url":null,"abstract":"Collective operations are commonly used in various parts of scientific applications. Especially in strong scaling scenarios collective operations can negatively impact the overall applications performance: while the load per rank here decreases with increasing core counts, time spent in e.g. barrier operations will increase logarithmically with the core count. In this article, we develop novel algorithmic solutions for collective operations -- such as Allreduce and Allgather(V) -- by leveraging notified communication in shared windows. To this end, we have developed an extension of GASPI which enables all ranks participating in a shared window to observe the entire notified communication targeted at the window. By exploring benefits of this extension, we deliver high performing implementations of Allreduce and Allgather(V) on Intel and Cray clusters. These implementations clearly achieve 2x-4x performance improvements compared to the best performing MPI implementations for various data distributions.","PeriodicalId":368346,"journal":{"name":"2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127431366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
GASNet-EX Performance Improvements Due to Specialization for the Cray Aries Network 基于Cray Aries网络专门化的GASNet-EX性能改进
Paul H. Hargrove, D. Bonachea
{"title":"GASNet-EX Performance Improvements Due to Specialization for the Cray Aries Network","authors":"Paul H. Hargrove, D. Bonachea","doi":"10.2172/1430690","DOIUrl":"https://doi.org/10.2172/1430690","url":null,"abstract":"GASNet-EX is a portable, open-source, high-performance communication library designed to efficiently support the networking requirements of PGAS runtime systems and other alternative models on future exascale machines. This paper reports on the improvements in performance observed on Cray XC-series systems due to enhancements made to the GASNet-EX software. These enhancements, known as \"specializations\", primarily consist of replacing network-independent implementations of several recently added features with implementations tailored to the Cray Aries network. Performance gains from specialization include (1) Negotiated-Payload Active Messages improve bandwidth of a ping-pong test by up to 14%, (2) Immediate Operations reduce running time of a synthetic benchmark by up to 93%, (3) non-bulk RMA Put bandwidth is increased by up to 32%, (4) Remote Atomic performance is 70% faster than the reference on a point-to-point test and allows a hot-spot test to scale robustly, and (5) non-contiguous RMA interfaces see up to 8.6x speedups for an intra-node benchmark and 26% for inter-node. These improvements are all available in GASNet-EX version 2018.3.0 and later.","PeriodicalId":368346,"journal":{"name":"2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133932809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信