2014 IEEE International Parallel & Distributed Processing Symposium Workshops最新文献_第4页

GABB Introduction

2014 IEEE International Parallel & Distributed Processing Symposium Workshops Pub Date : 2014-05-19 DOI: 10.1109/IPDPSW.2014.221

T. Mattson, David A. Bader, A. Buluç, J. Gilbert, Joseph E. Gonzalez, J. Kepner

引用次数: 0

New Algorithm for Computing Eigenvectors of the Symmetric Eigenvalue Problem 对称特征值问题特征向量计算的新算法

2014 IEEE International Parallel & Distributed Processing Symposium Workshops Pub Date : 2014-05-19 DOI: 10.1109/IPDPSW.2014.130

A. Haidar, P. Luszczek, J. Dongarra

引用次数: 12

A Genetic Algorithm-Based Sparse Coverage over Urban VANETs 基于遗传算法的城市区域稀疏覆盖

2014 IEEE International Parallel & Distributed Processing Symposium Workshops Pub Date : 2014-05-19 DOI: 10.1109/IPDPSW.2014.59

Huang Cheng, Xin Fei, A. Boukerche, M. Almulla

引用次数: 7

HPGC Introduction and Committees HPGC简介及委员会

2014 IEEE International Parallel & Distributed Processing Symposium Workshops Pub Date : 2014-05-19 DOI: 10.1109/IPDPSW.2014.216

E. Aubanel, V. Bhavsar, M. Frumkin

引用次数: 0

Towards Extreme-Scale Simulations with Next-Generation Trilinos: A Low Mach Fluid Application Case Study 下一代Trilinos的极端尺度模拟:低马赫流体应用案例研究

2014 IEEE International Parallel & Distributed Processing Symposium Workshops Pub Date : 2014-05-19 DOI: 10.1109/IPDPSW.2014.166

P. Lin, M. Bettencourt, S. Domino, T. Fisher, M. Hoemmen, Jonathan J. Hu, E. Phipps, A. Prokopenko, S. Rajamanickam, C. Siefert, E. Cyr, S. Kennon

{"title":"Towards Extreme-Scale Simulations with Next-Generation Trilinos: A Low Mach Fluid Application Case Study","authors":"P. Lin, M. Bettencourt, S. Domino, T. Fisher, M. Hoemmen, Jonathan J. Hu, E. Phipps, A. Prokopenko, S. Rajamanickam, C. Siefert, E. Cyr, S. Kennon","doi":"10.1109/IPDPSW.2014.166","DOIUrl":"https://doi.org/10.1109/IPDPSW.2014.166","url":null,"abstract":"Trilinos is an object-oriented software framework for the solution of large-scale, complex multi-physics engineering and scientific problems. While the original version of Trilinos was designed for highly scalable solutions for large problems, the need for increasingly higher fidelity simulations has pushed the problem sizes beyond what could have been envisioned two decades ago. When problem sizes exceed a billion elements even highly scalable applications and solver stacks require a complete revision. The next-generation Trilinos employs C++ templates in order to solve arbitrarily large problems and enable extreme-scale simulations. We present a case study that involves integration of Trilinos with an engineering application (Sierra low Mach module/Nalu), involving the simulation of low Mach fluid flow for problems of size up to nine billion elements. Through the use of improved algorithms and better software engineering practices, we demonstrate good weak scaling for the matrix assembly and solve for the engineering application for up to a nine billion element fluid flow large eddy simulation (LES) problem on unstructured meshes with a 27 billion row matrix on 131,072 cores of a Cray XE6 platform.","PeriodicalId":153864,"journal":{"name":"2014 IEEE International Parallel & Distributed Processing Symposium Workshops","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127628754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 11

The Empirical Research of Virtual Enterprise Knowledge Transfer's Effectiveness Faced to the Independent Innovation Ability 面向自主创新能力的虚拟企业知识转移有效性实证研究

2014 IEEE International Parallel & Distributed Processing Symposium Workshops Pub Date : 2014-05-19 DOI: 10.1109/IPDPSW.2014.186

Yang Bo, N. Xiong, Wenzhong Guo

{"title":"The Empirical Research of Virtual Enterprise Knowledge Transfer's Effectiveness Faced to the Independent Innovation Ability","authors":"Yang Bo, N. Xiong, Wenzhong Guo","doi":"10.1109/IPDPSW.2014.186","DOIUrl":"https://doi.org/10.1109/IPDPSW.2014.186","url":null,"abstract":"Based on the theory of knowledge transfer and with the organizational characteristics, the paper makes member enterprises' knowledge transfer behaviors and the innovation of knowledge, technology and management as research objects to study the practical effectiveness of promoting enterprises' independent innovation ability by the successful use of Virtual Enterprise Knowledge Transfer. Having analyzed the Virtual Enterprise Knowledge Transfer's influence to the independent innovation ability of enterprises, the paper constructs a concept mode about Virtual Enterprise Knowledge Transfer 's effectiveness to promote the ability. Meanwhile, it exemplifies the study by using structure equation model and statistical software. The result indicates that the coalition of Virtual Enterprise Knowledge Transfer has a great promotion on the knowledge and technology innovation of member enterprises. Furthermore, the paper is to offer the solution and suggestion during the process of Virtual Enterprise Knowledge Transfer to improve the independent innovation ability of member enterprises.","PeriodicalId":153864,"journal":{"name":"2014 IEEE International Parallel & Distributed Processing Symposium Workshops","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127808679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Characterizing the Impact of Program Optimizations on Power and Energy for Explicit Hydrodynamics 表征程序优化对显式流体力学的功率和能量的影响

2014 IEEE International Parallel & Distributed Processing Symposium Workshops Pub Date : 2014-05-19 DOI: 10.1109/IPDPSW.2014.89

E. León, I. Karlin

{"title":"Characterizing the Impact of Program Optimizations on Power and Energy for Explicit Hydrodynamics","authors":"E. León, I. Karlin","doi":"10.1109/IPDPSW.2014.89","DOIUrl":"https://doi.org/10.1109/IPDPSW.2014.89","url":null,"abstract":"With the end of Denard scaling, future systems will be constrained by power and energy. This will impact application developers by forcing them to restructure and optimize their algorithms in terms of these resources. In this paper, we analyze the impact of different code optimizations on power, energy, and execution time. Our optimizations include loop fusion, data structure transformations, global allocation, and compiler selection. We analyze the static and dynamic components of power and energy as applied to the processor chip and memory domains within a system. In addition, our analysis correlates energy and power changes with performance events and shows that data motion is highly correlated with memory power and energy usage and executed instructions are partially correlated with processor power and energy. Our results demonstrate key tradeoffs among power, energy, and execution time for explicit hydrodynamics via a representative kernel. In particular, we observe that loop fusion and compiler selection improve all objectives, while global allocation and data layout transformations present tradeoffs that are objective-dependent.","PeriodicalId":153864,"journal":{"name":"2014 IEEE International Parallel & Distributed Processing Symposium Workshops","volume":"218 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134394787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 11

Application Level Fault Recovery: Using Fault-Tolerant Open MPI in a PDE Solver 应用程序级故障恢复:在PDE求解器中使用容错开放MPI

2014 IEEE International Parallel & Distributed Processing Symposium Workshops Pub Date : 2014-05-19 DOI: 10.1109/IPDPSW.2014.132

Md. Mohsin Ali, James A. Southern, P. Strazdins, B. Harding

{"title":"Application Level Fault Recovery: Using Fault-Tolerant Open MPI in a PDE Solver","authors":"Md. Mohsin Ali, James A. Southern, P. Strazdins, B. Harding","doi":"10.1109/IPDPSW.2014.132","DOIUrl":"https://doi.org/10.1109/IPDPSW.2014.132","url":null,"abstract":"A fault-tolerant version of Open Message Passing Interface (Open MPI), based on the draft User Level Failure Mitigation (ULFM) proposal of the MPI Forum's Fault Tolerance Working Group, is used to create fault-tolerant applications. This allows applications and libraries to design their own recovery methods and control them at the user level. However, only a limited amount of research work on user level failure recovery (including the implementation and performance evaluation of this prototype) has been carried out. This paper contributes a fault-tolerant implementation of an application solving 2D partial differential equations (PDEs) by means of a sparse grid combination technique which is capable of surviving multiple process failures caused by the faults. Our fault recovery involves reconstructing the faulty communicators without shrinking the global size by re-spawning failed MPI processes on the same physical processors where they were before the failure (for load balancing). It also involves restoring lost data from either exact check pointed data on disk, approximated data in memory (via an alternate sparse grid combination technique) or a near-exact copy of replicated data in memory. The experimental results show that the faulty communicator reconstruction time is currently large in the draft ULFM, especially for multiple process failures. They also show that the alternate combination technique has the lowest data recovery overhead, except on a system with very low disk write latency for which checkpointing has the lowest overhead. Furthermore, the errors due to the recovery of approximated data are within a factor of 10 in all cases, with the surprising result that the alternate combination technique being more accurate than the near-exact replication method. The contributed implementation details, including the analysis of the experimental results, of this paper will help application developers to resolve different issues of design and implementation of fault-tolerant applications by means of the Open MPI ULFM standard.","PeriodicalId":153864,"journal":{"name":"2014 IEEE International Parallel & Distributed Processing Symposium Workshops","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130375847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 30

Fast Generation of Large Task Network Mappings 大型任务网络映射的快速生成

2014 IEEE International Parallel & Distributed Processing Symposium Workshops Pub Date : 2014-05-19 DOI: 10.1109/IPDPSW.2014.170

Karl-Eduard Berger, François Galea, B. L. Cun, Renaud Sirdey

引用次数: 1

An ILP-Based Optimal Circuit Mapping Method for PLDs 基于ilp的pld最优电路映射方法

2014 IEEE International Parallel & Distributed Processing Symposium Workshops Pub Date : 2014-05-19 DOI: 10.1109/IPDPSW.2014.33

Hiroki Nishiyama, Masato Inagi, S. Wakabayashi, Shinobu Nagayama, Keisuke Inoue, M. Kaneko

引用次数: 0