Pre-exascale accelerated application development: The ORNL Summit experience

IF 1.3; Zone 4 (Computer Science); JCR Q1 (Computer Science)
L. Luo;T. P. Straatsma;L. E. Aguilar Suarez;R. Broer;D. Bykov;E. F. D'Azevedo;S. S. Faraji;K. C. Gottiparthi;C. De Graaf;J. A. Harris;R. W. A. Havenith;H. J. Aa. Jensen;W. Joubert;R. K. Kathir;J. Larkin;Y. W. Li;D. I. Lyakh;O. E. B. Messer;M. R. Norman;J. C. Oefelein;R. Sankaran;A. F. Tillack;A. L. Barnes;L. Visscher;J. C. Wells;M. Wibowo
DOI: 10.1147/JRD.2020.2965881
Published: 2020-01-15
Publication type: Journal Article
Source: https://ieeexplore.ieee.org/document/8960361/
Citations: 10

Abstract

High-performance computing (HPC) increasingly relies on heterogeneous architectures to achieve higher performance. At the Oak Ridge Leadership Computing Facility (OLCF), Oak Ridge, TN, USA, this trend continues with its latest supercomputer, Summit, which entered production in early 2019. The combination of IBM POWER9 CPUs and NVIDIA V100 GPUs, along with the fast NVLink2 interconnect and other recent technologies, pushes system performance to a new height and breaks the exascale barrier by certain measures. Because of Summit's powerful GPUs and much higher GPU–CPU ratio, offloading to accelerators becomes a requirement for any application that intends to use the system effectively. To help navigate a complex landscape of competing heterogeneous architectures, a collection of applications from a wide spectrum of scientific domains was selected for early adoption on Summit. This article summarizes the experience and lessons learned, in the hope of providing useful guidance for addressing new programming challenges, such as scalability, performance portability, and software maintainability, in future application development efforts on heterogeneous HPC systems.
Source journal
IBM Journal of Research and Development (Engineering and Technology: Computer Science, Hardware)
Self-citation rate: 0.00%
Articles published: 0
Review time: 6-12 weeks
About the journal: The IBM Journal of Research and Development is a peer-reviewed technical journal, published bimonthly, which features the work of authors in the science, technology and engineering of information systems. Papers are written for the worldwide scientific research and development community and knowledgeable professionals. Submitted papers are welcome from the IBM technical community and from non-IBM authors on topics relevant to the scientific and technical content of the Journal.