Inferring Large-Scale Computation Behavior via Trace Extrapolation

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum. Pub Date: 2013-05-20. DOI: 10.1109/IPDPSW.2013.137

L. Carrington, M. Laurenzano, Ananta Tiwari


Citations: 19

Abstract

Understanding large-scale application behavior is critical for effectively utilizing existing HPC resources and for making design decisions for upcoming systems. In this work we present a methodology for characterizing an MPI application's large-scale computation behavior and system requirements using information about the behavior of that application at a series of smaller core counts. For each of a set of features that are important for both performance and energy (cache hit rates, floating point intensity, ILP, etc.), the methodology selects, from among a set of canonical functions, the best statistical fit to how that feature changes across a series of small core counts. The statistical models for each of these application features can then be used to generate an extrapolated trace of the application at scale. The fidelity of the fully extrapolated traces is evaluated by building performance models from both the extrapolated trace and an actual trace, then comparing the application performance predicted using each. For two full-scale HPC applications, SPECFEM3D and UH3D, the extrapolated traces had absolute relative errors of less than 5%.
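
The fit-and-extrapolate step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not list the canonical functions, so the candidate set below (linear, logarithmic, inverse), the least-squares selection criterion, and the names `CANDIDATES`, `fit_best`, and `abs_rel_error` are all assumptions made for illustration. The data in the usage lines is synthetic.

```python
import numpy as np

# Hypothetical candidate forms g(p); the abstract does not name the canonical set.
CANDIDATES = {
    "linear": lambda p: p,
    "logarithmic": lambda p: np.log(p),
    "inverse": lambda p: 1.0 / p,
}

def fit_best(cores, values):
    """Fit value ~ a * g(cores) + b for each candidate g and keep the
    one with the smallest sum of squared residuals (an assumed criterion)."""
    cores = np.asarray(cores, dtype=float)
    values = np.asarray(values, dtype=float)
    best = None
    for name, g in CANDIDATES.items():
        design = np.column_stack([g(cores), np.ones_like(cores)])
        coef, *_ = np.linalg.lstsq(design, values, rcond=None)
        sse = float(np.sum((design @ coef - values) ** 2))
        if best is None or sse < best[0]:
            best = (sse, name, g, coef)
    _, name, g, coef = best

    def predict(p):
        # Evaluate the selected model at a (possibly much larger) core count.
        return float(coef[0] * g(np.asarray(p, dtype=float)) + coef[1])

    return name, predict

def abs_rel_error(predicted, actual):
    """Absolute relative error between a predicted and a measured value."""
    return abs(predicted - actual) / abs(actual)

# Synthetic feature values (e.g. a cache hit rate) at small core counts.
small_counts = np.array([64.0, 128.0, 256.0, 512.0])
hit_rate = 0.95 - 0.01 * np.log(small_counts)
form, model = fit_best(small_counts, hit_rate)
extrapolated = model(4096)  # feature value for the extrapolated trace
```

In a full pipeline, a model like this would be fit separately for each traced feature, and the extrapolated values would populate the trace at the target core count. With only a handful of small core counts, several forms can fit almost equally well, so selecting on residuals alone is a simplification.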
