Shirley Moore, D. Cronk, S. Shende, A. Malony, sameer
{"title":"Loop-Level Profiling and Analysis of DoD Applications Using TAU","authors":"Shirley Moore, D. Cronk, S. Shende, A. Malony, sameer","doi":"10.1109/HPCMP-UGC.2006.43","DOIUrl":null,"url":null,"abstract":"Performance of computationally intensive applications often depends critically on the floating point and memory performance of nested loop structures, this paper describes extensions to the Tuning Analysis and Utilities (TAU) parallel performance system that implement automated of parallel C/C++ and Fortran programs to collect loop-level profile data. Link-time and run-time options for configuring the instrumented version of the code to perform various types of measurements, such as time and hardware counter based profiling are described. Finally, examples are given of collecting and analyzing loop-level profile data for several DoD applications","PeriodicalId":173959,"journal":{"name":"2006 HPCMP Users Group Conference (HPCMP-UGC'06)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 HPCMP Users Group Conference (HPCMP-UGC'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCMP-UGC.2006.43","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Performance of computationally intensive applications often depends critically on the floating point and memory performance of nested loop structures, this paper describes extensions to the Tuning Analysis and Utilities (TAU) parallel performance system that implement automated of parallel C/C++ and Fortran programs to collect loop-level profile data. Link-time and run-time options for configuring the instrumented version of the code to perform various types of measurements, such as time and hardware counter based profiling are described. Finally, examples are given of collecting and analyzing loop-level profile data for several DoD applications