Samuel Williams, David A. Patterson, L. Oliker, J. Shalf, Kathy Yelick
The roofline model: A pedagogical tool for program analysis and optimization
2008 IEEE Hot Chips 20 Symposium (HCS), 2008-08-01
DOI: 10.1109/HOTCHIPS.2008.7476531 (https://doi.org/10.1109/HOTCHIPS.2008.7476531)
Citations: 26
Abstract
This article consists of a collection of slides from the authors' conference presentation. The Roofline model is a visually intuitive figure for kernel analysis and optimization. The authors believe undergraduates will find it useful in assessing performance and scalability limitations. It is easily extended to other architectural paradigms and to other metrics, both performance (sort, graphics, crypto, ...) and bandwidth (L2, PCIe, ...). Performance counters could be used to generate a runtime-specific roofline that would greatly aid optimization.
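
The abstract does not spell out the bound itself, so the following is only a minimal sketch of the commonly stated form of the Roofline model: attainable performance is the lesser of peak compute throughput and peak memory bandwidth multiplied by arithmetic intensity (operations per byte moved). The function names and the machine numbers used in the usage note are hypothetical and are not taken from the slides.

```python
def attainable_gflops(peak_gflops: float, peak_bw_gbs: float, intensity: float) -> float:
    """Roofline bound in GFLOP/s.

    peak_gflops: peak compute throughput (GFLOP/s), the flat part of the roof
    peak_bw_gbs: peak memory bandwidth (GB/s), the slope of the slanted part
    intensity:   arithmetic intensity of the kernel (FLOPs per byte moved)
    """
    if intensity < 0:
        raise ValueError("arithmetic intensity must be non-negative")
    # A kernel is limited by whichever resource saturates first.
    return min(peak_gflops, peak_bw_gbs * intensity)


def ridge_point(peak_gflops: float, peak_bw_gbs: float) -> float:
    """Arithmetic intensity (FLOPs/byte) at which the two bounds meet."""
    return peak_gflops / peak_bw_gbs


if __name__ == "__main__":
    # Hypothetical machine: 100 GFLOP/s peak compute, 20 GB/s peak bandwidth.
    for ai in (0.5, 5.0, 10.0):
        print(ai, attainable_gflops(100.0, 20.0, ai))
```

Under these assumed numbers, a kernel with an intensity below the ridge point of 5 FLOPs/byte is bound by bandwidth, and one above it is bound by compute. Swapping in a different bandwidth (for example L2 or PCIe) or a different throughput metric gives the other rooflines the abstract alludes to.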