Konstantin Levit-Gurevich, Alex Skaletsky, Michael Berezalsky, Yulia Kuznetcova, Hila Yakov
Profiling Intel Graphics Architecture with Long Instruction Traces
2022 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), May 2022
DOI: 10.1109/ISPASS55109.2022.00001
Abstract
Profiling workloads is critical when developing software and hardware. Binary Instrumentation Technology plays a key role in this task for both the x86 architecture and Intel Graphics Processing Units (GPUs). The GTPin framework is the first tool that allows the profiling of graphics and compute kernels running on Intel GPUs. However, GTPin's capabilities are less flexible than those of x86 profiling tools. In this paper, we introduce the concept of “gLIT”, a Long Instruction Trace for Intel GPUs. Generated on real hardware, a gLIT can be replayed on a simulator or an emulator running on the CPU, and thus can be easily profiled and analyzed “on the fly” with analysis tools of any complexity. Since graphics devices are extremely parallel, a gLIT is, by definition, a multi-threaded trace, reflecting a kernel that concurrently runs hundreds of hardware threads. The ability to thoroughly profile and analyze workloads is critical for improving hardware and software readiness and creates new possibilities for academic research on Intel graphics devices.