{"title":"减轻寄存器压力在GPU内核,以提高能源效率","authors":"Apan Qasem, Samuel Teich","doi":"10.1109/IGCC.2017.8323599","DOIUrl":null,"url":null,"abstract":"GPUs have become an integral part of modern HPC systems. As the HPC landscape moves toward over-provisioned systems with heterogeneous compute nodes, the importance of GPUs is likely to grow. Therefore, strategies for energy savings on these devices are of critical importance. In this paper, we examine the energy efficiency of the register file on current GPUs. We show that ineffective utilization of the register file not only degrades performance but also increases power consumption. To address this problem, we develop a multi-level approach for mitigating register pressure in GPU kernels. Our strategy includes a source-to-source data layout transformation, a method for modulating register pressure (RP) via compiler optimizations, and an autotuner for selecting RP-sensitive kernel launch configurations. Experimental results show that our strategy is able to curb inefficiency in register file usage and can significantly boost performance and energy efficiency in a range of GPU kernels.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Mitigating register pressure in GPU kernels for improved energy efficiency\",\"authors\":\"Apan Qasem, Samuel Teich\",\"doi\":\"10.1109/IGCC.2017.8323599\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"GPUs have become an integral part of modern HPC systems. As the HPC landscape moves toward over-provisioned systems with heterogeneous compute nodes, the importance of GPUs is likely to grow. Therefore, strategies for energy savings on these devices are of critical importance. In this paper, we examine the energy efficiency of the register file on current GPUs. We show that ineffective utilization of the register file not only degrades performance but also increases power consumption. To address this problem, we develop a multi-level approach for mitigating register pressure in GPU kernels. Our strategy includes a source-to-source data layout transformation, a method for modulating register pressure (RP) via compiler optimizations, and an autotuner for selecting RP-sensitive kernel launch configurations. Experimental results show that our strategy is able to curb inefficiency in register file usage and can significantly boost performance and energy efficiency in a range of GPU kernels.\",\"PeriodicalId\":133239,\"journal\":{\"name\":\"2017 Eighth International Green and Sustainable Computing Conference (IGSC)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Eighth International Green and Sustainable Computing Conference (IGSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IGCC.2017.8323599\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IGCC.2017.8323599","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Mitigating register pressure in GPU kernels for improved energy efficiency
GPUs have become an integral part of modern HPC systems. As the HPC landscape moves toward over-provisioned systems with heterogeneous compute nodes, the importance of GPUs is likely to grow. Therefore, strategies for energy savings on these devices are of critical importance. In this paper, we examine the energy efficiency of the register file on current GPUs. We show that ineffective utilization of the register file not only degrades performance but also increases power consumption. To address this problem, we develop a multi-level approach for mitigating register pressure in GPU kernels. Our strategy includes a source-to-source data layout transformation, a method for modulating register pressure (RP) via compiler optimizations, and an autotuner for selecting RP-sensitive kernel launch configurations. Experimental results show that our strategy is able to curb inefficiency in register file usage and can significantly boost performance and energy efficiency in a range of GPU kernels.