Tanzima Z. Islam, Aniruddha Marathe, Holland Schutte, Mohammad Zaeed
{"title":"通过数据驱动分析了解优化后的 GPU 硬件资源使用情况","authors":"Tanzima Z. Islam, Aniruddha Marathe, Holland Schutte, Mohammad Zaeed","doi":"arxiv-2408.10143","DOIUrl":null,"url":null,"abstract":"With heterogeneous systems, the number of GPUs per chip increases to provide\ncomputational capabilities for solving science at a nanoscopic scale. However,\nlow utilization for single GPUs defies the need to invest more money for\nexpensive ccelerators. While related work develops optimizations for improving\napplication performance, none studies how these optimizations impact hardware\nresource usage or the average GPU utilization. This paper takes a data-driven\nanalysis approach in addressing this gap by (1) characterizing how hardware\nresource usage affects device utilization, execution time, or both, (2)\npresenting a multi-objective metric to identify important application-device\ninteractions that can be optimized to improve device utilization and\napplication performance jointly, (3) studying hardware resource usage behaviors\nof several optimizations for a benchmark application, and finally (4)\nidentifying optimization opportunities for several scientific proxy\napplications based on their hardware resource usage behaviors. Furthermore, we\ndemonstrate the applicability of our methodology by applying the identified\noptimizations to a proxy application, which improves the execution time, device\nutilization and power consumption by up to 29.6%, 5.3% and 26.5% respectively.","PeriodicalId":501291,"journal":{"name":"arXiv - CS - Performance","volume":"53 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data-Driven Analysis to Understand GPU Hardware Resource Usage of Optimizations\",\"authors\":\"Tanzima Z. Islam, Aniruddha Marathe, Holland Schutte, Mohammad Zaeed\",\"doi\":\"arxiv-2408.10143\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With heterogeneous systems, the number of GPUs per chip increases to provide\\ncomputational capabilities for solving science at a nanoscopic scale. However,\\nlow utilization for single GPUs defies the need to invest more money for\\nexpensive ccelerators. While related work develops optimizations for improving\\napplication performance, none studies how these optimizations impact hardware\\nresource usage or the average GPU utilization. This paper takes a data-driven\\nanalysis approach in addressing this gap by (1) characterizing how hardware\\nresource usage affects device utilization, execution time, or both, (2)\\npresenting a multi-objective metric to identify important application-device\\ninteractions that can be optimized to improve device utilization and\\napplication performance jointly, (3) studying hardware resource usage behaviors\\nof several optimizations for a benchmark application, and finally (4)\\nidentifying optimization opportunities for several scientific proxy\\napplications based on their hardware resource usage behaviors. Furthermore, we\\ndemonstrate the applicability of our methodology by applying the identified\\noptimizations to a proxy application, which improves the execution time, device\\nutilization and power consumption by up to 29.6%, 5.3% and 26.5% respectively.\",\"PeriodicalId\":501291,\"journal\":{\"name\":\"arXiv - CS - Performance\",\"volume\":\"53 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Performance\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.10143\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Performance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.10143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data-Driven Analysis to Understand GPU Hardware Resource Usage of Optimizations
With heterogeneous systems, the number of GPUs per chip increases to provide
computational capabilities for solving science at a nanoscopic scale. However,
low utilization for single GPUs defies the need to invest more money for
expensive ccelerators. While related work develops optimizations for improving
application performance, none studies how these optimizations impact hardware
resource usage or the average GPU utilization. This paper takes a data-driven
analysis approach in addressing this gap by (1) characterizing how hardware
resource usage affects device utilization, execution time, or both, (2)
presenting a multi-objective metric to identify important application-device
interactions that can be optimized to improve device utilization and
application performance jointly, (3) studying hardware resource usage behaviors
of several optimizations for a benchmark application, and finally (4)
identifying optimization opportunities for several scientific proxy
applications based on their hardware resource usage behaviors. Furthermore, we
demonstrate the applicability of our methodology by applying the identified
optimizations to a proxy application, which improves the execution time, device
utilization and power consumption by up to 29.6%, 5.3% and 26.5% respectively.