{"title":"A Deep Q-Learning Approach for GPU Task Scheduling","authors":"R. Luley, Qinru Qiu","doi":"10.1109/HPEC43674.2020.9286238","DOIUrl":null,"url":null,"abstract":"Efficient utilization of resources is critical to system performance and effectiveness for high performance computing systems. In a graphics processing unit (GPU) -based system, one method for enabling higher utilization is concurrent kernel execution - allowing multiple independent kernels to simultaneously execute on the GPU. However, resource contention due to the manner in which kernel tasks are scheduled may still lead to suboptimal task performance and utilization. In this work, we present a deep Q-learning approach to identify an ordering for a given set of tasks which achieves near-optimal average task performance and high resource utilization. Our solution outperforms other similar approaches and has additional benefit of being adaptable to dynamic task characteristics or GPU resource configurations.","PeriodicalId":168544,"journal":{"name":"2020 IEEE High Performance Extreme Computing Conference (HPEC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE High Performance Extreme Computing Conference (HPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPEC43674.2020.9286238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Efficient utilization of resources is critical to system performance and effectiveness for high performance computing systems. In a graphics processing unit (GPU) -based system, one method for enabling higher utilization is concurrent kernel execution - allowing multiple independent kernels to simultaneously execute on the GPU. However, resource contention due to the manner in which kernel tasks are scheduled may still lead to suboptimal task performance and utilization. In this work, we present a deep Q-learning approach to identify an ordering for a given set of tasks which achieves near-optimal average task performance and high resource utilization. Our solution outperforms other similar approaches and has additional benefit of being adaptable to dynamic task characteristics or GPU resource configurations.