{"title":"Task Assignment in a Virtualized GPU Enabled Cloud","authors":"Hari Sivaraman, Uday Kurkure, Lan Vu","doi":"10.1109/HPCS.2018.00143","DOIUrl":null,"url":null,"abstract":"Cloud computing vendors are beginning to offer GPU based high performance computing as a service. One approach uses virtual machines (VM), running in a hypervisor like VMware vSphere, equipped with virtual GPUs like Nvidia's vGPU solution. In this approach, multiple VMs running concurrently can share a single GPU. The number of VMs that share the GPU can be configured by the user/system administrator. Further, VMs can be re-assigned to GPUs, if more than one is available, dynamically. This approach allows tasks/jobs that use GPUs to run in individual VMs guaranteeing isolation whilst sharing resources. In a typical cloud environment with multiple servers each with one or more GPUs, finding an efficient, fast solution to the problem of placing VMs (i.e. VM-placement) on GPUs and moving them around as needed is extremely important to achieve high throughput of tasks while maximizing server utilization and minimizing task wait times. In this paper, we present the simulator we built to compare different solutions to the problem of VM-placement together with some early results.","PeriodicalId":308138,"journal":{"name":"2018 International Conference on High Performance Computing & Simulation (HPCS)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCS.2018.00143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Cloud computing vendors are beginning to offer GPU based high performance computing as a service. One approach uses virtual machines (VM), running in a hypervisor like VMware vSphere, equipped with virtual GPUs like Nvidia's vGPU solution. In this approach, multiple VMs running concurrently can share a single GPU. The number of VMs that share the GPU can be configured by the user/system administrator. Further, VMs can be re-assigned to GPUs, if more than one is available, dynamically. This approach allows tasks/jobs that use GPUs to run in individual VMs guaranteeing isolation whilst sharing resources. In a typical cloud environment with multiple servers each with one or more GPUs, finding an efficient, fast solution to the problem of placing VMs (i.e. VM-placement) on GPUs and moving them around as needed is extremely important to achieve high throughput of tasks while maximizing server utilization and minimizing task wait times. In this paper, we present the simulator we built to compare different solutions to the problem of VM-placement together with some early results.