Daisuke Fukutomi, Yuki Iida, Takuya Azumi, S. Kato, N. Nishio
{"title":"GPUhd: Augmenting YARN with GPU Resource Management","authors":"Daisuke Fukutomi, Yuki Iida, Takuya Azumi, S. Kato, N. Nishio","doi":"10.1145/3149457.3155313","DOIUrl":null,"url":null,"abstract":"This paper presents GPUhd, a graphics processing unit (GPU) resource management approach that combines Hadoop and a GPU to obtain scale-out and scale-up functionality. There are several researches that combine Hadoop and GPU. However, there are no researches that can schedule tasks in consideration of GPU resource on Hadoop. Moreover, these researches cannot use multiple distributed frameworks. GPUhd extends the Yet Another Resource Negotiator (YARN) management mechanism and distributed processing frameworks for the coordinated use of GPU resources in Hadoop. We extend the YARN scheduling algorithm to consider GPU resources and incorporate a resources monitoring function. GPU resources can be managed on the basis of existing development methods because GPUhd simply handles GPU resources as host memory and CPU resources. In addition, GPUhd achieves high-speed processing, e.g., the computational time required to calculate 2048 x 2048 matrix multiplication is approximately 25 times less than that required when using only a CPU with Hadoop. GPUhd achieves high scalability and excellent response times in a heterogeneous distributed environment.","PeriodicalId":314778,"journal":{"name":"Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region","volume":"93 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3149457.3155313","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
This paper presents GPUhd, a graphics processing unit (GPU) resource management approach that combines Hadoop and a GPU to obtain scale-out and scale-up functionality. There are several researches that combine Hadoop and GPU. However, there are no researches that can schedule tasks in consideration of GPU resource on Hadoop. Moreover, these researches cannot use multiple distributed frameworks. GPUhd extends the Yet Another Resource Negotiator (YARN) management mechanism and distributed processing frameworks for the coordinated use of GPU resources in Hadoop. We extend the YARN scheduling algorithm to consider GPU resources and incorporate a resources monitoring function. GPU resources can be managed on the basis of existing development methods because GPUhd simply handles GPU resources as host memory and CPU resources. In addition, GPUhd achieves high-speed processing, e.g., the computational time required to calculate 2048 x 2048 matrix multiplication is approximately 25 times less than that required when using only a CPU with Hadoop. GPUhd achieves high scalability and excellent response times in a heterogeneous distributed environment.
本文介绍了GPUhd,一种图形处理单元(GPU)资源管理方法,它结合了Hadoop和GPU来获得横向扩展和纵向扩展功能。有几个研究将Hadoop和GPU结合起来。然而,目前还没有研究可以在Hadoop上考虑GPU资源来调度任务。此外,这些研究不能使用多个分布式框架。GPUhd扩展了另一个资源协商器(YARN)管理机制和分布式处理框架,用于在Hadoop中协调使用GPU资源。我们扩展了YARN调度算法来考虑GPU资源,并加入了资源监控功能。GPU资源可以在现有开发方法的基础上进行管理,因为GPUhd简单地将GPU资源作为主机内存和CPU资源来处理。此外,GPUhd实现了高速处理,例如,计算2048 x 2048矩阵乘法所需的计算时间比仅使用Hadoop的CPU所需的计算时间减少了大约25倍。GPUhd在异构分布式环境中实现了高可伸缩性和出色的响应时间。