{"title":"IDLE: Integrated Deep Learning Engine with Adaptive Task Scheduling on Heterogeneous GPUs","authors":"Taewoo Kim, Eunju Yang, S. Bae, Chan-Hyun Youn","doi":"10.1109/TENCON.2018.8650284","DOIUrl":null,"url":null,"abstract":"As the deep learning (DL) has widely been used for application domains such as image classifications, natural language processing, and speech recognition, various software frameworks have been developed. They provide users with efficient programming interfaces for developing the DL applications. The optimization techniques within these frameworks generally are different from each other, which leads to different processing times for even the same applications. However, it is difficult that end users consider performance differences in processing time due to incompatible programming interface among the DL frameworks. These differences might cause redundant efforts and costs for end users to develop and maintain the applications. In this paper, we introduce an integrated deep learning engine (IDLE), a novel interface working on the top of the existing DL frameworks, which provides a convenient, flexible and scalable programming interface developing the DL applications for end users regardless of DL frameworks. Besides, we also propose a novel adaptive task scheduling scheme for training DL applications in a cluster with different GPUs. We implement our platform on the heterogeneous GPU cluster, and the results show that the proposed scheduling algorithm improves cost efficiency processing various DL applications.","PeriodicalId":132900,"journal":{"name":"TENCON 2018 - 2018 IEEE Region 10 Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"TENCON 2018 - 2018 IEEE Region 10 Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON.2018.8650284","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
As the deep learning (DL) has widely been used for application domains such as image classifications, natural language processing, and speech recognition, various software frameworks have been developed. They provide users with efficient programming interfaces for developing the DL applications. The optimization techniques within these frameworks generally are different from each other, which leads to different processing times for even the same applications. However, it is difficult that end users consider performance differences in processing time due to incompatible programming interface among the DL frameworks. These differences might cause redundant efforts and costs for end users to develop and maintain the applications. In this paper, we introduce an integrated deep learning engine (IDLE), a novel interface working on the top of the existing DL frameworks, which provides a convenient, flexible and scalable programming interface developing the DL applications for end users regardless of DL frameworks. Besides, we also propose a novel adaptive task scheduling scheme for training DL applications in a cluster with different GPUs. We implement our platform on the heterogeneous GPU cluster, and the results show that the proposed scheduling algorithm improves cost efficiency processing various DL applications.