{"title":"Cost Optimization of Cloud-Based Data Integration System","authors":"P. Zhang, Yanbo Han, Zhuofeng Zhao, Guiling Wang","doi":"10.1109/WISA.2012.13","DOIUrl":null,"url":null,"abstract":"Cloud computing provides virtualized, dynamically-scalable computing power. At the same time, reduction of cost is also considered as an important advantage of cloud computing. Data integration can notably benefit from cloud computing because integrating data is usually an expensive task. However, existing optimization techniques pay less attention on the fact that different execution plans of the same data integration application generate different usage costs while cloud computing provides good enough performance, so this paper introduces the cost optimization of cloud-based data integration system. The data integration system's data service layer facilitates accessing and composing information from a range of enterprise data sources through data service composition. In addition, two task scheduling algorithms for parallel part and non-parallel part are proposed to minimize the usage cost required to complete the execution of composite data service when computational capability provided by cloud computing is charged. Both of the two can obtain optimal plans in polynomial time. Experiments with the system indicate that our algorithms can lead to significant cost saving over more straightforward techniques.","PeriodicalId":313228,"journal":{"name":"2012 Ninth Web Information Systems and Applications Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Ninth Web Information Systems and Applications Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISA.2012.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Cloud computing provides virtualized, dynamically-scalable computing power. At the same time, reduction of cost is also considered as an important advantage of cloud computing. Data integration can notably benefit from cloud computing because integrating data is usually an expensive task. However, existing optimization techniques pay less attention on the fact that different execution plans of the same data integration application generate different usage costs while cloud computing provides good enough performance, so this paper introduces the cost optimization of cloud-based data integration system. The data integration system's data service layer facilitates accessing and composing information from a range of enterprise data sources through data service composition. In addition, two task scheduling algorithms for parallel part and non-parallel part are proposed to minimize the usage cost required to complete the execution of composite data service when computational capability provided by cloud computing is charged. Both of the two can obtain optimal plans in polynomial time. Experiments with the system indicate that our algorithms can lead to significant cost saving over more straightforward techniques.