{"title":"Do More Replicas of Object Data Improve the Performance of Cloud Data Centers?","authors":"Zeng Zeng, B. Veeravalli","doi":"10.1109/UCC.2012.11","DOIUrl":null,"url":null,"abstract":"Nowadays, more and more researchers have focused on the performance of cloud data centers. Successful development of cloud data center paradigm necessitates the best QoS for the end users and the Mean Response Time (MRT) of the data requests is one of the most important performance indicators that shall be emphasized on. A cloud data center consists clusters of Rawdata Servers (RDS) that can provide raw data retrieval service. For a single data stored in the data center, there may be several RDS with the target raw data replicas. Hence, when a data request arriving, it has many potential data request paths and the system shall determine the best one for it. In this paper, we aim at answering an interesting question: “Do More Replicas of Object Data Improve the Performance of Cloud Data Centers?”, in order to achieve the minimum MRT of all the requests. The target optimal constrained function has been formulated and two novel load balancing algorithms based on virtual routing method has been proposed, which can achieve near-optimal solutions by theoretical proof. We also found distributing the requests for the same objects among several RDS for load balancing purpose, which is widely used in most data centers, would worsen the system performance. We validate our findings via rigorous simulations with respect to several influencing factors and prove that our proposed strategy is scalable, flexible and efficient for the real-life applications.","PeriodicalId":122639,"journal":{"name":"2012 IEEE Fifth International Conference on Utility and Cloud Computing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Fifth International Conference on Utility and Cloud Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UCC.2012.11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Nowadays, more and more researchers have focused on the performance of cloud data centers. Successful development of cloud data center paradigm necessitates the best QoS for the end users and the Mean Response Time (MRT) of the data requests is one of the most important performance indicators that shall be emphasized on. A cloud data center consists clusters of Rawdata Servers (RDS) that can provide raw data retrieval service. For a single data stored in the data center, there may be several RDS with the target raw data replicas. Hence, when a data request arriving, it has many potential data request paths and the system shall determine the best one for it. In this paper, we aim at answering an interesting question: “Do More Replicas of Object Data Improve the Performance of Cloud Data Centers?”, in order to achieve the minimum MRT of all the requests. The target optimal constrained function has been formulated and two novel load balancing algorithms based on virtual routing method has been proposed, which can achieve near-optimal solutions by theoretical proof. We also found distributing the requests for the same objects among several RDS for load balancing purpose, which is widely used in most data centers, would worsen the system performance. We validate our findings via rigorous simulations with respect to several influencing factors and prove that our proposed strategy is scalable, flexible and efficient for the real-life applications.
目前,越来越多的研究人员开始关注云数据中心的性能。云数据中心范式的成功开发需要为最终用户提供最佳的QoS,数据请求的平均响应时间(MRT)是最重要的性能指标之一,需要重点关注。云数据中心由可以提供原始数据检索服务的Rawdata server (RDS)集群组成。对于存储在数据中心中的单个数据,可能存在多个具有目标原始数据副本的RDS。因此,当一个数据请求到达时,它有许多潜在的数据请求路径,系统将为其确定最佳路径。在本文中,我们的目标是回答一个有趣的问题:“更多的对象数据副本是否会提高云数据中心的性能?”,以达到所有要求的最小MRT。提出了目标最优约束函数,并提出了两种基于虚拟路由方法的负载均衡算法,通过理论证明,这两种算法都能达到近最优解。我们还发现,在多个RDS中分发对相同对象的请求以实现负载平衡(这种方法在大多数数据中心中广泛使用)会降低系统性能。我们通过对几个影响因素的严格模拟来验证我们的发现,并证明我们提出的策略对于现实应用是可扩展的,灵活的和高效的。