Service reliability modeling of the IT infrastructure of active-active cloud data center

Yue Liu, Xiaoyang Li, R. Kang, Lianghua Xiao
DOI: 10.1109/PHM.2016.7819903 (https://doi.org/10.1109/PHM.2016.7819903)
Published in: 2016 Prognostics and System Health Management Conference (PHM-Chengdu)
Publication date: 2016-10-01
Citations: 4

Abstract

As cloud data centers are adopted in more and more areas, they have gradually shown their superiority over traditional data centers in availability, resource utilization, and disaster recovery during service delivery. Service-level agreements (SLAs) place requirements on service reliability and related indexes. Despite efforts at fault tolerance and redundancy, failures in a data center are still inevitable. Hence, there is a need to model and analyze the service reliability of the data center. However, traditional reliability modeling methods are no longer applicable because of complicated cloud control flows, massive-scale service sharing, and complex real-world infrastructures. This paper proposes a new approach to modeling the service reliability of the IT infrastructure of an active-active data center, a typical form of cloud data center. First, we divide the service delivery process into two stages: the request stage and the execution stage. Then we build a model for each stage. For the request stage, we use queuing theory and the Monte Carlo method; for the execution stage, we use graph theory and the Monte Carlo method. Based on the proposed model, the service reliability of the data center can be calculated. Finally, using data from a company's active-active data center as a case study, we demonstrate the applicability and correctness of the model.
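
The abstract does not give the model details, so the following is only a minimal illustrative sketch of the two-stage idea, not the authors' model. It assumes a single-server FIFO queue with exponential arrivals and service for the request stage, where a request succeeds if its waiting plus service time meets a deadline. For the execution stage it assumes a small directed graph of independently failing components, where execution succeeds if a path of working components links the entry point to the backend. The function names, the topology, every rate and reliability value, and the independence of the two stages are all hypothetical.

```python
import random
from collections import deque


def request_stage_reliability(arrival_rate, service_rate, deadline, n_requests, rng):
    """Monte Carlo estimate of P(wait + service <= deadline) in a FIFO single-server queue."""
    wait = 0.0      # waiting time seen by the current request
    on_time = 0
    for _ in range(n_requests):
        service = rng.expovariate(service_rate)
        if wait + service <= deadline:
            on_time += 1
        gap = rng.expovariate(arrival_rate)  # time until the next arrival
        # Lindley recursion: waiting time seen by the next request.
        wait = max(0.0, wait + service - gap)
    return on_time / n_requests


def is_connected(edges, up, source, target):
    """Breadth-first search restricted to components that are currently up."""
    if source not in up or target not in up:
        return False
    seen, frontier = {source}, deque([source])
    while frontier:
        node = frontier.popleft()
        if node == target:
            return True
        for nxt in edges.get(node, ()):
            if nxt in up and nxt not in seen:
                seen.add(nxt)
                frontier.append(nxt)
    return False


def execution_stage_reliability(edges, node_reliability, source, target, n_trials, rng):
    """Monte Carlo estimate of P(source reaches target) with independent node failures."""
    ok = 0
    for _ in range(n_trials):
        # Sample which components are working in this trial.
        up = {n for n, r in node_reliability.items() if rng.random() < r}
        ok += is_connected(edges, up, source, target)
    return ok / n_trials


# Hypothetical active-active layout: two sites, either of which can serve a request.
EDGES = {"gateway": ["site_a", "site_b"], "site_a": ["backend"], "site_b": ["backend"]}
NODE_RELIABILITY = {"gateway": 0.999, "site_a": 0.95, "site_b": 0.95, "backend": 0.999}

if __name__ == "__main__":
    rng = random.Random(42)
    r_request = request_stage_reliability(8.0, 10.0, 1.0, 100_000, rng)
    r_execute = execution_stage_reliability(
        EDGES, NODE_RELIABILITY, "gateway", "backend", 20_000, rng
    )
    # Assumption: the two stages fail independently, so their reliabilities multiply.
    print("request stage:", r_request)
    print("execution stage:", r_execute)
    print("service reliability:", r_request * r_execute)
```

In this toy layout the two sites act as parallel paths, so the execution stage fails only when both sites, the gateway, or the backend are down. A real model would replace the hypothetical graph and rates with measured topology and failure data.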