基于随机模型和故障注入实验的灾难恢复方案可用性分析

Júlio Mendonça, R. Lima, Rúbens de Souza Matos Júnior, João Ferreira, E. Andrade
{"title":"基于随机模型和故障注入实验的灾难恢复方案可用性分析","authors":"Júlio Mendonça, R. Lima, Rúbens de Souza Matos Júnior, João Ferreira, E. Andrade","doi":"10.1109/AINA.2018.00032","DOIUrl":null,"url":null,"abstract":"The Information Technology (IT) systems of most organizations must support their operations 24 hours a day, 7 days a week. Systems unavailability may have serious consequences such as data loss, customer dissatisfaction, and subsequent revenue loss. With the popularity of cloud computing, the adoption of cloud-based disaster recovery (DR) solutions has gained more space to prevent data loss and ensure business continuity. However, disaster recovery solutions are not cheap and do not exist as a single solution that suits every requirement (e.g., availability and costs). In this paper, we present an integrated model-experiment approach to evaluate cloud-based disaster recovery solutions. We use Stochastic Petri Nets (SPNs) and fault-injection experiments to evaluate availability related metrics like steady-state availability and downtime. To demonstrate the feasibility of our approach, distinct real-world cloud-based DR solutions (e.g., active/active and active/standby) are modeled and analyzed. The results revealed that disaster recovery solution significantly improves system availability and minimizes the downtime costs. In addition, our numerical analysis shows the statistical correspondence between the results of the experiments and models.","PeriodicalId":239730,"journal":{"name":"2018 IEEE 32nd International Conference on Advanced Information Networking and Applications (AINA)","volume":"52 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Availability Analysis of a Disaster Recovery Solution Through Stochastic Models and Fault Injection Experiments\",\"authors\":\"Júlio Mendonça, R. Lima, Rúbens de Souza Matos Júnior, João Ferreira, E. Andrade\",\"doi\":\"10.1109/AINA.2018.00032\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Information Technology (IT) systems of most organizations must support their operations 24 hours a day, 7 days a week. Systems unavailability may have serious consequences such as data loss, customer dissatisfaction, and subsequent revenue loss. With the popularity of cloud computing, the adoption of cloud-based disaster recovery (DR) solutions has gained more space to prevent data loss and ensure business continuity. However, disaster recovery solutions are not cheap and do not exist as a single solution that suits every requirement (e.g., availability and costs). In this paper, we present an integrated model-experiment approach to evaluate cloud-based disaster recovery solutions. We use Stochastic Petri Nets (SPNs) and fault-injection experiments to evaluate availability related metrics like steady-state availability and downtime. To demonstrate the feasibility of our approach, distinct real-world cloud-based DR solutions (e.g., active/active and active/standby) are modeled and analyzed. The results revealed that disaster recovery solution significantly improves system availability and minimizes the downtime costs. In addition, our numerical analysis shows the statistical correspondence between the results of the experiments and models.\",\"PeriodicalId\":239730,\"journal\":{\"name\":\"2018 IEEE 32nd International Conference on Advanced Information Networking and Applications (AINA)\",\"volume\":\"52 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 32nd International Conference on Advanced Information Networking and Applications (AINA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AINA.2018.00032\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 32nd International Conference on Advanced Information Networking and Applications (AINA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AINA.2018.00032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

大多数组织的信息技术(IT)系统必须支持他们每周7天,每天24小时的操作。系统不可用可能会产生严重的后果,例如数据丢失、客户不满以及随后的收入损失。随着云计算的普及,采用基于云的容灾解决方案为防止数据丢失和确保业务连续性提供了更大的空间。但是,灾难恢复解决方案并不便宜,也不是作为满足所有需求(例如可用性和成本)的单一解决方案而存在的。在本文中,我们提出了一种综合模型-实验方法来评估基于云的灾难恢复解决方案。我们使用随机Petri网(spn)和故障注入实验来评估可用性相关指标,如稳态可用性和停机时间。为了证明我们方法的可行性,我们对现实世界中不同的基于云的灾难恢复解决方案(例如,主动/主动和主动/备用)进行了建模和分析。结果表明,灾难恢复解决方案显著提高了系统可用性,并最大限度地减少了停机成本。此外,我们的数值分析表明了实验结果与模型之间的统计一致性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Availability Analysis of a Disaster Recovery Solution Through Stochastic Models and Fault Injection Experiments
The Information Technology (IT) systems of most organizations must support their operations 24 hours a day, 7 days a week. Systems unavailability may have serious consequences such as data loss, customer dissatisfaction, and subsequent revenue loss. With the popularity of cloud computing, the adoption of cloud-based disaster recovery (DR) solutions has gained more space to prevent data loss and ensure business continuity. However, disaster recovery solutions are not cheap and do not exist as a single solution that suits every requirement (e.g., availability and costs). In this paper, we present an integrated model-experiment approach to evaluate cloud-based disaster recovery solutions. We use Stochastic Petri Nets (SPNs) and fault-injection experiments to evaluate availability related metrics like steady-state availability and downtime. To demonstrate the feasibility of our approach, distinct real-world cloud-based DR solutions (e.g., active/active and active/standby) are modeled and analyzed. The results revealed that disaster recovery solution significantly improves system availability and minimizes the downtime costs. In addition, our numerical analysis shows the statistical correspondence between the results of the experiments and models.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信