{"title":"虚拟化软件备件云系统的实时年轻化调度","authors":"Joshua R. Carberry, John Rahme, Haiping Xu","doi":"10.1016/j.jss.2024.112168","DOIUrl":null,"url":null,"abstract":"<div><p>With the increasing popularity of cloud services, there is a growing demand for high reliability and availability of cloud computing. As viable solutions, virtualized software spares and rejuvenation scheduling have been used to maintain highly reliable software platforms and combat Mandelbugs in cloud systems. However, developing real-time rejuvenation schedules for software components with dynamic reliability models has been a challenging task. In this paper, we propose a hybrid approach that integrates preventive and automatic failover strategies to mitigate the harmful effects of Mandelbugs. The approach allows selecting reliability models based on the state of virtualized software components, performing reliability calculations for Software SPare (SSP) gates with up to two virtual hot spares, and scheduling software rejuvenation in real time for cloud systems. Furthermore, the use of Dynamic Fault Tree (DFT) analysis supports the compositional modeling of complex and interconnected systems, alleviating the problem of state-space explosion. Finally, we present a case study of a cloud system with virtualized software spares to demonstrate how rejuvenation schedules can be generated and updated in real time.</p></div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":"217 ","pages":"Article 112168"},"PeriodicalIF":3.7000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Real-Time rejuvenation scheduling for cloud systems with virtualized software spares\",\"authors\":\"Joshua R. Carberry, John Rahme, Haiping Xu\",\"doi\":\"10.1016/j.jss.2024.112168\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>With the increasing popularity of cloud services, there is a growing demand for high reliability and availability of cloud computing. As viable solutions, virtualized software spares and rejuvenation scheduling have been used to maintain highly reliable software platforms and combat Mandelbugs in cloud systems. However, developing real-time rejuvenation schedules for software components with dynamic reliability models has been a challenging task. In this paper, we propose a hybrid approach that integrates preventive and automatic failover strategies to mitigate the harmful effects of Mandelbugs. The approach allows selecting reliability models based on the state of virtualized software components, performing reliability calculations for Software SPare (SSP) gates with up to two virtual hot spares, and scheduling software rejuvenation in real time for cloud systems. Furthermore, the use of Dynamic Fault Tree (DFT) analysis supports the compositional modeling of complex and interconnected systems, alleviating the problem of state-space explosion. Finally, we present a case study of a cloud system with virtualized software spares to demonstrate how rejuvenation schedules can be generated and updated in real time.</p></div>\",\"PeriodicalId\":51099,\"journal\":{\"name\":\"Journal of Systems and Software\",\"volume\":\"217 \",\"pages\":\"Article 112168\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Systems and Software\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0164121224002139\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0164121224002139","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
Real-Time rejuvenation scheduling for cloud systems with virtualized software spares
With the increasing popularity of cloud services, there is a growing demand for high reliability and availability of cloud computing. As viable solutions, virtualized software spares and rejuvenation scheduling have been used to maintain highly reliable software platforms and combat Mandelbugs in cloud systems. However, developing real-time rejuvenation schedules for software components with dynamic reliability models has been a challenging task. In this paper, we propose a hybrid approach that integrates preventive and automatic failover strategies to mitigate the harmful effects of Mandelbugs. The approach allows selecting reliability models based on the state of virtualized software components, performing reliability calculations for Software SPare (SSP) gates with up to two virtual hot spares, and scheduling software rejuvenation in real time for cloud systems. Furthermore, the use of Dynamic Fault Tree (DFT) analysis supports the compositional modeling of complex and interconnected systems, alleviating the problem of state-space explosion. Finally, we present a case study of a cloud system with virtualized software spares to demonstrate how rejuvenation schedules can be generated and updated in real time.
期刊介绍:
The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to:
•Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution
•Agile, model-driven, service-oriented, open source and global software development
•Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems
•Human factors and management concerns of software development
•Data management and big data issues of software systems
•Metrics and evaluation, data mining of software development resources
•Business and economic aspects of software development processes
The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.