Philippe Marcotte, Frédéric Grégoire, Fábio Petrillo
{"title":"Multiple Fault-Tolerance Mechanisms in Cloud Systems: A Systematic Review","authors":"Philippe Marcotte, Frédéric Grégoire, Fábio Petrillo","doi":"10.1109/ISSREW.2019.00104","DOIUrl":null,"url":null,"abstract":"Cloud systems are progressively taking over today's software market. These typically require constant operations with a minimum of failure. Multiple fault-tolerance mechanisms have been developed to maximize the availability of cloud systems. In this paper, we propose a systematic review of the literature on fault-tolerance mechanisms. We discussed the current trends, and we found that migration, checkpointing and replication are the most widely used based on the number of references. In contrast, some mechanism like task resubmission and n-version programming seems to incur less research.","PeriodicalId":166239,"journal":{"name":"2019 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSREW.2019.00104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Cloud systems are progressively taking over today's software market. These typically require constant operations with a minimum of failure. Multiple fault-tolerance mechanisms have been developed to maximize the availability of cloud systems. In this paper, we propose a systematic review of the literature on fault-tolerance mechanisms. We discussed the current trends, and we found that migration, checkpointing and replication are the most widely used based on the number of references. In contrast, some mechanism like task resubmission and n-version programming seems to incur less research.