{"title":"Scheduling real-time parallel applications in SaaS clouds in the presence of transient software failures","authors":"Georgios L. Stavrinides, H. Karatza","doi":"10.1109/SPECTS.2016.7570524","DOIUrl":null,"url":null,"abstract":"One of the major challenges in a Software as a Service (SaaS) cloud, is the fault-tolerant and cost-effective scheduling and execution of the end-user applications within strict time constraints, in order to provide results of acceptable quality. Towards this direction, we investigate the performance of strategies for the scheduling of fine-grained parallel applications in a SaaS cloud in the presence of transient software failures, which may occur during the execution of the applications. We combine and compare by simulation different techniques that incorporate application-directed checkpointing and approximate computations, in an attempt to: (a) provide resilience against temporary software failures, (b) guarantee that all applications will meet their deadline, (c) provide application results of as high quality as possible and (d) minimize the monetary cost charged to the end-users for the execution of their applications.","PeriodicalId":302558,"journal":{"name":"2016 International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPECTS.2016.7570524","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18
Abstract
One of the major challenges in a Software as a Service (SaaS) cloud, is the fault-tolerant and cost-effective scheduling and execution of the end-user applications within strict time constraints, in order to provide results of acceptable quality. Towards this direction, we investigate the performance of strategies for the scheduling of fine-grained parallel applications in a SaaS cloud in the presence of transient software failures, which may occur during the execution of the applications. We combine and compare by simulation different techniques that incorporate application-directed checkpointing and approximate computations, in an attempt to: (a) provide resilience against temporary software failures, (b) guarantee that all applications will meet their deadline, (c) provide application results of as high quality as possible and (d) minimize the monetary cost charged to the end-users for the execution of their applications.