{"title":"Non-parametric predictive inference of adaptive software rejuvenation schedule","authors":"K. Rinsaka, T. Dohi","doi":"10.1109/ISSREW.2008.5355521","DOIUrl":"https://doi.org/10.1109/ISSREW.2008.5355521","url":null,"abstract":"In this paper we develop an adaptive approach to estimate the optimal preventive rejuvenation schedule which maximizes the steady-state system availability. We formulate the upper and lower bounds of the predictive system availability using the one-look ahead predictive survival function from system failure time data, and derive the pessimistic and optimistic rejuvenation policies. Then, we derive adaptive rejuvenation policies from the original data together with a right-censored observation.","PeriodicalId":436527,"journal":{"name":"2008 IEEE International Conference on Software Reliability Engineering Workshops (ISSRE Wksp)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124342669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A feedback control approach for software rejuvenation in a web server","authors":"Yun-Fei Jia, Jing-Ya Su, K. Cai","doi":"10.1109/ISSREW.2008.5355514","DOIUrl":"https://doi.org/10.1109/ISSREW.2008.5355514","url":null,"abstract":"Software aging refers to the phenomenon that applications will show growing failure rate or performance degradation after longtime execution. This phenomenon usually results from the accumulation of error conditions, such as memory leaks, round-off errors, file tables not released, etc. Unexpected downtime cost due to software aging is huge, particularly for e-commerce Websites and safety/business-critical applications. Software rejuvenation is an effective but expensive fault tolerance technique to deal with software aging phenomenon. This technique usually involves the termination of services due to the restart of computer systems. In this paper, a ¿lightweight¿ complementary technique of software rejuvenation is proposed to reduce the severity of software aging in terms of its impact on system resource usage by online tuning the settings of related parameters. Unlike traditional software restart involved in traditional rejuvenation, this technique is based on the idea of fine adjustment of software state. Experiments are conducted to reveal the relationship between certain parameters of Web server and resource usage. Accordingly, a PI (proportional-integral) controller is designed to online tune the parameters of web server with the purpose of counteracting the effect of software aging. Simulation results show that the technique is cost-effective.","PeriodicalId":436527,"journal":{"name":"2008 IEEE International Conference on Software Reliability Engineering Workshops (ISSRE Wksp)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134486037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A queuing model for service availability of systems with rejuvenation","authors":"Felix Salfner, K. Wolter","doi":"10.1109/ISSREW.2008.5355520","DOIUrl":"https://doi.org/10.1109/ISSREW.2008.5355520","url":null,"abstract":"In this paper we present a queuing model to investigate the effect of time-based system rejuvenation on service availability. The model is formulated as a stochastic colored Petri net, which allows us to use realistic distributions such as the lognormal distribution. We define a metric for service availability and derive how it can be estimated from the model. Experiments show that the optimal rejuvenation interval as well as the achievable service availability improvement depend significantly on system utilization.","PeriodicalId":436527,"journal":{"name":"2008 IEEE International Conference on Software Reliability Engineering Workshops (ISSRE Wksp)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124518193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis and application of conditional software rejuvenation — A new approach","authors":"Hitesh Shetty, M. Nambiar, H. Kalita","doi":"10.1109/ISSREW.2008.5355522","DOIUrl":"https://doi.org/10.1109/ISSREW.2008.5355522","url":null,"abstract":"Software aging effects if left uncontrolled are known to cause availability issues. When an application is serving live in production, fixing such issues is close to impossible. Software rejuvenation techniques can be used effectively to avoid failure, but will incur some downtime if a single instance of the application is running. With two instances the system can be engineered in such a way that there is at least one instance running at any time. There may be further reduction in operating expenses if an intelligent program can take charge of executing rejuvenation tasks as required. This paper describes the implementation of such closed loop software rejuvenation in a real world application and the resulting improvement in availability thereof.","PeriodicalId":436527,"journal":{"name":"2008 IEEE International Conference on Software Reliability Engineering Workshops (ISSRE Wksp)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115431861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Michael Grottke, Rivalino Matias, Kishor S. Trivedi
{"title":"The fundamentals of software aging","authors":"Michael Grottke, Rivalino Matias, Kishor S. Trivedi","doi":"10.1109/ISSREW.2008.5355512","DOIUrl":"https://doi.org/10.1109/ISSREW.2008.5355512","url":null,"abstract":"Since the notion of software aging was introduced thirteen years ago, the interest in this phenomenon has been increasing from both academia and industry. The majority of the research efforts in studying software aging have focused on understanding its effects theoretically and empirically. However, conceptual aspects related to the foundation of this phenomenon have not been covered in the literature. This paper discusses foundational aspects of the software aging phenomenon, introducing new concepts and interconnecting them with the current body of knowledge, in order to compose a base taxonomy for the software aging research. Three real case studies are presented with the purpose of exemplifying many of the concepts discussed.","PeriodicalId":436527,"journal":{"name":"2008 IEEE International Conference on Software Reliability Engineering Workshops (ISSRE Wksp)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128125261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
G. Carrozza, Domenico Cotroneo, R. Natella, A. Pecchia, S. Russo
{"title":"An experiment in memory leak analysis with a mission-critical middleware for air traffic control","authors":"G. Carrozza, Domenico Cotroneo, R. Natella, A. Pecchia, S. Russo","doi":"10.1109/ISSREW.2008.5355513","DOIUrl":"https://doi.org/10.1109/ISSREW.2008.5355513","url":null,"abstract":"This paper reports a practical experience with memory analysis on a real world complex middleware platform, being developed in the context of an academic-industrial collaboration. The reported experience suggests a practical method that can help practitioners to analyze memory leaks and to adopt proper actions to mitigate these bugs, especially in the context of complex off-the-shelf based software systems, and in some cases it highlights issues still open. Indeed, we experience that fixing a memory leak, when possible, might be not enough to solve the memory exhaustion problem.","PeriodicalId":436527,"journal":{"name":"2008 IEEE International Conference on Software Reliability Engineering Workshops (ISSRE Wksp)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131258086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Availability optimization in operational software system with aperiodic time-based software rejuvenation scheme","authors":"H. Okamura, T. Dohi","doi":"10.1109/ISSREW.2008.5355516","DOIUrl":"https://doi.org/10.1109/ISSREW.2008.5355516","url":null,"abstract":"This paper discusses an aperiodic time-based rejuvenation policy maximizing the steady-state system availability in operational software system. Under a fixed periodic checkpoint schedule, we develop an algorithm to derive the optimal aperiodic rejuvenation times based on dynamic programming. In numerical examples, the optimal rejuvenation time and its maximum availability are illustrated in the case where the system failure obeys the Weibull distribution.","PeriodicalId":436527,"journal":{"name":"2008 IEEE International Conference on Software Reliability Engineering Workshops (ISSRE Wksp)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125422524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Availability and reliability estimation for a system undergoing minimal, perfect and failed rejuvenation","authors":"V. Koutras, A. Platis, N. Limnios","doi":"10.1109/ISSREW.2008.5355519","DOIUrl":"https://doi.org/10.1109/ISSREW.2008.5355519","url":null,"abstract":"In this paper, a software rejuvenation model is presented in which two different rejuvenation actions are considered, perfect and minimal. The concept of a failed rejuvenation action which leads the system to failure is also introduced. The presented model is studied under a Continuous Time Markov Chain (CTMC) framework and a maximum likelihood estimator of the generator matrix is presented. Based on this, estimators for instantaneous availability and reliability function are also presented. Moreover, the behavior of the above estimators is studied under various rejuvenation policies. A numerical example based on simulation results is finally presented.","PeriodicalId":436527,"journal":{"name":"2008 IEEE International Conference on Software Reliability Engineering Workshops (ISSRE Wksp)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124295842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
F. Machida, Dong Seong Kim, Jong Sou Park, Kishor S. Trivedi
{"title":"Toward optimal virtual machine placement and rejuvenation scheduling in a virtualized data center","authors":"F. Machida, Dong Seong Kim, Jong Sou Park, Kishor S. Trivedi","doi":"10.1109/ISSREW.2008.5355515","DOIUrl":"https://doi.org/10.1109/ISSREW.2008.5355515","url":null,"abstract":"Virtualization enables data centers to consolidate servers to improve resource utilization and power consumption. This paper presents the issues of performability management in a virtualized data center that hosts multiple services using virtualization. One of main cause of performability degradation is software aging in both virtual machines (VMs) and virtual machine monitor (VMM) in virtualized data center (VDC). Software rejuvenation is a cost effective and a proactive method to counteract software aging. During software rejuvenation operations, there occurs down time, therefore, VM placement and the rejuvenation schedules for VMs and VMM need to be optimized so as to maximize the performability in a VDC. We introduce the state of the art technology on software aging and rejuvenation in virtualized data center and we formulate this problem and show our approach.","PeriodicalId":436527,"journal":{"name":"2008 IEEE International Conference on Software Reliability Engineering Workshops (ISSRE Wksp)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133788324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Bobbio, S. Garg, M. Gribaudo, A. Horváth, M. Sereno, M. Telek
{"title":"Compositional fluid stochastic Petri net model for operational software system performance","authors":"A. Bobbio, S. Garg, M. Gribaudo, A. Horváth, M. Sereno, M. Telek","doi":"10.1109/ISSREW.2008.5355517","DOIUrl":"https://doi.org/10.1109/ISSREW.2008.5355517","url":null,"abstract":"Software systems experience gradual performance degradation due to several reasons and different preventive and corrective techniques can be applied to restore their performance level. This paper presents a unified model to describe the behavior of long running software systems with performance degrading factors such as system aging and various recovery techniques such as rejuvenation, checkpointing, rollback recovery, restart and replication. The proposed unified model is described as a fluid stochastic Petri net (FSPN). The FSPN formalism offers a descriptive language that allows a compact and precise description of the model behavior. Various analysis methods can be applied to obtain numerical results for the performance indices of interest. We illustrate the use of the model by means of a simple numerical example which captures rejuvenation, restart and replication.","PeriodicalId":436527,"journal":{"name":"2008 IEEE International Conference on Software Reliability Engineering Workshops (ISSRE Wksp)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116663993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}