A Self-Recovery Model for Distributed Applications Based on Microreboot

2008 International Conference on Internet Computing in Science and Engineering. Pub Date: 2008-01-28. DOI: 10.1109/ICICSE.2008.52

Huiqiang Wang, Haizhi Ye, Liang Ying

Citations: 0

Abstract

Automatic, fast recovery from failure is an important way of guaranteeing high availability for distributed application systems. Building on microreboot techniques and autonomic computing ideas, this paper analyzes the key issues in realizing self-recovery for distributed applications and presents a novel microreboot-based self-recovery model. The construction of the model is described in detail from several perspectives, including behavior monitoring, failure management, and recovery policy, and the principles of realizing self-recovery for distributed applications are explained. The model aims to address common failures in large distributed applications and can recover effectively without human intervention.
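The abstract names behavior monitoring, failure management, and recovery policy but gives no implementation detail, so the following is only a minimal sketch of how a microreboot-style recovery policy might look. Everything in it is an assumption made for illustration: the `Component` class, the `recover` function, the `probe` callback, and the component names are hypothetical, and the rule of escalating to the enclosing component when a restart does not help is borrowed from the general microreboot literature, not taken from this paper.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Component:
    """Hypothetical restartable unit; `parent` is the enclosing, coarser unit."""
    name: str
    parent: Optional["Component"] = None
    healthy: bool = True
    restarts: int = 0

    def microreboot(self) -> None:
        # Restart only this component. Real state handling is out of scope.
        self.restarts += 1


def recover(failed: Component,
            probe: Callable[[Component], bool],
            log: List[str]) -> bool:
    """Reboot the failed component, then its ancestors, until the probe passes."""
    target: Optional[Component] = failed
    while target is not None:
        target.microreboot()
        log.append(target.name)
        if probe(failed):          # behavior monitoring stands in as a probe
            failed.healthy = True
            return True
        target = target.parent     # escalate to the next coarser component
    return False                   # no restart helped; needs other handling


# Illustrative component tree (names are made up).
app = Component("app")
module = Component("order-module", parent=app)
bean = Component("cart-bean", parent=module)
bean.healthy = False

# Assumed fault: it clears only once the enclosing module has been restarted.
def probe(component: Component) -> bool:
    return module.restarts > 0

log: List[str] = []
recovered = recover(bean, probe, log)
print(recovered, log)  # True ['cart-bean', 'order-module']
```

In this sketch the whole application is never restarted: recovery stops at the smallest enclosing component whose restart makes the probe pass, which is the property that makes microreboot-style recovery fast.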
