Failure handling in an optimized two-safe approach to maintaining primary-backup systems

Proceedings Seventeenth IEEE Symposium on Reliable Distributed Systems (Cat. No.98CB36281) Pub Date : 1998-10-20 DOI:10.1109/RELDIS.1998.740488

Kexiang Hu, S. Mehrotra, S. Kaplan

{"title":"Failure handling in an optimized two-safe approach to maintaining primary-backup systems","authors":"Kexiang Hu, S. Mehrotra, S. Kaplan","doi":"10.1109/RELDIS.1998.740488","DOIUrl":null,"url":null,"abstract":"In a primary backup database system, transaction processing takes place at the primary and the log records generated are propagated to the backup which uses them to reconstruct the database state at the primary. If the primary fails, the backup takes over to provide continued service. Most existing designs of primary backup database systems have concentrated on techniques to tolerate complete failures in which the entire primary fails, say due to a disaster. In multiprocessor environments, where the primary and the backup databases are partitioned across multiple computers, a more common case is a partial failure in which some database partitions fail but the system as a whole survives. Existing approaches either ignore partial failures, or require the failed database partition to be unavailable. We explore a design of the primary backup database system that uses the backup not only for disaster protection, but also for continued availability during partial failures. The approach is developed in the context of the improved optimized 2-safe strategy to transmitting logs from the primary to the backup, introduced by K. Hu et al. (1997), which combines the best features of the previously developed 1-safe and 2-safe strategies.","PeriodicalId":376253,"journal":{"name":"Proceedings Seventeenth IEEE Symposium on Reliable Distributed Systems (Cat. No.98CB36281)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Seventeenth IEEE Symposium on Reliable Distributed Systems (Cat. No.98CB36281)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RELDIS.1998.740488","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

In a primary backup database system, transaction processing takes place at the primary and the log records generated are propagated to the backup which uses them to reconstruct the database state at the primary. If the primary fails, the backup takes over to provide continued service. Most existing designs of primary backup database systems have concentrated on techniques to tolerate complete failures in which the entire primary fails, say due to a disaster. In multiprocessor environments, where the primary and the backup databases are partitioned across multiple computers, a more common case is a partial failure in which some database partitions fail but the system as a whole survives. Existing approaches either ignore partial failures, or require the failed database partition to be unavailable. We explore a design of the primary backup database system that uses the backup not only for disaster protection, but also for continued availability during partial failures. The approach is developed in the context of the improved optimized 2-safe strategy to transmitting logs from the primary to the backup, introduced by K. Hu et al. (1997), which combines the best features of the previously developed 1-safe and 2-safe strategies.

查看原文本刊更多论文

以优化的双安全方法处理故障以维护主备系统

在主备份数据库系统中，事务处理在主服务器上进行，生成的日志记录被传播到备份服务器，备份服务器使用它们重构主服务器上的数据库状态。如果主服务器发生故障，备份服务器将接管以提供持续的服务。大多数现有的主备份数据库系统设计都集中在容忍完全故障的技术上，在这种情况下，整个主数据库都失败了，比如由于灾难。在多处理器环境中，主数据库和备份数据库在多台计算机上进行分区，更常见的情况是部分故障，其中一些数据库分区失败，但整个系统幸存下来。现有的方法要么忽略部分故障，要么要求出现故障的数据库分区不可用。我们探索了一种主备份数据库系统的设计，该系统不仅使用备份进行灾难保护，而且还使用备份在部分故障期间保持可用性。该方法是在k.h Hu等人(1997)引入的改进优化的2-safe策略的背景下开发的，该策略将日志从主服务器传输到备份服务器，该策略结合了先前开发的1-safe和2-safe策略的最佳特性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings Seventeenth IEEE Symposium on Reliable Distributed Systems (Cat. No.98CB36281)

自引率

0.00%

发文量