Byzantine Fault Tolerance

From Traditional Fault Tolerance to Blockchain Pub Date : 1900-01-01 DOI:10.1002/9781119682127.ch7

Allen Clement

{"title":"Byzantine Fault Tolerance","authors":"Allen Clement","doi":"10.1002/9781119682127.ch7","DOIUrl":null,"url":null,"abstract":"I am excited by the challenge of making distributed systems reliable and robust to failures. Distributed systems form the backbone of a variety of services that play an important part in daily life: email, e-commerce, and air traffic control are a few examples. The impact of failures of such services ranges from the inconvenience of lost email, to the hassles of delayed flights, to financial losses and even closure of companies. It is difficult to design reliable distributed systems because individual computers and the networks connecting them can fail in a variety of ways. Each computer or network failure can lead to an unplanned behavior of individual components and the system as a whole. Can we build reliable systems without considering every possible failure scenario? My research focuses on the development of end-to-end techniques for building reliable systems that are general, practical, and theoretically sound. Specifically, general techniques cover a wide range of faults and are easy to incorporate into a variety of new and legacy systems; practical techniques impose low overheads, provide robust performance in the presence of failures, and are based on realistic and tenable system models; theoretically sound techniques provide welldefined guarantees under well-defined assumptions. I believe that all three properties are important – techniques that are not general have limited deployment potential, techniques that are not practical will not be used, and techniques that are not theoretically sound may not work as advertised.","PeriodicalId":245238,"journal":{"name":"From Traditional Fault Tolerance to Blockchain","volume":"162 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"From Traditional Fault Tolerance to Blockchain","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/9781119682127.ch7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 18

Abstract

I am excited by the challenge of making distributed systems reliable and robust to failures. Distributed systems form the backbone of a variety of services that play an important part in daily life: email, e-commerce, and air traffic control are a few examples. The impact of failures of such services ranges from the inconvenience of lost email, to the hassles of delayed flights, to financial losses and even closure of companies. It is difficult to design reliable distributed systems because individual computers and the networks connecting them can fail in a variety of ways. Each computer or network failure can lead to an unplanned behavior of individual components and the system as a whole. Can we build reliable systems without considering every possible failure scenario? My research focuses on the development of end-to-end techniques for building reliable systems that are general, practical, and theoretically sound. Specifically, general techniques cover a wide range of faults and are easy to incorporate into a variety of new and legacy systems; practical techniques impose low overheads, provide robust performance in the presence of failures, and are based on realistic and tenable system models; theoretically sound techniques provide welldefined guarantees under well-defined assumptions. I believe that all three properties are important – techniques that are not general have limited deployment potential, techniques that are not practical will not be used, and techniques that are not theoretically sound may not work as advertised.

查看原文本刊更多论文

拜占庭容错

我对使分布式系统对故障可靠且健壮的挑战感到兴奋。分布式系统构成了在日常生活中发挥重要作用的各种服务的支柱:电子邮件、电子商务和空中交通管制就是其中的几个例子。这些服务的失败所带来的影响从丢失电子邮件的不便，到航班延误的麻烦，到经济损失，甚至公司倒闭。设计可靠的分布式系统是困难的，因为单个计算机和连接它们的网络可能以各种方式出现故障。每台计算机或网络故障都可能导致单个组件和整个系统出现计划外行为。我们能在不考虑所有可能的故障情况下构建可靠的系统吗?我的研究重点是开发端到端技术，用于构建通用的、实用的、理论上合理的可靠系统。具体来说，通用技术涵盖了广泛的故障范围，并且很容易合并到各种新的和遗留的系统中;实用的技术可以降低开销，在出现故障时提供健壮的性能，并且基于现实可行的系统模型;理论上可靠的技术在明确的假设下提供明确的保证。我相信这三个属性都很重要——不通用的技术部署潜力有限，不实用的技术不会被使用，理论上不可靠的技术可能不会像宣传的那样工作。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

From Traditional Fault Tolerance to Blockchain

自引率

0.00%

发文量