对异步消息传递系统中活动/崩溃的进程进行动态估计

2006 12th Pacific Rim International Symposium on Dependable Computing (PRDC'06) Pub Date : 2006-12-18 DOI:10.1109/PRDC.2006.48

A. Mostéfaoui, M. Raynal, Gilles Trédan

{"title":"对异步消息传递系统中活动/崩溃的进程进行动态估计","authors":"A. Mostéfaoui, M. Raynal, Gilles Trédan","doi":"10.1109/PRDC.2006.48","DOIUrl":null,"url":null,"abstract":"It is well-known that, in an asynchronous system where processes are prone to crash, it is impossible to design a protocol that provides each process with the set of processes that are currently alive. Basically, this comes from the fact that it is impossible to distinguish a crashed process from a process that is very slow or with which communications are very slow. Nevertheless, designing protocols that provide the processes with good approximations of the set of processes that are currently alive remains a real challenge in fault-tolerant distributed computing. This paper proposes such a protocol. To that end, it considers a realistic computation model where the processes are provided with non-synchronized local clocks and a function alpha(). That function takes a local duration as a parameter, and returns an integer that is an estimate of the number of processes that can crash during that duration. A simulation-based experimental evaluation of the protocol is also presented. The experiments show that the protocol is practically relevant","PeriodicalId":314915,"journal":{"name":"2006 12th Pacific Rim International Symposium on Dependable Computing (PRDC'06)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On the fly estimation of the processes that are alive/crashed in an asynchronous message-passing system\",\"authors\":\"A. Mostéfaoui, M. Raynal, Gilles Trédan\",\"doi\":\"10.1109/PRDC.2006.48\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It is well-known that, in an asynchronous system where processes are prone to crash, it is impossible to design a protocol that provides each process with the set of processes that are currently alive. Basically, this comes from the fact that it is impossible to distinguish a crashed process from a process that is very slow or with which communications are very slow. Nevertheless, designing protocols that provide the processes with good approximations of the set of processes that are currently alive remains a real challenge in fault-tolerant distributed computing. This paper proposes such a protocol. To that end, it considers a realistic computation model where the processes are provided with non-synchronized local clocks and a function alpha(). That function takes a local duration as a parameter, and returns an integer that is an estimate of the number of processes that can crash during that duration. A simulation-based experimental evaluation of the protocol is also presented. The experiments show that the protocol is practically relevant\",\"PeriodicalId\":314915,\"journal\":{\"name\":\"2006 12th Pacific Rim International Symposium on Dependable Computing (PRDC'06)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 12th Pacific Rim International Symposium on Dependable Computing (PRDC'06)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PRDC.2006.48\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 12th Pacific Rim International Symposium on Dependable Computing (PRDC'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PRDC.2006.48","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

众所周知，在进程容易崩溃的异步系统中，不可能设计一个协议，为每个进程提供当前活动的进程集。基本上，这是因为不可能将崩溃的进程与非常慢的进程或与之通信非常慢的进程区分开来。然而，在容错分布式计算中，设计为进程提供与当前活动的进程集良好近似的协议仍然是一个真正的挑战。本文提出了这样一个协议。为此，它考虑了一个现实的计算模型，其中为进程提供了非同步的本地时钟和函数alpha()。该函数将本地持续时间作为参数，并返回一个整数，该整数表示在该持续时间内可能崩溃的进程数量的估计值。本文还对该协议进行了基于仿真的实验评估。实验结果表明，该方案具有实际意义

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

On the fly estimation of the processes that are alive/crashed in an asynchronous message-passing system

It is well-known that, in an asynchronous system where processes are prone to crash, it is impossible to design a protocol that provides each process with the set of processes that are currently alive. Basically, this comes from the fact that it is impossible to distinguish a crashed process from a process that is very slow or with which communications are very slow. Nevertheless, designing protocols that provide the processes with good approximations of the set of processes that are currently alive remains a real challenge in fault-tolerant distributed computing. This paper proposes such a protocol. To that end, it considers a realistic computation model where the processes are provided with non-synchronized local clocks and a function alpha(). That function takes a local duration as a parameter, and returns an integer that is an estimate of the number of processes that can crash during that duration. A simulation-based experimental evaluation of the protocol is also presented. The experiments show that the protocol is practically relevant

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2006 12th Pacific Rim International Symposium on Dependable Computing (PRDC'06)

自引率

0.00%

发文量