Roberto Cortiñas, Iratxe Soraluze Arriola, A. Lafuente, M. Larrea
Brief announcement: an efficient failure detector for omission environments
Proceedings of the 29th ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing, 2010-07-25
DOI: 10.1145/1835698.1835720 (https://doi.org/10.1145/1835698.1835720)
Abstract
Failure detectors, which provide (possibly erroneous) information about process failures, have been shown to be a very useful mechanism for solving, among others, the consensus problem. Communication efficiency, defined in [1] as having a linear number of links carrying messages forever, has been a subject of research when designing failure detectors for the crash model [7], but, as far as we know, not yet for the general omission model. This model covers situations in which message omissions cannot be fully controlled, e.g., buffer overflows, restricted re-sending capacity, or malicious behaviour such as that considered in [2]. Failure detection in omission environments was first addressed in [5], and more recently in [2, 4]. All these failure detectors have a permanent all-to-all communication pattern, and thus are not communication-efficient. In this brief announcement, we first define a new failure detector for the general omission model in terms of completeness and accuracy properties, and then propose a communication-efficient algorithm that implements it.
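
To make the efficiency criterion concrete, the following minimal sketch counts the directed links that carry messages forever under an all-to-all pattern and under two linear patterns. The ring and star topologies, and all function names, are illustrative assumptions chosen only to show the quadratic-versus-linear gap; they are not the communication pattern of the algorithm announced here.

```python
# Illustrative only: counts directed links that carry messages forever
# under different (hypothetical) monitoring topologies for n processes.

def all_to_all_links(n: int) -> int:
    """Every process sends to every other process: n * (n - 1) links."""
    return n * (n - 1)


def ring_links(n: int) -> int:
    """Each process sends only to its successor on a ring: n links."""
    return n if n > 1 else 0


def star_links(n: int) -> int:
    """One designated process sends to all the others: n - 1 links."""
    return max(n - 1, 0)


if __name__ == "__main__":
    for n in (4, 16, 64):
        print(
            f"n={n}: all-to-all={all_to_all_links(n)}, "
            f"ring={ring_links(n)}, star={star_links(n)}"
        )
```

Under this counting, the all-to-all pattern grows quadratically in the number of processes, whereas both linear patterns satisfy the "linear number of links" criterion quoted from [1].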