{"title":"HydraNet-FT: network support for dependable services","authors":"Gurudatt Shenoy, Suresh Satapati, R. Bettati","doi":"10.1109/ICDCS.2000.840987","DOIUrl":null,"url":null,"abstract":"With the Internet increasingly being used as the access medium for a variety of critical services, there is a growing need to provide fault-tolerant (FT) services over internetworks, in a completely client-transparent fashion. We present HydraNet-FT, an infrastructure to dynamically replicate services across an internetwork and have the replicas provide a single fault-tolerant service access point to clients. HydraNet-FT uses the TCP communication protocol with a few modifications on the server side to allow one-to-many message delivery from a client to service replicas and many-to-one message delivery from the replicas to the client. A communication channel between the replicas provides atomicity and message ordering. A low-latency failure estimator is used to detect failures of servers in the system and to initiate fail-over mechanisms. An implementation and measurements on a local testbed show that the overhead of our scheme is reasonably small.","PeriodicalId":284992,"journal":{"name":"Proceedings 20th IEEE International Conference on Distributed Computing Systems","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"40","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 20th IEEE International Conference on Distributed Computing Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDCS.2000.840987","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 40
Abstract
With the Internet increasingly being used as the access medium for a variety of critical services, there is a growing need to provide fault-tolerant (FT) services over internetworks, in a completely client-transparent fashion. We present HydraNet-FT, an infrastructure to dynamically replicate services across an internetwork and have the replicas provide a single fault-tolerant service access point to clients. HydraNet-FT uses the TCP communication protocol with a few modifications on the server side to allow one-to-many message delivery from a client to service replicas and many-to-one message delivery from the replicas to the client. A communication channel between the replicas provides atomicity and message ordering. A low-latency failure estimator is used to detect failures of servers in the system and to initiate fail-over mechanisms. An implementation and measurements on a local testbed show that the overhead of our scheme is reasonably small.