{"title":"Supporting distributed application management in Sampa","authors":"M. Endler, Anil J. D'Souza","doi":"10.1109/CDS.1996.509360","DOIUrl":null,"url":null,"abstract":"The paper presents the architecture and base services of Sampa, a System for Availability Management of Process-based Applications. The system has been designed to support the management of fault-tolerant DCE-based distributed programs according to user provided and application-specific availability specifications. Sampa is supposed to detect and automatically react to faults such as node crashes, network partitions, process crashes and hang-ups. We focus on the design of its base services-the monitoring, reliable group communication and checkpointing facilities and show how they can be used for managing a generic replicated service.","PeriodicalId":302050,"journal":{"name":"Proceedings of International Conference on Configurable Distributed Systems","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of International Conference on Configurable Distributed Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDS.1996.509360","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The paper presents the architecture and base services of Sampa, a System for Availability Management of Process-based Applications. The system has been designed to support the management of fault-tolerant DCE-based distributed programs according to user provided and application-specific availability specifications. Sampa is supposed to detect and automatically react to faults such as node crashes, network partitions, process crashes and hang-ups. We focus on the design of its base services-the monitoring, reliable group communication and checkpointing facilities and show how they can be used for managing a generic replicated service.