A benchmark for fault monitors in distributed systems

2009 International Conference on Emerging Technologies. Pub Date: 2009-12-11. DOI: 10.1109/ICET.2009.5353193

S. Hussain, M. Qadir


Abstract

Fault monitoring is one of the main activities of fault-tolerant distributed systems. It is needed to identify suspected or crashed components and to take recovery steps proactively so that the system stays alive. Its main objective is to identify faults quickly and correctly. Many fault monitoring techniques exist, each with general and technique-specific parameters that influence its performance. In this paper we identify the parameters that can help classify fault monitoring techniques. We created a benchmark, ACI (Adaptation, Convergence, Intelligence), and applied it to current techniques.


