Subordination: Cluster management without distributed consensus

2015 International Conference on High Performance Computing & Simulation (HPCS). Pub Date: 2015-07-20. DOI: 10.1109/HPCSim.2015.7237106

I. Gankevich, Y. Tipikin, V. Gaiduchok

Citations: 7

Abstract

Many cluster management systems rely on distributed consensus algorithms to elect a leader that orchestrates subordinate nodes. In contrast to these approaches, we propose a consensus-free algorithm that arranges cluster nodes into multiple levels of subordination. The algorithm structures the IP address range of the cluster network so that each node has a ranked list of candidates from which it chooses a leader. The results show that this approach scales easily to a large number of nodes due to its asynchronous nature, and enables fast recovery from node failures, since a failure affects only one level of the hierarchy. Multiple levels of subordination are useful for efficiently collecting monitoring and accounting data from a large number of nodes, and for scheduling general-purpose tasks on a cluster.
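The abstract does not spell out how the ranked candidate list is derived, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes that host addresses in the subnet are laid out as a complete tree in breadth-first order with a fixed fan-out, and that a node ranks the hosts one level above it by distance from its nominal parent. The names `ranked_candidates`, `choose_leader`, `fanout`, and `is_alive` are hypothetical.

```python
import ipaddress


def ranked_candidates(node_ip, network, fanout=2):
    """Return leader candidates for node_ip, best first.

    Assumption (not from the paper): hosts form a complete tree in
    breadth-first order over the subnet's address range.
    """
    hosts = list(ipaddress.ip_network(network).hosts())
    index = hosts.index(ipaddress.ip_address(node_ip))
    if index == 0:
        return []  # the first host is the root and has no leader

    # Find the level that contains this index: it starts at `first`
    # and holds `width` positions.
    first, width = 0, 1
    while index >= first + width:
        first += width
        width *= fanout

    # The level above is always fully populated in this layout.
    parent_width = width // fanout
    parent_first = first - parent_width
    parent = parent_first + (index - first) // fanout

    # Rank every host of the upper level by distance from the parent.
    upper = range(parent_first, parent_first + parent_width)
    ranked = sorted(upper, key=lambda i: (abs(i - parent), i))
    return [hosts[i] for i in ranked]


def choose_leader(candidates, is_alive):
    """Pick the first reachable candidate; no votes are exchanged."""
    for candidate in candidates:
        if is_alive(candidate):
            return candidate
    return None
```

Each node computes its list locally from its own address, so under these assumptions a failed leader only causes its direct subordinates to move to the next candidate in their lists. For example, `ranked_candidates("10.0.0.6", "10.0.0.0/28")` ranks `10.0.0.3` ahead of `10.0.0.2`.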



