{"title":"Building Highly Available Cluster File System Based on Replication","authors":"Liang Cao, Yu Wang, Jin Xiong","doi":"10.1109/PDCAT.2009.14","DOIUrl":null,"url":null,"abstract":"In order to gain better cost-effectiveness, current large-scale storage systems are typically built up by thousands of individual components. As systems scale up, the probability of the failure of multiple components increases. And for large-scale storage system, failures are normal rather than exception. How to build file systems providing both high throughput and highly available service under such circumstances is a big challenge. We have designed and implemented HA-DCFS3, a highly available cluster file system prototype. It uses a scalable replication algorithm called Asynchronous Primary Copy Protocol (APCP). Unlike traditional Primary Copy Protocol that must synchronize updates to all replicas, APCP introduces an asynchronous approach where write operation is permitted to be synchronized to a subset of replicas. This flexible approach greatly improves the write performance. Furthermore, HA-DCFS3 also introduces a fine-grained failure detection called “ data path detection”, which is integrated into the fault-tolerant framework based on data replication. Hence, HA-DCFS3 can provide continuous service even when component failures occur. And finally, HA-DCFS3 adopts a two-level data recovery strategy that handles transient failures with reintegration and persistent failures with re-replication respectively to reduce the cost of data repair. Our performance results show that HA-DCFS3 can deliver high and scalable aggregate performance and provide highly available service at very low cost.","PeriodicalId":312929,"journal":{"name":"2009 International Conference on Parallel and Distributed Computing, Applications and Technologies","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Parallel and Distributed Computing, Applications and Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PDCAT.2009.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 5
Abstract
To achieve better cost-effectiveness, current large-scale storage systems are typically built from thousands of individual components. As systems scale up, the probability that multiple components fail increases; in large-scale storage systems, failures are the norm rather than the exception. Building file systems that provide both high throughput and highly available service under these circumstances is a major challenge. We have designed and implemented HA-DCFS3, a highly available cluster file system prototype. It uses a scalable replication algorithm called the Asynchronous Primary Copy Protocol (APCP). Unlike the traditional Primary Copy Protocol, which must synchronize every update to all replicas, APCP takes an asynchronous approach in which a write need only be synchronized to a subset of the replicas. This flexibility greatly improves write performance. Furthermore, HA-DCFS3 introduces a fine-grained failure-detection mechanism called "data path detection", which is integrated into its replication-based fault-tolerance framework, so HA-DCFS3 can provide continuous service even when components fail. Finally, HA-DCFS3 adopts a two-level data recovery strategy that reduces the cost of data repair by handling transient failures with reintegration and persistent failures with re-replication. Our performance results show that HA-DCFS3 delivers high, scalable aggregate performance and provides highly available service at very low cost.
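The core of APCP, as the abstract describes it, is that a write returns to the client after only a subset of the replicas has been updated synchronously, with the rest catching up in the background. Below is a minimal Go sketch of that idea; the `Replica` and `Primary` types, the `syncCount` parameter, and the in-memory stores are illustrative assumptions, not the paper's actual implementation.

```go
// Hedged sketch of the asynchronous-primary-copy idea: the primary
// fans a write out to all replicas but acknowledges the client as
// soon as syncCount of them have applied it. All names here are
// hypothetical stand-ins for the paper's APCP machinery.
package main

import (
	"fmt"
	"sync"
)

// Replica is a hypothetical stand-in for a data server holding one copy.
type Replica struct {
	mu   sync.Mutex
	data map[string][]byte
}

func (r *Replica) Apply(key string, val []byte) {
	r.mu.Lock()
	defer r.mu.Unlock()
	if r.data == nil {
		r.data = make(map[string][]byte)
	}
	r.data[key] = val
}

// Primary coordinates a write across all replicas but returns after
// only syncCount of them have acknowledged.
type Primary struct {
	replicas  []*Replica
	syncCount int // size of the synchronously updated subset
}

func (p *Primary) Write(key string, val []byte) {
	acked := make(chan struct{}, len(p.replicas))
	for _, r := range p.replicas {
		go func(r *Replica) {
			r.Apply(key, val)
			acked <- struct{}{}
		}(r)
	}
	// Block only until the synchronous subset has applied the update;
	// the remaining replicas finish asynchronously.
	for i := 0; i < p.syncCount; i++ {
		<-acked
	}
}

func main() {
	p := &Primary{
		replicas:  []*Replica{{}, {}, {}},
		syncCount: 2, // acknowledge after 2 of 3 replicas are consistent
	}
	p.Write("/file/block0", []byte("payload"))
	fmt.Println("write acknowledged with a partially synchronized replica set")
}
```

With `syncCount` equal to the replica count, this degenerates into the traditional fully synchronous Primary Copy Protocol; lowering it trades replication lag for write latency, which is the flexibility the abstract credits for APCP's improved write performance.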
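The abstract does not detail how "data path detection" works beyond calling it fine-grained and integrated with the replication framework. One plausible reading is that failures are detected inline on the I/O path itself, rather than only through periodic heartbeats. The sketch below illustrates that reading under stated assumptions: the timeout value and the reporting hook are both hypothetical.

```go
// Hedged sketch of inline failure detection on the data path: a
// replica that does not complete an I/O within the timeout is
// immediately reported to the fault-tolerance framework, instead of
// waiting for a heartbeat round to notice it.
package main

import (
	"errors"
	"fmt"
	"time"
)

var ErrReplicaSuspect = errors.New("replica suspected failed on data path")

// writeWithDetection issues the I/O and treats a timeout as a
// fine-grained failure signal for exactly this replica.
func writeWithDetection(send func() error, timeout time.Duration, report func()) error {
	done := make(chan error, 1)
	go func() { done <- send() }()
	select {
	case err := <-done:
		return err
	case <-time.After(timeout):
		report() // notify the recovery machinery immediately
		return ErrReplicaSuspect
	}
}

func main() {
	slowReplica := func() error { time.Sleep(2 * time.Second); return nil }
	err := writeWithDetection(slowReplica, 100*time.Millisecond, func() {
		fmt.Println("reporting replica to the recovery manager")
	})
	fmt.Println("result:", err)
}
```

Detecting failures on the data path lets the system redirect a pending request to a healthy replica at once, which is consistent with the abstract's claim of continuous service under component failures.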
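The two-level recovery strategy distinguishes transient failures, handled cheaply by reintegration, from persistent failures, handled by re-replication. A schematic sketch of that policy follows, assuming a grace-window threshold to classify failures; the threshold value and helper names are hypothetical, not taken from the paper.

```go
// Hedged sketch of two-level recovery: a replica that returns within
// a grace window is reintegrated by replaying the updates it missed,
// while one that stays down is replaced by copying its data in full
// onto a fresh node.
package main

import (
	"fmt"
	"time"
)

const persistentThreshold = 5 * time.Minute // assumed grace window

type failedReplica struct {
	id      string
	downFor time.Duration
}

func handleFailure(f failedReplica) {
	if f.downFor < persistentThreshold {
		// Transient failure: cheap catch-up from a missed-update log.
		fmt.Printf("%s: reintegrate (replay missed updates)\n", f.id)
	} else {
		// Persistent failure: restore redundancy with a full copy.
		fmt.Printf("%s: re-replicate onto a new data server\n", f.id)
	}
}

func main() {
	handleFailure(failedReplica{"ds-3", 30 * time.Second})
	handleFailure(failedReplica{"ds-7", 2 * time.Hour})
}
```

The point of the split is cost: reintegration transfers only the missed updates, while re-replication pays for a full copy only when the original replica is truly gone, which matches the abstract's stated goal of reducing data-repair cost.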