Experiences building network-coding-based distributed storage systems

2014 Information Theory and Applications Workshop (ITA). Pub Date: 2014-04-24. DOI: 10.1109/ITA.2014.6804234

P. Lee


Citations: 2

Abstract

Large-scale distributed storage systems are prone to node failures. To provide fault tolerance, data is often encoded so that redundancy is maintained across multiple storage nodes. If a node fails, it can be repaired by downloading data from surviving nodes and regenerating the lost data on a new node. Network coding has recently been proposed (e.g., see [2]) as a way to generate this redundancy. It has been shown that network coding can minimize the amount of data transferred for repair while maintaining the same fault tolerance as conventional erasure coding schemes. The idea is to have the surviving storage nodes first encode their stored data and then send the encoded data for regeneration. However, network coding in storage systems has mostly been investigated in theoretical studies, and its performance in real deployments remains an open issue. This motivates us to study the practicality of deploying network coding in real-world distributed storage systems. We highlight two of our implementation projects on network-coding-based storage systems at the Chinese University of Hong Kong, namely NCCloud and CORE. The two systems target different storage applications, but both build on network coding to enable high availability and efficient recovery.
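The repair-traffic claim can be illustrated with a small calculation. The sketch below is not taken from the paper: it assumes the standard minimum-storage regenerating (MSR) bound from the regenerating-codes literature, in which each of d helper nodes sends M / (k(d - k + 1)) units to repair one failed node of a file of size M. The function names and the parameters n = 4, k = 2 are hypothetical, chosen only for illustration.

```python
from fractions import Fraction


def conventional_repair_traffic(file_size):
    """Repair traffic for one failed node under a conventional (n, k) MDS
    erasure code: the new node downloads k chunks of size M/k, i.e. the
    equivalent of the whole file, and re-encodes the lost chunk."""
    return Fraction(file_size)


def msr_repair_traffic(file_size, k, d):
    """Repair traffic at the minimum-storage regenerating (MSR) point.

    Assumption (standard bound, not stated in the abstract): each of the
    d helpers sends M / (k * (d - k + 1)) units of encoded data.
    """
    if d < k:
        raise ValueError("need at least k helper nodes (d >= k)")
    per_helper = Fraction(file_size) / (k * (d - k + 1))
    return d * per_helper


# Illustrative parameters (hypothetical): 4 nodes, any 2 suffice to
# rebuild the file, and all 3 survivors help with the repair.
M = 1            # normalized file size
n, k = 4, 2
d = n - 1

conventional = conventional_repair_traffic(M)
regenerating = msr_repair_traffic(M, k, d)
saving = 1 - regenerating / conventional

print(f"conventional repair traffic: {conventional} x file size")
print(f"MSR repair traffic:          {regenerating} x file size")
print(f"saving:                      {saving}")
```

Under these assumed parameters, each survivor sends a smaller encoded piece rather than a whole stored chunk, which is where the reduction in repair traffic comes from. With d = k the bound reduces to the conventional case, so the saving depends on contacting more than k survivors.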
