SwiftER: Elastic Erasure Coded Storage System

2016 IEEE 35th Symposium on Reliable Distributed Systems (SRDS) Pub Date : 2016-09-01 DOI:10.1109/SRDS.2016.040

Anwitaman Datta, Wan-Hee Cho

{"title":"SwiftER: Elastic Erasure Coded Storage System","authors":"Anwitaman Datta, Wan-Hee Cho","doi":"10.1109/SRDS.2016.040","DOIUrl":null,"url":null,"abstract":"Over the life-cycle of a data object, it may be difficult to determine a priori how much redundancy to store it with. The desired degree of fault-tolerance may change over time, for instance, because the importance of the data changes, or the storage system environment changes. If the redundancy is achieved using replication, then changing the degree of fault-tolerance would mean adding (or removing) replicas - a reasonably straightforward operation. However, if erasure code is used instead (which is preferable, given the significantly lower storage overhead of erasure codes with respect to fully replicated systems), then, while shrinking redundancy can still be achieved similarly, expanding redundancy becomes non-trivial. A naive approach will require re-coding, which is both network resource and computation heavy. In this paper, we explore the possibility of using network coding techniques, to both distribute computational load, as well as reduce network usage, and in the process, speed-up the process of creating additional redundancy. The contributions of this paper are defining the problem and analyzing the theoretical limits by leveraging on and extending the existing literature on regenerating codes to realize erasure coded redundancy elasticity, propose a framework to realize code instances that are amenable to network coding based elastic expansion of redundancy, and integrate and benchmark one such code instance (which happens to be optimal with respect to the aforementioned established theoretical limit) with OpenStack Swift to demonstrate the practicality and advantages of the proposed approach.","PeriodicalId":165721,"journal":{"name":"2016 IEEE 35th Symposium on Reliable Distributed Systems (SRDS)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 35th Symposium on Reliable Distributed Systems (SRDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SRDS.2016.040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Over the life-cycle of a data object, it may be difficult to determine a priori how much redundancy to store it with. The desired degree of fault-tolerance may change over time, for instance, because the importance of the data changes, or the storage system environment changes. If the redundancy is achieved using replication, then changing the degree of fault-tolerance would mean adding (or removing) replicas - a reasonably straightforward operation. However, if erasure code is used instead (which is preferable, given the significantly lower storage overhead of erasure codes with respect to fully replicated systems), then, while shrinking redundancy can still be achieved similarly, expanding redundancy becomes non-trivial. A naive approach will require re-coding, which is both network resource and computation heavy. In this paper, we explore the possibility of using network coding techniques, to both distribute computational load, as well as reduce network usage, and in the process, speed-up the process of creating additional redundancy. The contributions of this paper are defining the problem and analyzing the theoretical limits by leveraging on and extending the existing literature on regenerating codes to realize erasure coded redundancy elasticity, propose a framework to realize code instances that are amenable to network coding based elastic expansion of redundancy, and integrate and benchmark one such code instance (which happens to be optimal with respect to the aforementioned established theoretical limit) with OpenStack Swift to demonstrate the practicality and advantages of the proposed approach.

查看原文本刊更多论文

SwiftER: Elastic Erasure Coded Storage System

在数据对象的整个生命周期中，可能很难先验地确定要用多少冗余来存储它。所需的容错程度可能会随着时间的推移而改变，例如，因为数据的重要性发生了变化，或者存储系统环境发生了变化。如果冗余是通过复制实现的，那么更改容错程度就意味着添加(或删除)副本——这是一个相当简单的操作。但是，如果使用erasure code(考虑到相对于完全复制的系统，erasure code的存储开销要低得多，这是可取的)，那么，虽然缩减冗余仍然可以类似地实现，但扩展冗余就变得不容易了。一种朴素的方法需要重新编码，这既占用网络资源，又需要大量计算。在本文中，我们探索了使用网络编码技术的可能性，既可以分配计算负载，也可以减少网络使用，并在此过程中加速创建额外冗余的过程。本文的贡献在于:通过利用和扩展现有的关于代码再生实现擦除编码冗余弹性的文献，对问题进行了界定，分析了理论局限性，提出了一种基于冗余弹性扩展的可适应网络编码的代码实例实现框架;并将一个这样的代码实例(相对于上述建立的理论限制恰好是最优的)与OpenStack Swift集成和基准测试，以证明所提出方法的实用性和优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 IEEE 35th Symposium on Reliable Distributed Systems (SRDS)

自引率

0.00%

发文量