{"title":"Locality-aware fountain codes for massive distributed storage systems","authors":"Toritseju Okpotse, S. Yousefi","doi":"10.1109/CWIT.2015.7255143","DOIUrl":null,"url":null,"abstract":"Low repair locality of a distributed storage code has been shown to reduce strain on storage node input-output (I/O) resources during node repair operations after a failure. In this paper, we consider the use of Fountain codes for distributed storage systems and aim to understand the relationship between repair locality and code parameters for a systematic Fountain code. While the information-theoretic trade-off between repair locality and storage overhead has been understood and characterized, the challenge of choosing a locality value that satisfies multiple storage system design metrics is yet to be resolved. We approach this problem by deriving an expression for the probability distribution of repair locality in terms of the rateless code degree distribution coefficients and suggest that factoring this relationship into the code design process enables the design of rateless codes better adjusted to the needs of a massive distributed storage system.","PeriodicalId":426245,"journal":{"name":"2015 IEEE 14th Canadian Workshop on Information Theory (CWIT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 14th Canadian Workshop on Information Theory (CWIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CWIT.2015.7255143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Low repair locality of a distributed storage code has been shown to reduce strain on storage node input-output (I/O) resources during node repair operations after a failure. In this paper, we consider the use of Fountain codes for distributed storage systems and aim to understand the relationship between repair locality and code parameters for a systematic Fountain code. While the information-theoretic trade-off between repair locality and storage overhead has been understood and characterized, the challenge of choosing a locality value that satisfies multiple storage system design metrics is yet to be resolved. We approach this problem by deriving an expression for the probability distribution of repair locality in terms of the rateless code degree distribution coefficients and suggest that factoring this relationship into the code design process enables the design of rateless codes better adjusted to the needs of a massive distributed storage system.