Repair Bandwidth Cost of Generalized Regenerating Codes for Clustered Distributed Storage

Ke Li, Shushi Gu, Ye Wang, Qinyu Zhang, W. Xiang
2019 11th International Conference on Wireless Communications and Signal Processing (WCSP), published 2019-10-01. DOI: 10.1109/WCSP.2019.8928064 (https://doi.org/10.1109/WCSP.2019.8928064)

Abstract

When repairing storage nodes in a clustered distributed storage system (CDSS), it is crucial to distinguish between intra-cluster and inter-cluster bandwidth, whose costs differ sharply. From this perspective, Generalized Regenerating Codes (GRCs), which involve a two-layer repair process, were previously proposed and shown to achieve a better trade-off between storage overhead and inter-cluster repair bandwidth. However, because no explicit expression exists for the GRC parameters at an arbitrary point on the trade-off curve, it is difficult to determine the optimal GRC parameter configuration for reducing the total repair bandwidth cost in a practical CDSS. To address this issue, we devise a novel transmission cost model for CDSS and introduce two essential concepts, the Cost Coefficient (CC) and the Global Repair Bandwidth Cost (GRBC), to denote the unit and total transmission costs of repair bandwidth, respectively. Moreover, we parameterize the two extreme points on the optimal storage overhead versus repair bandwidth trade-off curve, termed Minimum Storage Generalized Regenerating Codes (MS-GRCs) and Minimum Inter-cluster Bandwidth Generalized Regenerating Codes (MB-GRCs), and theoretically analyze the relationship between their GRBCs and the number of local helper nodes ℓ (the helper nodes in the same cluster as the failed node). Our mathematical results provide guidance for employing GRCs to achieve more efficient node repair in CDSS.
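The abstract states only that the CC is a unit transmission cost and the GRBC a total transmission cost; it gives no formulas. As a rough illustration of that idea, the sketch below assumes the total is a weighted sum of intra-cluster and inter-cluster repair bandwidth. The weighted-sum form, the function name, and all numeric values are assumptions made here for illustration and are not taken from the paper.

```python
def global_repair_bandwidth_cost(intra_bw, inter_bw, cc_intra=1.0, cc_inter=10.0):
    """Illustrative total repair cost: unit cost times bandwidth, summed over both link types.

    ASSUMPTION: the weighted-sum form and the default coefficients are
    hypothetical; the paper's actual cost model may differ.
    """
    if min(intra_bw, inter_bw, cc_intra, cc_inter) < 0:
        raise ValueError("bandwidths and cost coefficients must be non-negative")
    return cc_intra * intra_bw + cc_inter * inter_bw


# Two hypothetical repair configurations (made-up bandwidth figures):
# one leans on local helper nodes, the other on remote clusters.
local_heavy = global_repair_bandwidth_cost(intra_bw=8, inter_bw=1)
remote_heavy = global_repair_bandwidth_cost(intra_bw=2, inter_bw=3)
print(local_heavy, remote_heavy)
```

Under these assumed coefficients, shifting traffic onto intra-cluster links lowers the total even though more data moves overall, which is the kind of comparison a cost model of this shape would support.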