Enhanced Reliability Modeling of RAID Storage Systems

37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07) Pub Date : 2007-06-25 DOI:10.1109/DSN.2007.41

J. Elerath, M. Pecht

引用次数: 105

Abstract

A flexible model for estimating reliability of RAID storage systems is presented. This model corrects errors associated with the common assumption that system times to failure follow a homogeneous Poisson process. Separate generalized failure distributions are used to model catastrophic failures and usage dependent data corruptions for each hard drive. Catastrophic failure restoration is represented by a three-parameter Weibull, so the model can include a minimum time to restore as a function of data transfer rate and hard drive storage capacity. Data can be scrubbed as a background operation to eliminate corrupted data that, in the event of a simultaneous catastrophic failure, results in double disk failures. Field-based times to failure data and mathematic justification for a new model are presented. Model results have been verified and predict between 2 to 1,500 times as many double disk failures as that estimated using the current mean time to data loss method.

查看原文本刊更多论文

改进的RAID存储系统可靠性建模

提出了一种灵活的RAID存储系统可靠性估计模型。该模型纠正了与系统失效时间遵循齐次泊松过程这一普遍假设相关的错误。使用单独的广义故障分布来模拟每个硬盘驱动器的灾难性故障和使用相关的数据损坏。灾难性故障恢复由三参数威布尔表示，因此该模型可以包含最小恢复时间作为数据传输速率和硬盘存储容量的函数。数据可以作为后台操作进行擦除，以消除在同时发生灾难性故障时导致双磁盘故障的损坏数据。给出了基于现场的失效时间数据和新模型的数学证明。模型结果已经得到验证，并且预测的双盘故障数量是使用当前平均数据丢失时间方法估计的2到1500倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07)

自引率

0.00%

发文量