大型高可用性服务器系统中DIMM连接器故障率的RAS含义

T. J. Dell
{"title":"大型高可用性服务器系统中DIMM连接器故障率的RAS含义","authors":"T. J. Dell","doi":"10.1109/HOLM.2007.4318226","DOIUrl":null,"url":null,"abstract":"The juxtaposition of low-cost dual inline memory module (DIMM) connectors in highly reliable servers has created a difficult reliability, availability, and serviceability conundrum: the connector cost must be low enough to allow hundreds of sockets to be used per system, while at the same time, the system-level reliability must be high enough to prevent connector-related memory failures. This paper explores some of the modeling techniques that can be used to guide system-level fault tolerance decisions in view of the propensity of card-edge connectors to experience corrosion-induced failures, and it explains why understanding the probability density function (PDF) of the connector failure rate is crucial in establishing the system RAS strategy for DIMM connectors. The effects of both a \"low\" and \"high\" contact failure rate are analyzed at two different PDF's, and the resultant system implications are discussed.","PeriodicalId":11624,"journal":{"name":"Electrical Contacts - 2007 Proceedings of the 53rd IEEE Holm Conference on Electrical Contacts","volume":"38 1","pages":"256-261"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"The RAS Implications of DIMM Connector Failure Rates in Large, Highly Available Server Systems\",\"authors\":\"T. J. Dell\",\"doi\":\"10.1109/HOLM.2007.4318226\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The juxtaposition of low-cost dual inline memory module (DIMM) connectors in highly reliable servers has created a difficult reliability, availability, and serviceability conundrum: the connector cost must be low enough to allow hundreds of sockets to be used per system, while at the same time, the system-level reliability must be high enough to prevent connector-related memory failures. This paper explores some of the modeling techniques that can be used to guide system-level fault tolerance decisions in view of the propensity of card-edge connectors to experience corrosion-induced failures, and it explains why understanding the probability density function (PDF) of the connector failure rate is crucial in establishing the system RAS strategy for DIMM connectors. The effects of both a \\\"low\\\" and \\\"high\\\" contact failure rate are analyzed at two different PDF's, and the resultant system implications are discussed.\",\"PeriodicalId\":11624,\"journal\":{\"name\":\"Electrical Contacts - 2007 Proceedings of the 53rd IEEE Holm Conference on Electrical Contacts\",\"volume\":\"38 1\",\"pages\":\"256-261\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Electrical Contacts - 2007 Proceedings of the 53rd IEEE Holm Conference on Electrical Contacts\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HOLM.2007.4318226\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electrical Contacts - 2007 Proceedings of the 53rd IEEE Holm Conference on Electrical Contacts","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HOLM.2007.4318226","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在高可靠性服务器中同时使用低成本的双内联内存模块(DIMM)连接器产生了一个难以解决的可靠性、可用性和可维护性难题:连接器成本必须足够低,以允许每个系统使用数百个插座,同时,系统级可靠性必须足够高,以防止连接器相关的内存故障。本文探讨了一些可用于指导系统级容错决策的建模技术,考虑到卡边连接器容易经历腐蚀导致的故障,并解释了为什么理解连接器故障率的概率密度函数(PDF)对于建立DIMM连接器的系统RAS策略至关重要。在两种不同的PDF下,分析了“低”和“高”接触故障率的影响,并讨论了由此产生的系统含义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
The RAS Implications of DIMM Connector Failure Rates in Large, Highly Available Server Systems
The juxtaposition of low-cost dual inline memory module (DIMM) connectors in highly reliable servers has created a difficult reliability, availability, and serviceability conundrum: the connector cost must be low enough to allow hundreds of sockets to be used per system, while at the same time, the system-level reliability must be high enough to prevent connector-related memory failures. This paper explores some of the modeling techniques that can be used to guide system-level fault tolerance decisions in view of the propensity of card-edge connectors to experience corrosion-induced failures, and it explains why understanding the probability density function (PDF) of the connector failure rate is crucial in establishing the system RAS strategy for DIMM connectors. The effects of both a "low" and "high" contact failure rate are analyzed at two different PDF's, and the resultant system implications are discussed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信