Flash correct-and-refresh: Retention-aware error management for increased flash memory lifetime

Yu Cai, Gulay Yalcin, O. Mutlu, E. Haratsch, A. Cristal, O. Unsal, K. Mai
{"title":"Flash correct-and-refresh: Retention-aware error management for increased flash memory lifetime","authors":"Yu Cai, Gulay Yalcin, O. Mutlu, E. Haratsch, A. Cristal, O. Unsal, K. Mai","doi":"10.1109/ICCD.2012.6378623","DOIUrl":null,"url":null,"abstract":"With the continued scaling of NAND flash and multi-level cell technology, flash-based storage has gained widespread use in systems ranging from mobile platforms to enterprise servers. However, the robustness of NAND flash cells is an increasing concern, especially at nanometer-regime process geometries. NAND flash memory bit error rate increases exponentially with the number of program/erase cycles. Stronger error correcting codes (ECC) can be used to tolerate higher error rates, but these have diminishing returns with increasing P/E cycles and can have prohibitively high power, area, and latency overheads. The goal of this paper is to develop new techniques that can tolerate high bit error rates without requiring prohibitively strong ECC. Our techniques, called Flash Correct-and-Refresh (FCR) exploit the observation that the dominant error source in NAND flash memory is retention errors, caused by flash cells losing charge over time. The key idea is to periodically read, correct, and reprogram (in-place) or remap the stored data before it accumulates more retention errors than can be corrected by simple ECC. Detailed simulations of a solid-state drive (SSD) storage system driven by measured experimental data from error characterization on real flash memory chips show that our techniques provide 46× average lifetime improvement on a variety of workloads at no additional hardware cost. We also find that our techniques achieve lifetime improvements that cannot feasibly be achieved with stronger ECC.","PeriodicalId":313428,"journal":{"name":"2012 IEEE 30th International Conference on Computer Design (ICCD)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"215","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 30th International Conference on Computer Design (ICCD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCD.2012.6378623","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 215

Abstract

With the continued scaling of NAND flash and multi-level cell technology, flash-based storage has gained widespread use in systems ranging from mobile platforms to enterprise servers. However, the robustness of NAND flash cells is an increasing concern, especially at nanometer-regime process geometries. NAND flash memory bit error rate increases exponentially with the number of program/erase cycles. Stronger error correcting codes (ECC) can be used to tolerate higher error rates, but these have diminishing returns with increasing P/E cycles and can have prohibitively high power, area, and latency overheads. The goal of this paper is to develop new techniques that can tolerate high bit error rates without requiring prohibitively strong ECC. Our techniques, called Flash Correct-and-Refresh (FCR) exploit the observation that the dominant error source in NAND flash memory is retention errors, caused by flash cells losing charge over time. The key idea is to periodically read, correct, and reprogram (in-place) or remap the stored data before it accumulates more retention errors than can be corrected by simple ECC. Detailed simulations of a solid-state drive (SSD) storage system driven by measured experimental data from error characterization on real flash memory chips show that our techniques provide 46× average lifetime improvement on a variety of workloads at no additional hardware cost. We also find that our techniques achieve lifetime improvements that cannot feasibly be achieved with stronger ECC.
闪存更正和刷新:保留感知错误管理,增加闪存寿命
随着NAND闪存和多级存储单元技术的不断扩展,基于闪存的存储已经在从移动平台到企业服务器的系统中得到了广泛的应用。然而,NAND闪存单元的稳健性越来越受到关注,特别是在纳米级工艺几何结构中。NAND闪存的误码率随着程序/擦除周期的增加呈指数增长。可以使用更强的纠错码(ECC)来容忍更高的错误率,但是随着P/E周期的增加,这些代码的回报会减少,并且可能具有过高的功耗、面积和延迟开销。本文的目标是开发能够容忍高误码率的新技术,而不需要过于强大的ECC。我们的技术,称为闪存校正和刷新(FCR)利用了NAND闪存中主要错误来源是保留错误的观察结果,这是由闪存单元随着时间的推移失去电荷引起的。关键思想是在存储的数据积累的保留错误超过简单ECC所能纠正的错误之前,定期读取、纠正和重新编程(就地)或重新映射存储的数据。通过实际闪存芯片上的误差表征测量实验数据驱动的固态驱动器(SSD)存储系统的详细模拟表明,我们的技术在不增加硬件成本的情况下,在各种工作负载上提供了46倍的平均寿命改进。我们还发现,我们的技术实现了更强的ECC无法实现的生命周期改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信