{"title":"Sustainable fault management and error correction for next-generation main memories","authors":"Donald Kline, R. Melhem, A. Jones","doi":"10.1109/IGCC.2017.8323584","DOIUrl":null,"url":null,"abstract":"As technology nodes continue to scale, main memories experience both increasing energy consumption as well as reliability challenges. In order to address rising failure rates due to problems with yield and runtime effects, such as crosstalk, due to process variation in small feature sizes, improved correction capabilities at the bit-level are increasingly essential. To address this challenge, we propose a sustainable approach to error correction in deeply scaled memories. In particular, we propose a novel area-efficient and sustainable fault map (SFaultMap) which targets holistic energy considerations to improve reliability while minimizing both operational and embodied energy. To demonstrate the effectiveness of SFaultMap we conduct a sustain-ability study, based on holistic energy consumption, to evaluate under which scenarios different solutions should be employed. In all cases and scenarios with moderate to high fault rates, SFaultMap has reduced energy over Error Correcting Pointers (ECP) for a five year lifetime. Moreover, as fault rate increases, the indifference time for ECP to recover upfront manufacturing energy increases from years to decades.","PeriodicalId":133239,"journal":{"name":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Eighth International Green and Sustainable Computing Conference (IGSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IGCC.2017.8323584","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17
Abstract
As technology nodes continue to scale, main memories experience both increasing energy consumption as well as reliability challenges. In order to address rising failure rates due to problems with yield and runtime effects, such as crosstalk, due to process variation in small feature sizes, improved correction capabilities at the bit-level are increasingly essential. To address this challenge, we propose a sustainable approach to error correction in deeply scaled memories. In particular, we propose a novel area-efficient and sustainable fault map (SFaultMap) which targets holistic energy considerations to improve reliability while minimizing both operational and embodied energy. To demonstrate the effectiveness of SFaultMap we conduct a sustain-ability study, based on holistic energy consumption, to evaluate under which scenarios different solutions should be employed. In all cases and scenarios with moderate to high fault rates, SFaultMap has reduced energy over Error Correcting Pointers (ECP) for a five year lifetime. Moreover, as fault rate increases, the indifference time for ECP to recover upfront manufacturing energy increases from years to decades.