一种基于类nand的高能效SOT MRAM的高质量可配置近似缓存设计

IF 2.5 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
Zhengyi Hou , Luyao Shi , Yan Qiao , Bi Wang , Bi Wu , Lirida Naviner , Zhaohao wang
{"title":"一种基于类nand的高能效SOT MRAM的高质量可配置近似缓存设计","authors":"Zhengyi Hou ,&nbsp;Luyao Shi ,&nbsp;Yan Qiao ,&nbsp;Bi Wang ,&nbsp;Bi Wu ,&nbsp;Lirida Naviner ,&nbsp;Zhaohao wang","doi":"10.1016/j.vlsi.2025.102541","DOIUrl":null,"url":null,"abstract":"<div><div>Approximate computing is widely used for decreasing the power and area consumption of a system since there are many error-tolerant multimedia applications nowadays. NAND-like spin–orbit torque magnetic tunnel junction (SOT-MTJ) as a novel non-volatile multi-bit memory device with high speed and high area efficiency, has a location-related stochastic switching mechanism, which can achieve 100% accuracy for high significant bits (HSBs) while storing low significant bits (LSBs) approximately through the decrease of writing pulse width. Thus, it is highly suitable for the approximate storage and updating of data. In this work, we proposed a quality-configurable last-level cache based on the NAND-like SOT MRAM by utilizing location-related stochastic switching, where the write pulse width is used as an accuracy knob to switch between precise storage and approximate storage. A highly efficient write operation and a specific pipeline read policy are designed to adapt the novel cell structure and application scenarios. To evaluate the performance of the proposal, a circuit-level simulation model NVSim and a set of approximate computing benchmarks Axbench are utilized. By reducing the writing pulse width of the NAND-like SOT MTJs from 3 ns to 1 ns, the improvement of energy and latency are achieved by 42% and 64% with only 3.4% accuracy loss on average for image processing benchmarks.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"106 ","pages":"Article 102541"},"PeriodicalIF":2.5000,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A quality-configurable approximate cache design based on NAND-like SOT MRAM with high energy efficiency\",\"authors\":\"Zhengyi Hou ,&nbsp;Luyao Shi ,&nbsp;Yan Qiao ,&nbsp;Bi Wang ,&nbsp;Bi Wu ,&nbsp;Lirida Naviner ,&nbsp;Zhaohao wang\",\"doi\":\"10.1016/j.vlsi.2025.102541\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Approximate computing is widely used for decreasing the power and area consumption of a system since there are many error-tolerant multimedia applications nowadays. NAND-like spin–orbit torque magnetic tunnel junction (SOT-MTJ) as a novel non-volatile multi-bit memory device with high speed and high area efficiency, has a location-related stochastic switching mechanism, which can achieve 100% accuracy for high significant bits (HSBs) while storing low significant bits (LSBs) approximately through the decrease of writing pulse width. Thus, it is highly suitable for the approximate storage and updating of data. In this work, we proposed a quality-configurable last-level cache based on the NAND-like SOT MRAM by utilizing location-related stochastic switching, where the write pulse width is used as an accuracy knob to switch between precise storage and approximate storage. A highly efficient write operation and a specific pipeline read policy are designed to adapt the novel cell structure and application scenarios. To evaluate the performance of the proposal, a circuit-level simulation model NVSim and a set of approximate computing benchmarks Axbench are utilized. By reducing the writing pulse width of the NAND-like SOT MTJs from 3 ns to 1 ns, the improvement of energy and latency are achieved by 42% and 64% with only 3.4% accuracy loss on average for image processing benchmarks.</div></div>\",\"PeriodicalId\":54973,\"journal\":{\"name\":\"Integration-The Vlsi Journal\",\"volume\":\"106 \",\"pages\":\"Article 102541\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Integration-The Vlsi Journal\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167926025001981\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Integration-The Vlsi Journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167926025001981","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

摘要

摘要近似计算被广泛用于降低系统的功耗和面积消耗,因为现在有许多容错多媒体应用。类nand自旋-轨道转矩磁隧道结(SOT-MTJ)作为一种新型的非易失性多比特存储器件,具有高速、高面积效率的特点,具有位置相关的随机开关机制,通过减小写入脉冲宽度,可以实现高有效位(hsb) 100%的存储精度,同时近似地存储低有效位(LSBs)。因此,它非常适合于数据的近似存储和更新。在这项工作中,我们提出了一种基于类似nand的SOT MRAM的质量可配置的最后一级缓存,利用与位置相关的随机开关,其中写入脉冲宽度用作精确存储和近似存储之间切换的精度旋钮。为了适应新的cell结构和应用场景,设计了高效的写操作和特定的管道读策略。为了评估该方案的性能,使用了电路级仿真模型NVSim和一组近似计算基准Axbench。通过将类nand SOT MTJs的写入脉冲宽度从3ns减少到1ns,在图像处理基准测试中,能量和延迟分别提高了42%和64%,平均精度损失仅为3.4%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A quality-configurable approximate cache design based on NAND-like SOT MRAM with high energy efficiency
Approximate computing is widely used for decreasing the power and area consumption of a system since there are many error-tolerant multimedia applications nowadays. NAND-like spin–orbit torque magnetic tunnel junction (SOT-MTJ) as a novel non-volatile multi-bit memory device with high speed and high area efficiency, has a location-related stochastic switching mechanism, which can achieve 100% accuracy for high significant bits (HSBs) while storing low significant bits (LSBs) approximately through the decrease of writing pulse width. Thus, it is highly suitable for the approximate storage and updating of data. In this work, we proposed a quality-configurable last-level cache based on the NAND-like SOT MRAM by utilizing location-related stochastic switching, where the write pulse width is used as an accuracy knob to switch between precise storage and approximate storage. A highly efficient write operation and a specific pipeline read policy are designed to adapt the novel cell structure and application scenarios. To evaluate the performance of the proposal, a circuit-level simulation model NVSim and a set of approximate computing benchmarks Axbench are utilized. By reducing the writing pulse width of the NAND-like SOT MTJs from 3 ns to 1 ns, the improvement of energy and latency are achieved by 42% and 64% with only 3.4% accuracy loss on average for image processing benchmarks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Integration-The Vlsi Journal
Integration-The Vlsi Journal 工程技术-工程:电子与电气
CiteScore
3.80
自引率
5.30%
发文量
107
审稿时长
6 months
期刊介绍: Integration''s aim is to cover every aspect of the VLSI area, with an emphasis on cross-fertilization between various fields of science, and the design, verification, test and applications of integrated circuits and systems, as well as closely related topics in process and device technologies. Individual issues will feature peer-reviewed tutorials and articles as well as reviews of recent publications. The intended coverage of the journal can be assessed by examining the following (non-exclusive) list of topics: Specification methods and languages; Analog/Digital Integrated Circuits and Systems; VLSI architectures; Algorithms, methods and tools for modeling, simulation, synthesis and verification of integrated circuits and systems of any complexity; Embedded systems; High-level synthesis for VLSI systems; Logic synthesis and finite automata; Testing, design-for-test and test generation algorithms; Physical design; Formal verification; Algorithms implemented in VLSI systems; Systems engineering; Heterogeneous systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信