Higher order neural processing with input-adaptive dynamic weights on MoS2 memtransistor crossbars

L. Rahimifard, Ahish Shylendra , Shamma Nasrin , Stephanie E. Liu , Vinod K. Sangwan , Mark C. Hersam , A. Trivedi
{"title":"Higher order neural processing with input-adaptive dynamic weights on MoS2 memtransistor crossbars","authors":"L. Rahimifard, Ahish Shylendra , Shamma Nasrin , Stephanie E. Liu , Vinod K. Sangwan , Mark C. Hersam , A. Trivedi","doi":"10.3389/femat.2022.950487","DOIUrl":null,"url":null,"abstract":"The increasing complexity of deep learning systems has pushed conventional computing technologies to their limits. While the memristor is one of the prevailing technologies for deep learning acceleration, it is only suited for classical learning layers where only two operands, namely weights and inputs, are processed simultaneously. Meanwhile, to improve the computational efficiency of deep learning for emerging applications, a variety of non-traditional layers requiring concurrent processing of many operands are becoming popular. For example, hypernetworks improve their predictive robustness by simultaneously processing weights and inputs against the application context. Two-electrode memristor grids cannot directly map emerging layers’ higher-order multiplicative neural interactions. Addressing this unmet need, we present crossbar processing using dual-gated memtransistors based on two-dimensional semiconductor MoS2. Unlike the memristor, the resistance states of memtransistors can be persistently programmed and can be actively controlled by multiple gate electrodes. Thus, the discussed memtransistor crossbar enables several advanced inference architectures beyond a conventional passive crossbar. For example, we show that sneak paths can be effectively suppressed in memtransistor crossbars, whereas they limit size scalability in a passive memristor crossbar. Similarly, exploiting gate terminals to suppress crossbar weights dynamically reduces biasing power by ∼20% in memtransistor crossbars for a fully connected layer of AlexNet. On emerging layers such as hypernetworks, collocating multiple operations within the same crossbar cells reduces operating power by ∼ 15 × on the considered network cases.","PeriodicalId":119676,"journal":{"name":"Frontiers in Electronic Materials","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Electronic Materials","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/femat.2022.950487","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The increasing complexity of deep learning systems has pushed conventional computing technologies to their limits. While the memristor is one of the prevailing technologies for deep learning acceleration, it is only suited for classical learning layers where only two operands, namely weights and inputs, are processed simultaneously. Meanwhile, to improve the computational efficiency of deep learning for emerging applications, a variety of non-traditional layers requiring concurrent processing of many operands are becoming popular. For example, hypernetworks improve their predictive robustness by simultaneously processing weights and inputs against the application context. Two-electrode memristor grids cannot directly map emerging layers’ higher-order multiplicative neural interactions. Addressing this unmet need, we present crossbar processing using dual-gated memtransistors based on two-dimensional semiconductor MoS2. Unlike the memristor, the resistance states of memtransistors can be persistently programmed and can be actively controlled by multiple gate electrodes. Thus, the discussed memtransistor crossbar enables several advanced inference architectures beyond a conventional passive crossbar. For example, we show that sneak paths can be effectively suppressed in memtransistor crossbars, whereas they limit size scalability in a passive memristor crossbar. Similarly, exploiting gate terminals to suppress crossbar weights dynamically reduces biasing power by ∼20% in memtransistor crossbars for a fully connected layer of AlexNet. On emerging layers such as hypernetworks, collocating multiple operations within the same crossbar cells reduces operating power by ∼ 15 × on the considered network cases.
具有输入自适应动态权值的MoS2 mem晶体管交叉栅高阶神经处理
深度学习系统的日益复杂已经将传统的计算技术推向了极限。同时,为了提高新兴应用中深度学习的计算效率,各种需要并发处理多个操作数的非传统层开始流行。例如,超级网络通过针对应用程序上下文同时处理权重和输入来提高其预测健壮性。双电极记忆电阻网格不能直接映射新兴层的高阶乘法神经相互作用。为了解决这一未满足的需求,我们提出了基于二维半导体MoS2的双门控memtransistor的crossbar处理。与忆阻器不同,忆阻晶体管的电阻状态可以持续编程,并且可以由多个栅极主动控制。因此,所讨论的memtransistor crossbar能够实现超越传统无源crossbar的几种高级推理架构。例如,我们证明在memtransistor crossbar中可以有效地抑制潜行路径,然而它们限制了无源记忆电阻器crossbar的尺寸可扩展性。类似地,利用栅极终端动态地抑制交叉栅权重,可将AlexNet完全连接层的mem晶体管交叉栅中的偏置功率降低约20%。在诸如超级网络之类的新兴层中,在考虑的网络情况下,在相同的交叉单元中配置多个操作可将操作功率降低约15倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信