Reward-modulated spike-timing-dependent plasticity in van der Waals ferroelectric memtransistor for robotic recognition and tracking.

IF 21.1 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Yi Cao, Jinhao Liang, Tao Liu, Weihui Sang, Yang Gan, Honghong Li, Yue Wang, Zheng Ren, Yuan Yu, Zhou Xin, Yukang Chen, Xumeng Zhang, Du Xiang, Qi Liu
{"title":"Reward-modulated spike-timing-dependent plasticity in van der Waals ferroelectric memtransistor for robotic recognition and tracking.","authors":"Yi Cao, Jinhao Liang, Tao Liu, Weihui Sang, Yang Gan, Honghong Li, Yue Wang, Zheng Ren, Yuan Yu, Zhou Xin, Yukang Chen, Xumeng Zhang, Du Xiang, Qi Liu","doi":"10.1016/j.scib.2025.05.044","DOIUrl":null,"url":null,"abstract":"<p><p>Reward-modulated spike-timing-dependent plasticity (R-STDP) is a promising biomimetic learning rule in neuromorphic intelligent systems for implementing tasks in variable environments. Nevertheless, realizing R-STDP in a single synaptic device for building compact and energy-efficient neuromorphic systems remains challenging. Here, we report a two-dimensional ferroelectric memtransistor to emulate the R-STDP learning rule by effectively reconfiguring the STDP and anti-STDP. The thermionic emission and tunneling behavior of charges at the ferroelectric interface can be regulated via vertical electric field in a multi-terminal manner, allowing for controllable polarization reversal of synaptic plasticity and transition between STDP and anti-STDP. This enables faithful realization of the R-STDP feature in a single device with energy consumption of ∼1.3 nJ (the lowest known to date), approximately 10<sup>6</sup> times lower than that of its complementary metal-oxide-semiconductor (CMOS) counterpart. By leveraging the synaptic characteristics in the hardware device, we construct spiking neural networks (SNNs) trained with R-STDP to perform robotic recognition and tracking tasks. The SNN achieves 95.1% accuracy on the MNIST dataset using only 8000 parameters, and faster convergence speed requiring only one data batch with 100% inference in the few-shot learning task. Moreover, a robotic arm motion control system configured with R-STDP exhibits 85.5% success rate in tracking both the static and moving targets, illustrating its outstanding adaptability to the dynamic environments. This work provides a potential hardware building block to support compact neuromorphic systems for the application of interactive artificial intelligence agents.</p>","PeriodicalId":421,"journal":{"name":"Science Bulletin","volume":" ","pages":""},"PeriodicalIF":21.1000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science Bulletin","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1016/j.scib.2025.05.044","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Reward-modulated spike-timing-dependent plasticity (R-STDP) is a promising biomimetic learning rule in neuromorphic intelligent systems for implementing tasks in variable environments. Nevertheless, realizing R-STDP in a single synaptic device for building compact and energy-efficient neuromorphic systems remains challenging. Here, we report a two-dimensional ferroelectric memtransistor to emulate the R-STDP learning rule by effectively reconfiguring the STDP and anti-STDP. The thermionic emission and tunneling behavior of charges at the ferroelectric interface can be regulated via vertical electric field in a multi-terminal manner, allowing for controllable polarization reversal of synaptic plasticity and transition between STDP and anti-STDP. This enables faithful realization of the R-STDP feature in a single device with energy consumption of ∼1.3 nJ (the lowest known to date), approximately 106 times lower than that of its complementary metal-oxide-semiconductor (CMOS) counterpart. By leveraging the synaptic characteristics in the hardware device, we construct spiking neural networks (SNNs) trained with R-STDP to perform robotic recognition and tracking tasks. The SNN achieves 95.1% accuracy on the MNIST dataset using only 8000 parameters, and faster convergence speed requiring only one data batch with 100% inference in the few-shot learning task. Moreover, a robotic arm motion control system configured with R-STDP exhibits 85.5% success rate in tracking both the static and moving targets, illustrating its outstanding adaptability to the dynamic environments. This work provides a potential hardware building block to support compact neuromorphic systems for the application of interactive artificial intelligence agents.

用于机器人识别和跟踪的范德华铁电记忆晶体管的奖励调制峰值时间依赖塑性。
奖励调节的spike- time -dependent plasticity (R-STDP)是一种很有前途的仿生学习规则,用于神经形态智能系统在可变环境中执行任务。然而,在单一突触装置中实现R-STDP以构建紧凑和节能的神经形态系统仍然具有挑战性。在这里,我们报告了一个二维铁电mem晶体管,通过有效地重新配置STDP和反STDP来模拟R-STDP学习规则。铁电界面电荷的热离子发射和隧穿行为可以通过垂直电场以多端方式调节,从而实现突触可塑性的可控极化反转和STDP与反STDP之间的过渡。这使得在单个器件中忠实地实现R-STDP特性,能耗约为1.3 nJ(迄今为止已知的最低),比互补金属氧化物半导体(CMOS)对口器件低约106倍。通过利用硬件设备中的突触特征,我们构建了用R-STDP训练的尖峰神经网络(snn)来执行机器人识别和跟踪任务。SNN在仅使用8000个参数的MNIST数据集上达到95.1%的准确率,并且在few-shot学习任务中只需一个数据批就可以实现100%的推理,并且收敛速度更快。此外,配置R-STDP的机械臂运动控制系统对静态和运动目标的跟踪成功率均达到85.5%,显示出其对动态环境的良好适应性。这项工作提供了一个潜在的硬件构建块,以支持紧凑的神经形态系统,用于交互式人工智能代理的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Science Bulletin
Science Bulletin MULTIDISCIPLINARY SCIENCES-
CiteScore
24.60
自引率
2.10%
发文量
8092
期刊介绍: Science Bulletin (Sci. Bull., formerly known as Chinese Science Bulletin) is a multidisciplinary academic journal supervised by the Chinese Academy of Sciences (CAS) and co-sponsored by the CAS and the National Natural Science Foundation of China (NSFC). Sci. Bull. is a semi-monthly international journal publishing high-caliber peer-reviewed research on a broad range of natural sciences and high-tech fields on the basis of its originality, scientific significance and whether it is of general interest. In addition, we are committed to serving the scientific community with immediate, authoritative news and valuable insights into upcoming trends around the globe.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信