Reward-modulated spike-timing-dependent plasticity in van der Waals ferroelectric memtransistor for robotic recognition and tracking.

IF 21.1 1区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

Science Bulletin Pub Date : 2025-06-04 DOI:10.1016/j.scib.2025.05.044

Yi Cao, Jinhao Liang, Tao Liu, Weihui Sang, Yang Gan, Honghong Li, Yue Wang, Zheng Ren, Yuan Yu, Zhou Xin, Yukang Chen, Xumeng Zhang, Du Xiang, Qi Liu

{"title":"Reward-modulated spike-timing-dependent plasticity in van der Waals ferroelectric memtransistor for robotic recognition and tracking.","authors":"Yi Cao, Jinhao Liang, Tao Liu, Weihui Sang, Yang Gan, Honghong Li, Yue Wang, Zheng Ren, Yuan Yu, Zhou Xin, Yukang Chen, Xumeng Zhang, Du Xiang, Qi Liu","doi":"10.1016/j.scib.2025.05.044","DOIUrl":null,"url":null,"abstract":"Reward-modulated spike-timing-dependent plasticity (R-STDP) is a promising biomimetic learning rule in neuromorphic intelligent systems for implementing tasks in variable environments. Nevertheless, realizing R-STDP in a single synaptic device for building compact and energy-efficient neuromorphic systems remains challenging. Here, we report a two-dimensional ferroelectric memtransistor to emulate the R-STDP learning rule by effectively reconfiguring the STDP and anti-STDP. The thermionic emission and tunneling behavior of charges at the ferroelectric interface can be regulated via vertical electric field in a multi-terminal manner, allowing for controllable polarization reversal of synaptic plasticity and transition between STDP and anti-STDP. This enables faithful realization of the R-STDP feature in a single device with energy consumption of ∼1.3 nJ (the lowest known to date), approximately 106 times lower than that of its complementary metal-oxide-semiconductor (CMOS) counterpart. By leveraging the synaptic characteristics in the hardware device, we construct spiking neural networks (SNNs) trained with R-STDP to perform robotic recognition and tracking tasks. The SNN achieves 95.1% accuracy on the MNIST dataset using only 8000 parameters, and faster convergence speed requiring only one data batch with 100% inference in the few-shot learning task. Moreover, a robotic arm motion control system configured with R-STDP exhibits 85.5% success rate in tracking both the static and moving targets, illustrating its outstanding adaptability to the dynamic environments. This work provides a potential hardware building block to support compact neuromorphic systems for the application of interactive artificial intelligence agents.","PeriodicalId":421,"journal":{"name":"Science Bulletin","volume":" ","pages":""},"PeriodicalIF":21.1000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science Bulletin","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1016/j.scib.2025.05.044","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Reward-modulated spike-timing-dependent plasticity (R-STDP) is a promising biomimetic learning rule in neuromorphic intelligent systems for implementing tasks in variable environments. Nevertheless, realizing R-STDP in a single synaptic device for building compact and energy-efficient neuromorphic systems remains challenging. Here, we report a two-dimensional ferroelectric memtransistor to emulate the R-STDP learning rule by effectively reconfiguring the STDP and anti-STDP. The thermionic emission and tunneling behavior of charges at the ferroelectric interface can be regulated via vertical electric field in a multi-terminal manner, allowing for controllable polarization reversal of synaptic plasticity and transition between STDP and anti-STDP. This enables faithful realization of the R-STDP feature in a single device with energy consumption of ∼1.3 nJ (the lowest known to date), approximately 10⁶ times lower than that of its complementary metal-oxide-semiconductor (CMOS) counterpart. By leveraging the synaptic characteristics in the hardware device, we construct spiking neural networks (SNNs) trained with R-STDP to perform robotic recognition and tracking tasks. The SNN achieves 95.1% accuracy on the MNIST dataset using only 8000 parameters, and faster convergence speed requiring only one data batch with 100% inference in the few-shot learning task. Moreover, a robotic arm motion control system configured with R-STDP exhibits 85.5% success rate in tracking both the static and moving targets, illustrating its outstanding adaptability to the dynamic environments. This work provides a potential hardware building block to support compact neuromorphic systems for the application of interactive artificial intelligence agents.

查看原文本刊更多论文

用于机器人识别和跟踪的范德华铁电记忆晶体管的奖励调制峰值时间依赖塑性。

奖励调节的spike- time -dependent plasticity （R-STDP）是一种很有前途的仿生学习规则，用于神经形态智能系统在可变环境中执行任务。然而，在单一突触装置中实现R-STDP以构建紧凑和节能的神经形态系统仍然具有挑战性。在这里，我们报告了一个二维铁电mem晶体管，通过有效地重新配置STDP和反STDP来模拟R-STDP学习规则。铁电界面电荷的热离子发射和隧穿行为可以通过垂直电场以多端方式调节，从而实现突触可塑性的可控极化反转和STDP与反STDP之间的过渡。这使得在单个器件中忠实地实现R-STDP特性，能耗约为1.3 nJ（迄今为止已知的最低），比互补金属氧化物半导体（CMOS）对口器件低约106倍。通过利用硬件设备中的突触特征，我们构建了用R-STDP训练的尖峰神经网络（snn）来执行机器人识别和跟踪任务。SNN在仅使用8000个参数的MNIST数据集上达到95.1%的准确率，并且在few-shot学习任务中只需一个数据批就可以实现100%的推理，并且收敛速度更快。此外，配置R-STDP的机械臂运动控制系统对静态和运动目标的跟踪成功率均达到85.5%，显示出其对动态环境的良好适应性。这项工作提供了一个潜在的硬件构建块，以支持紧凑的神经形态系统，用于交互式人工智能代理的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Science Bulletin MULTIDISCIPLINARY SCIENCES-

CiteScore

24.60

自引率

2.10%

发文量

8092

期刊介绍： Science Bulletin (Sci. Bull., formerly known as Chinese Science Bulletin) is a multidisciplinary academic journal supervised by the Chinese Academy of Sciences (CAS) and co-sponsored by the CAS and the National Natural Science Foundation of China (NSFC). Sci. Bull. is a semi-monthly international journal publishing high-caliber peer-reviewed research on a broad range of natural sciences and high-tech fields on the basis of its originality, scientific significance and whether it is of general interest. In addition, we are committed to serving the scientific community with immediate, authoritative news and valuable insights into upcoming trends around the globe.