Yi Cao, Jinhao Liang, Tao Liu, Weihui Sang, Yang Gan, Honghong Li, Yue Wang, Zheng Ren, Yuan Yu, Zhou Xin, Yukang Chen, Xumeng Zhang, Du Xiang, Qi Liu
{"title":"Reward-modulated spike-timing-dependent plasticity in van der Waals ferroelectric memtransistor for robotic recognition and tracking.","authors":"Yi Cao, Jinhao Liang, Tao Liu, Weihui Sang, Yang Gan, Honghong Li, Yue Wang, Zheng Ren, Yuan Yu, Zhou Xin, Yukang Chen, Xumeng Zhang, Du Xiang, Qi Liu","doi":"10.1016/j.scib.2025.05.044","DOIUrl":null,"url":null,"abstract":"<p><p>Reward-modulated spike-timing-dependent plasticity (R-STDP) is a promising biomimetic learning rule in neuromorphic intelligent systems for implementing tasks in variable environments. Nevertheless, realizing R-STDP in a single synaptic device for building compact and energy-efficient neuromorphic systems remains challenging. Here, we report a two-dimensional ferroelectric memtransistor to emulate the R-STDP learning rule by effectively reconfiguring the STDP and anti-STDP. The thermionic emission and tunneling behavior of charges at the ferroelectric interface can be regulated via vertical electric field in a multi-terminal manner, allowing for controllable polarization reversal of synaptic plasticity and transition between STDP and anti-STDP. This enables faithful realization of the R-STDP feature in a single device with energy consumption of ∼1.3 nJ (the lowest known to date), approximately 10<sup>6</sup> times lower than that of its complementary metal-oxide-semiconductor (CMOS) counterpart. By leveraging the synaptic characteristics in the hardware device, we construct spiking neural networks (SNNs) trained with R-STDP to perform robotic recognition and tracking tasks. The SNN achieves 95.1% accuracy on the MNIST dataset using only 8000 parameters, and faster convergence speed requiring only one data batch with 100% inference in the few-shot learning task. Moreover, a robotic arm motion control system configured with R-STDP exhibits 85.5% success rate in tracking both the static and moving targets, illustrating its outstanding adaptability to the dynamic environments. This work provides a potential hardware building block to support compact neuromorphic systems for the application of interactive artificial intelligence agents.</p>","PeriodicalId":421,"journal":{"name":"Science Bulletin","volume":" ","pages":""},"PeriodicalIF":21.1000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science Bulletin","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1016/j.scib.2025.05.044","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Reward-modulated spike-timing-dependent plasticity (R-STDP) is a promising biomimetic learning rule in neuromorphic intelligent systems for implementing tasks in variable environments. Nevertheless, realizing R-STDP in a single synaptic device for building compact and energy-efficient neuromorphic systems remains challenging. Here, we report a two-dimensional ferroelectric memtransistor to emulate the R-STDP learning rule by effectively reconfiguring the STDP and anti-STDP. The thermionic emission and tunneling behavior of charges at the ferroelectric interface can be regulated via vertical electric field in a multi-terminal manner, allowing for controllable polarization reversal of synaptic plasticity and transition between STDP and anti-STDP. This enables faithful realization of the R-STDP feature in a single device with energy consumption of ∼1.3 nJ (the lowest known to date), approximately 106 times lower than that of its complementary metal-oxide-semiconductor (CMOS) counterpart. By leveraging the synaptic characteristics in the hardware device, we construct spiking neural networks (SNNs) trained with R-STDP to perform robotic recognition and tracking tasks. The SNN achieves 95.1% accuracy on the MNIST dataset using only 8000 parameters, and faster convergence speed requiring only one data batch with 100% inference in the few-shot learning task. Moreover, a robotic arm motion control system configured with R-STDP exhibits 85.5% success rate in tracking both the static and moving targets, illustrating its outstanding adaptability to the dynamic environments. This work provides a potential hardware building block to support compact neuromorphic systems for the application of interactive artificial intelligence agents.
期刊介绍:
Science Bulletin (Sci. Bull., formerly known as Chinese Science Bulletin) is a multidisciplinary academic journal supervised by the Chinese Academy of Sciences (CAS) and co-sponsored by the CAS and the National Natural Science Foundation of China (NSFC). Sci. Bull. is a semi-monthly international journal publishing high-caliber peer-reviewed research on a broad range of natural sciences and high-tech fields on the basis of its originality, scientific significance and whether it is of general interest. In addition, we are committed to serving the scientific community with immediate, authoritative news and valuable insights into upcoming trends around the globe.