小脑与基底神经节协同强化学习。

IF 4 2区医学 Q1 NEUROSCIENCES

Journal of Neuroscience Pub Date : 2025-06-04 DOI:10.1523/JNEUROSCI.1464-24.2025

Tatsumi Yoshida, Hikaru Sugino, Hinako Yamamoto, Sho Tanno, Mikihide Tamura, Jun Igarashi, Yoshikazu Isomura, Riichiro Hira

{"title":"小脑与基底神经节协同强化学习。","authors":"Tatsumi Yoshida, Hikaru Sugino, Hinako Yamamoto, Sho Tanno, Mikihide Tamura, Jun Igarashi, Yoshikazu Isomura, Riichiro Hira","doi":"10.1523/JNEUROSCI.1464-24.2025","DOIUrl":null,"url":null,"abstract":"The cerebral cortex, cerebellum, and basal ganglia are essential for flexible learning in mammals. Although traditionally thought to operate under different learning rules, recent evidence suggests that both the basal ganglia and the cerebellum may employ reinforcement learning mechanisms. This raises the question of how these structures coordinate when a common reward prediction error mechanism is active. To address this issue, we first examined output signals from the basal ganglia and cerebellum following the activity of the cerebral cortex. We recorded single-neuron activity from the output regions of the cerebellum and basal ganglia-the cerebellar nuclei (CN) and substantia nigra pars reticulata (SNr)-in both male and female ChR2 transgenic rats. Neurons in the CN and SNr exhibited distinct temporal response patterns; notably, the fast excitatory response in the CN, driven by mossy fiber input, was synchronized with the inhibitory response in the SNr, mediated via the direct pathway. Using these experimental findings together with connectome data, we developed both a semirealistic spiking network model and a reservoir-based reinforcement learning model. In the latter model, successful learning depended on synaptic plasticity in both the cerebellum and basal ganglia with a temporal precision on the order of 10 ms. Furthermore, cortical β-oscillations enhanced learning and optimal reinforcement learning occurred when the output of cerebellar and basal ganglia signal phase-locked at the frequency of cortical oscillation. Taken together, our results suggest that the coordinated output of the cerebellum and basal ganglia, driven by tightly tuned cortical input, underlies brain-wide synergistic reinforcement learning.","PeriodicalId":50114,"journal":{"name":"Journal of Neuroscience","volume":" ","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12139595/pdf/","citationCount":"0","resultStr":"{\"title\":\"Synergistic Reinforcement Learning by Cooperation of the Cerebellum and Basal Ganglia.\",\"authors\":\"Tatsumi Yoshida, Hikaru Sugino, Hinako Yamamoto, Sho Tanno, Mikihide Tamura, Jun Igarashi, Yoshikazu Isomura, Riichiro Hira\",\"doi\":\"10.1523/JNEUROSCI.1464-24.2025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The cerebral cortex, cerebellum, and basal ganglia are essential for flexible learning in mammals. Although traditionally thought to operate under different learning rules, recent evidence suggests that both the basal ganglia and the cerebellum may employ reinforcement learning mechanisms. This raises the question of how these structures coordinate when a common reward prediction error mechanism is active. To address this issue, we first examined output signals from the basal ganglia and cerebellum following the activity of the cerebral cortex. We recorded single-neuron activity from the output regions of the cerebellum and basal ganglia-the cerebellar nuclei (CN) and substantia nigra pars reticulata (SNr)-in both male and female ChR2 transgenic rats. Neurons in the CN and SNr exhibited distinct temporal response patterns; notably, the fast excitatory response in the CN, driven by mossy fiber input, was synchronized with the inhibitory response in the SNr, mediated via the direct pathway. Using these experimental findings together with connectome data, we developed both a semirealistic spiking network model and a reservoir-based reinforcement learning model. In the latter model, successful learning depended on synaptic plasticity in both the cerebellum and basal ganglia with a temporal precision on the order of 10 ms. Furthermore, cortical β-oscillations enhanced learning and optimal reinforcement learning occurred when the output of cerebellar and basal ganglia signal phase-locked at the frequency of cortical oscillation. Taken together, our results suggest that the coordinated output of the cerebellum and basal ganglia, driven by tightly tuned cortical input, underlies brain-wide synergistic reinforcement learning.\",\"PeriodicalId\":50114,\"journal\":{\"name\":\"Journal of Neuroscience\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2025-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12139595/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1523/JNEUROSCI.1464-24.2025\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1523/JNEUROSCI.1464-24.2025","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

哺乳动物的大脑皮层、小脑和基底神经节对灵活学习至关重要。虽然传统上认为基底神经节和小脑在不同的学习规则下运作，但最近的证据表明，基底神经节和小脑都可能采用强化学习机制。这就提出了一个问题，当一个常见的奖励预测错误机制活跃时，这些结构是如何协调的。为了解决这个问题，我们首先检查了大脑皮层活动后基底神经节和小脑的输出信号。我们记录了雄性和雌性ChR2转基因大鼠小脑和基底神经节输出区——小脑核（CN）和网状黑质（SNr）的单个神经元活动。中枢神经网络和信噪比神经元表现出不同的时间响应模式；值得注意的是，由苔藓纤维输入驱动的CN中的快速兴奋反应与SNr中的抑制反应同步，通过直接途径介导。利用这些实验结果和连接体数据，我们开发了一个半真实的尖峰网络模型和基于水库的强化学习模型。在后一种模型中，成功的学习依赖于小脑和基底节区突触的可塑性，其时间精度在10毫秒左右。此外，当小脑和基底节区信号输出锁相于皮质振荡频率时，皮质β振荡增强了学习和最优强化学习。综上所述，我们的研究结果表明，小脑和基底神经节的协调输出，由紧密调谐的皮层输入驱动，是全脑协同强化学习的基础。大脑皮层、小脑和基底神经节支持学习。最近的研究表明，基底神经节和小脑都使用类似的学习过程，称为强化学习，其中包括预测奖励。为了了解这些大脑区域是如何协同工作的，我们在光刺激大鼠的大脑皮层时记录了大脑活动。我们发现小脑和基底神经节的两种反应是同步的，这可能有助于激活大脑皮层。计算机模型显示，来自小脑和基底神经节的信号的精确定时对学习很重要。只有当大脑皮层在特定的频率范围内工作时，这个时间才重要。我们的研究结果表明，协调的大脑活动可以提高学习能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Synergistic Reinforcement Learning by Cooperation of the Cerebellum and Basal Ganglia.

The cerebral cortex, cerebellum, and basal ganglia are essential for flexible learning in mammals. Although traditionally thought to operate under different learning rules, recent evidence suggests that both the basal ganglia and the cerebellum may employ reinforcement learning mechanisms. This raises the question of how these structures coordinate when a common reward prediction error mechanism is active. To address this issue, we first examined output signals from the basal ganglia and cerebellum following the activity of the cerebral cortex. We recorded single-neuron activity from the output regions of the cerebellum and basal ganglia-the cerebellar nuclei (CN) and substantia nigra pars reticulata (SNr)-in both male and female ChR2 transgenic rats. Neurons in the CN and SNr exhibited distinct temporal response patterns; notably, the fast excitatory response in the CN, driven by mossy fiber input, was synchronized with the inhibitory response in the SNr, mediated via the direct pathway. Using these experimental findings together with connectome data, we developed both a semirealistic spiking network model and a reservoir-based reinforcement learning model. In the latter model, successful learning depended on synaptic plasticity in both the cerebellum and basal ganglia with a temporal precision on the order of 10 ms. Furthermore, cortical β-oscillations enhanced learning and optimal reinforcement learning occurred when the output of cerebellar and basal ganglia signal phase-locked at the frequency of cortical oscillation. Taken together, our results suggest that the coordinated output of the cerebellum and basal ganglia, driven by tightly tuned cortical input, underlies brain-wide synergistic reinforcement learning.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Neuroscience 医学-神经科学

CiteScore

9.30

自引率

3.80%

发文量

1164

审稿时长

12 months

期刊介绍： JNeurosci (ISSN 0270-6474) is an official journal of the Society for Neuroscience. It is published weekly by the Society, fifty weeks a year, one volume a year. JNeurosci publishes papers on a broad range of topics of general interest to those working on the nervous system. Authors now have an Open Choice option for their published articles