Alejandro Pan-Vazquez, Yoel Sanchez Araujo, Brenna McMannon, Miranta Louka, Akhil Bandi, Laura Haetzel, Mayo Faulkner, Jonathan W Pillow, Nathaniel D Daw, Ilana B Witten
{"title":"Pre-existing visual responses in a projection-defined dopamine population explain individual learning trajectories.","authors":"Alejandro Pan-Vazquez, Yoel Sanchez Araujo, Brenna McMannon, Miranta Louka, Akhil Bandi, Laura Haetzel, Mayo Faulkner, Jonathan W Pillow, Nathaniel D Daw, Ilana B Witten","doi":"10.1016/j.cub.2024.09.045","DOIUrl":null,"url":null,"abstract":"<p><p>A key challenge of learning a new task is that the environment is high dimensional-there are many different sensory features and possible actions, with typically only a small reward-relevant subset. Although animals can learn to perform complex tasks that involve arbitrary associations between stimuli, actions, and rewards,<sup>1</sup><sup>,</sup><sup>2</sup><sup>,</sup><sup>3</sup><sup>,</sup><sup>4</sup><sup>,</sup><sup>5</sup><sup>,</sup><sup>6</sup> a consistent and striking result across varied experimental paradigms is that in initially acquiring such tasks, large differences between individuals are apparent in the learning process.<sup>7</sup><sup>,</sup><sup>8</sup><sup>,</sup><sup>9</sup><sup>,</sup><sup>10</sup><sup>,</sup><sup>11</sup><sup>,</sup><sup>12</sup> What neural mechanisms contribute to initial task acquisition, and why do some individuals learn a new task much more quickly than others? To address these questions, we recorded longitudinally from dopaminergic (DA) axon terminals in mice learning a visual decision-making task.<sup>7</sup> Across striatum, DA responses tracked idiosyncratic and side-specific learning trajectories, consistent with widespread reward prediction error coding across DA terminals. However, even before any rewards were delivered, contralateral-side-specific visual responses were present in DA terminals, primarily in the dorsomedial striatum (DMS). These pre-existing responses predicted the extent of learning for contralateral stimuli. Moreover, activation of these terminals improved contralateral performance. Thus, the initial conditions of a projection-specific and feature-specific DA signal help explain individual learning trajectories. More broadly, this work suggests that functional heterogeneity across DA projections may serve to bias target regions toward learning about different subsets of task features, providing a potential mechanism to address the dimensionality of the initial task learning problem.</p>","PeriodicalId":11359,"journal":{"name":"Current Biology","volume":" ","pages":"5349-5358.e6"},"PeriodicalIF":8.1000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11579926/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.cub.2024.09.045","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/16 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
A key challenge of learning a new task is that the environment is high dimensional-there are many different sensory features and possible actions, with typically only a small reward-relevant subset. Although animals can learn to perform complex tasks that involve arbitrary associations between stimuli, actions, and rewards,1,2,3,4,5,6 a consistent and striking result across varied experimental paradigms is that in initially acquiring such tasks, large differences between individuals are apparent in the learning process.7,8,9,10,11,12 What neural mechanisms contribute to initial task acquisition, and why do some individuals learn a new task much more quickly than others? To address these questions, we recorded longitudinally from dopaminergic (DA) axon terminals in mice learning a visual decision-making task.7 Across striatum, DA responses tracked idiosyncratic and side-specific learning trajectories, consistent with widespread reward prediction error coding across DA terminals. However, even before any rewards were delivered, contralateral-side-specific visual responses were present in DA terminals, primarily in the dorsomedial striatum (DMS). These pre-existing responses predicted the extent of learning for contralateral stimuli. Moreover, activation of these terminals improved contralateral performance. Thus, the initial conditions of a projection-specific and feature-specific DA signal help explain individual learning trajectories. More broadly, this work suggests that functional heterogeneity across DA projections may serve to bias target regions toward learning about different subsets of task features, providing a potential mechanism to address the dimensionality of the initial task learning problem.
期刊介绍:
Current Biology is a comprehensive journal that showcases original research in various disciplines of biology. It provides a platform for scientists to disseminate their groundbreaking findings and promotes interdisciplinary communication. The journal publishes articles of general interest, encompassing diverse fields of biology. Moreover, it offers accessible editorial pieces that are specifically designed to enlighten non-specialist readers.