A hierarchy of processing complexity and timescales for natural sounds in the human auditory cortex

IF 9.4 1区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

Proceedings of the National Academy of Sciences of the United States of America Pub Date : 2025-04-28 DOI:10.1073/pnas.2412243122

Kyle M. Rupp, Jasmine L. Hect, Emily E. Harford, Lori L. Holt, Avniel Singh Ghuman, Taylor J. Abel

{"title":"A hierarchy of processing complexity and timescales for natural sounds in the human auditory cortex","authors":"Kyle M. Rupp, Jasmine L. Hect, Emily E. Harford, Lori L. Holt, Avniel Singh Ghuman, Taylor J. Abel","doi":"10.1073/pnas.2412243122","DOIUrl":null,"url":null,"abstract":"Efficient behavior is supported by humans’ ability to rapidly recognize acoustically distinct sounds as members of a common category. Within the auditory cortex, critical unanswered questions remain regarding the organization and dynamics of sound categorization. We performed intracerebral recordings during epilepsy surgery evaluation as 20 patient-participants listened to natural sounds. We then built encoding models to predict neural responses using sound representations extracted from different layers within a deep neural network (DNN) pretrained to categorize sounds from acoustics. This approach yielded accurate models of neural responses throughout the auditory cortex. The complexity of a cortical site’s representation (measured by the depth of the DNN layer that produced the best model) was closely related to its anatomical location, with shallow, middle, and deep layers associated with core (primary auditory cortex), lateral belt, and parabelt regions, respectively. Smoothly varying gradients of representational complexity existed within these regions, with complexity increasing along a posteromedial-to-anterolateral direction in core and lateral belt and along posterior-to-anterior and dorsal-to-ventral dimensions in parabelt. We then characterized the time (relative to sound onset) when feature representations emerged; this measure of temporal dynamics increased across the auditory hierarchy. Finally, we found separable effects of region and temporal dynamics on representational complexity: sites that took longer to begin encoding stimulus features had higher representational complexity independent of region, and downstream regions encoded more complex features independent of temporal dynamics. These findings suggest that hierarchies of timescales and complexity represent a functional organizational principle of the auditory stream underlying our ability to rapidly categorize sounds.","PeriodicalId":20548,"journal":{"name":"Proceedings of the National Academy of Sciences of the United States of America","volume":"82 1","pages":""},"PeriodicalIF":9.4000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the National Academy of Sciences of the United States of America","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1073/pnas.2412243122","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Efficient behavior is supported by humans’ ability to rapidly recognize acoustically distinct sounds as members of a common category. Within the auditory cortex, critical unanswered questions remain regarding the organization and dynamics of sound categorization. We performed intracerebral recordings during epilepsy surgery evaluation as 20 patient-participants listened to natural sounds. We then built encoding models to predict neural responses using sound representations extracted from different layers within a deep neural network (DNN) pretrained to categorize sounds from acoustics. This approach yielded accurate models of neural responses throughout the auditory cortex. The complexity of a cortical site’s representation (measured by the depth of the DNN layer that produced the best model) was closely related to its anatomical location, with shallow, middle, and deep layers associated with core (primary auditory cortex), lateral belt, and parabelt regions, respectively. Smoothly varying gradients of representational complexity existed within these regions, with complexity increasing along a posteromedial-to-anterolateral direction in core and lateral belt and along posterior-to-anterior and dorsal-to-ventral dimensions in parabelt. We then characterized the time (relative to sound onset) when feature representations emerged; this measure of temporal dynamics increased across the auditory hierarchy. Finally, we found separable effects of region and temporal dynamics on representational complexity: sites that took longer to begin encoding stimulus features had higher representational complexity independent of region, and downstream regions encoded more complex features independent of temporal dynamics. These findings suggest that hierarchies of timescales and complexity represent a functional organizational principle of the auditory stream underlying our ability to rapidly categorize sounds.

查看原文本刊更多论文

人类听觉皮层处理自然声音的复杂性和时间尺度的层次结构

高效的行为是由人类能够快速识别不同的声音作为一个共同类别的成员的能力支持的。在听觉皮层中，关于声音分类的组织和动态的关键未解问题仍然存在。在癫痫手术评估期间，我们对20名患者参与者进行了脑内录音，让他们听自然声音。然后，我们建立编码模型来预测神经反应，使用预训练的深度神经网络（DNN）中从不同层提取的声音表示来对声学中的声音进行分类。这种方法产生了贯穿听觉皮层的精确的神经反应模型。皮层部位表征的复杂性（通过产生最佳模型的DNN层的深度来测量）与其解剖位置密切相关，浅层、中层和深层分别与核心（初级听觉皮层）、侧带和旁带区域相关。在这些区域内，表征复杂性的梯度呈平滑变化，核心和外侧带的复杂性沿后内侧到前外侧方向增加，抛物线带的复杂性沿后前方和背侧到腹侧方向增加。然后，我们描述了特征表征出现时的时间（相对于声音开始）；这种对时间动态的测量在听觉层次上有所增加。最后，我们发现区域和时间动态对表征复杂性的影响是可分离的：开始编码刺激特征所需时间较长的区域具有较高的表征复杂性，与区域无关，下游区域编码更复杂的特征，与时间动态无关。这些发现表明，时间尺度的层次和复杂性代表了听觉流的功能组织原则，这是我们快速分类声音的能力的基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the National Academy of Sciences of the United States of America 综合性期刊-综合性期刊

CiteScore

19.00

自引率

0.90%

发文量

3575

审稿时长

2.5 months

期刊介绍： The Proceedings of the National Academy of Sciences (PNAS), a peer-reviewed journal of the National Academy of Sciences (NAS), serves as an authoritative source for high-impact, original research across the biological, physical, and social sciences. With a global scope, the journal welcomes submissions from researchers worldwide, making it an inclusive platform for advancing scientific knowledge.