Never mind the repeat: How speech expectations reduce tracking at the cocktail party
Thaiz Sánchez-Costa, Alejandra Carboni, Francisco Cervantes Constantino
Cortex, Volume 189, Pages 1-19. Published 2025-05-20. DOI: 10.1016/j.cortex.2025.05.003
https://www.sciencedirect.com/science/article/pii/S0010945225001248
Abstract
When the brain focuses on a conversation in a noisy environment, it exploits past experience to prioritize relevant elements from the auditory scene. This prompts the question of what changes occur in the selective neural processing of speech mixtures as listeners garner prior experience about single speech objects. In three different priming experiments, we quantified cortical selection of temporal landmarks from continuous speech, applying the temporal response function (TRF) method to single-trial electroencephalography (EEG) recordings. The designs specifically addressed how attention interacts with exact (Experiment 1), voice (Experiment 2a), or message (Experiment 2b) content priming of the target or background speakers in cortical responses to speech. Our results demonstrate that, during multispeaker listening, attentional gains typical of cortical responses under speech selection are met with attenuations as a consequence of prior experience. The changes were observed at the P2 processing stage (220–320 msec) of speech envelope onset processing and were specific to responses to primed speech targets (Experiment 1). Suppressions at stages earlier than the P2, or under partial priming conditions (Experiments 2a and 2b), were not observed. An exploratory analysis suggests the observed P2 reduction predicts listeners' ability to report target words, consistent with this component encoding in part temporal prediction error about onset edge cues exclusive to target speech. Our results show that at this late and definitive stage of selective attention, the auditory system may test the evidence for its own predictive model of the noise-invariant speech stream. Precise inference of its temporal structure is bound to tag all checkpoints where auditory evidence can be most reliably connected into higher-order representations of continuous speech.
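The abstract names the temporal response function (TRF) method but does not describe how it was implemented. As a rough illustration only, a TRF is commonly estimated as a regularized linear mapping from time-lagged copies of a stimulus feature (here, a stand-in for speech envelope onsets) to the EEG signal. The sketch below uses ridge regression on synthetic data; the function names, sampling rate, lag window, ridge parameter, and simulated response are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np

def lagged_design(stimulus, lags):
    """Stack time-shifted copies of a 1-D stimulus, one column per lag (in samples)."""
    n = len(stimulus)
    X = np.zeros((n, len(lags)))
    for j, lag in enumerate(lags):
        if lag >= 0:
            X[lag:, j] = stimulus[:n - lag]   # response follows the stimulus by `lag` samples
        else:
            X[:n + lag, j] = stimulus[-lag:]  # negative lags: response precedes the stimulus
    return X

def estimate_trf(stimulus, response, lags, ridge=1.0):
    """Ridge-regression TRF estimate: solve (X'X + ridge * I) w = X'y."""
    X = lagged_design(stimulus, lags)
    gram = X.T @ X + ridge * np.eye(len(lags))
    return np.linalg.solve(gram, X.T @ response)

# Synthetic check (all values hypothetical): a response kernel peaking at 250 ms,
# chosen to fall inside the 220-320 ms window the abstract calls the P2 stage.
rng = np.random.default_rng(0)
fs = 100                                  # assumed sampling rate in Hz
lags = np.arange(0, 40)                   # 0 to 390 ms in 10 ms steps
stimulus = rng.standard_normal(20000)     # stand-in for an envelope-onset feature
true_trf = np.exp(-0.5 * ((lags - 25) / 3.0) ** 2)
noise = 0.5 * rng.standard_normal(len(stimulus))
response = lagged_design(stimulus, lags) @ true_trf + noise

trf = estimate_trf(stimulus, response, lags, ridge=1.0)
peak_ms = 1000 * lags[np.argmax(trf)] / fs  # latency of the largest TRF weight
```

In a real analysis the stimulus feature would be derived from the speech recordings, one TRF would be fit per EEG channel and condition, and the ridge parameter would typically be chosen by cross-validation; none of those choices are specified in the abstract.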
About the journal:
CORTEX is an international journal devoted to the study of cognition and of the relationship between the nervous system and mental processes, particularly as these are reflected in the behaviour of patients with acquired brain lesions, normal volunteers, children with typical and atypical development, and in the activation of brain regions and systems as recorded by functional neuroimaging techniques. It was founded in 1964 by Ennio De Renzi.