{"title":"调制滤波语音的感知学习。","authors":"James M Webb, Ediz Sohoglu","doi":"10.1037/xhp0001274","DOIUrl":null,"url":null,"abstract":"<p><p>Human listeners have a remarkable capacity to adapt to severe distortions of the speech signal. Previous work indicates that perceptual learning of degraded speech reflects changes to sublexical representations, though the precise format of these representations has not yet been established. Inspired by the neurophysiology of auditory cortex, we hypothesized that perceptual learning involves changes to perceptual representations that are tuned to acoustic modulations of the speech signal. We systematically filtered speech to control modulation content during training and test blocks. Perceptual learning was highly specific to the modulation filter heard during training, consistent with the hypothesis that learning involves changes to representations of speech modulations. In further experiments, we used modulation filtering and different feedback regimes (clear speech vs. written feedback) to investigate the role of talker-specific cues for cross-talker generalization of learning. Our results suggest that learning partially generalizes to speech from novel (untrained) talkers but that talker-specific cues can enhance generalization. These findings are consistent with the proposal that perceptual learning entails the adjustment of internal models that map acoustic features to phonological categories. These models can be applied to degraded speech from novel talkers, particularly when listeners can account for talker-specific variability in the acoustic signal. (PsycInfo Database Record (c) 2025 APA, all rights reserved).</p>","PeriodicalId":50195,"journal":{"name":"Journal of Experimental Psychology-Human Perception and Performance","volume":" ","pages":"314-340"},"PeriodicalIF":2.1000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Perceptual learning of modulation filtered speech.\",\"authors\":\"James M Webb, Ediz Sohoglu\",\"doi\":\"10.1037/xhp0001274\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Human listeners have a remarkable capacity to adapt to severe distortions of the speech signal. Previous work indicates that perceptual learning of degraded speech reflects changes to sublexical representations, though the precise format of these representations has not yet been established. Inspired by the neurophysiology of auditory cortex, we hypothesized that perceptual learning involves changes to perceptual representations that are tuned to acoustic modulations of the speech signal. We systematically filtered speech to control modulation content during training and test blocks. Perceptual learning was highly specific to the modulation filter heard during training, consistent with the hypothesis that learning involves changes to representations of speech modulations. In further experiments, we used modulation filtering and different feedback regimes (clear speech vs. written feedback) to investigate the role of talker-specific cues for cross-talker generalization of learning. Our results suggest that learning partially generalizes to speech from novel (untrained) talkers but that talker-specific cues can enhance generalization. These findings are consistent with the proposal that perceptual learning entails the adjustment of internal models that map acoustic features to phonological categories. These models can be applied to degraded speech from novel talkers, particularly when listeners can account for talker-specific variability in the acoustic signal. (PsycInfo Database Record (c) 2025 APA, all rights reserved).</p>\",\"PeriodicalId\":50195,\"journal\":{\"name\":\"Journal of Experimental Psychology-Human Perception and Performance\",\"volume\":\" \",\"pages\":\"314-340\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Experimental Psychology-Human Perception and Performance\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://doi.org/10.1037/xhp0001274\",\"RegionNum\":3,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/23 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"PSYCHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental Psychology-Human Perception and Performance","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1037/xhp0001274","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/23 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"PSYCHOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
人类听众对语音信号的严重失真具有非凡的适应能力。先前的研究表明,退化语音的感知学习反映了亚词汇表征的变化,尽管这些表征的精确格式尚未确定。受听觉皮层神经生理学的启发,我们假设感知学习涉及到感知表征的变化,这些变化被语音信号的声学调制所调节。在训练和测试模块中,我们系统地过滤语音以控制调制内容。知觉学习对训练过程中听到的调制滤波器具有高度特异性,这与学习涉及语音调制表征变化的假设相一致。在进一步的实验中,我们使用调制滤波和不同的反馈机制(清晰语音与书面反馈)来研究说话者特定线索对相声者学习泛化的作用。我们的研究结果表明,从新的(未经训练的)说话者那里学习部分概括,但说话者特定的线索可以增强概括。这些发现与知觉学习需要调整将声学特征映射到语音类别的内部模型的建议是一致的。这些模型可以应用于新说话者的退化语音,特别是当听众可以解释声音信号中说话者特有的可变性时。(PsycInfo Database Record (c) 2025 APA,版权所有)。
Perceptual learning of modulation filtered speech.
Human listeners have a remarkable capacity to adapt to severe distortions of the speech signal. Previous work indicates that perceptual learning of degraded speech reflects changes to sublexical representations, though the precise format of these representations has not yet been established. Inspired by the neurophysiology of auditory cortex, we hypothesized that perceptual learning involves changes to perceptual representations that are tuned to acoustic modulations of the speech signal. We systematically filtered speech to control modulation content during training and test blocks. Perceptual learning was highly specific to the modulation filter heard during training, consistent with the hypothesis that learning involves changes to representations of speech modulations. In further experiments, we used modulation filtering and different feedback regimes (clear speech vs. written feedback) to investigate the role of talker-specific cues for cross-talker generalization of learning. Our results suggest that learning partially generalizes to speech from novel (untrained) talkers but that talker-specific cues can enhance generalization. These findings are consistent with the proposal that perceptual learning entails the adjustment of internal models that map acoustic features to phonological categories. These models can be applied to degraded speech from novel talkers, particularly when listeners can account for talker-specific variability in the acoustic signal. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
期刊介绍:
The Journal of Experimental Psychology: Human Perception and Performance publishes studies on perception, control of action, perceptual aspects of language processing, and related cognitive processes.