Probability and rate of reinforcement in negative prediction error learning.

IF 0.9 4区心理学 Q4 BEHAVIORAL SCIENCES

Journal of Experimental Psychology-Animal Learning and Cognition Pub Date : 2025-07-01 Epub Date: 2025-05-12 DOI:10.1037/xan0000396

David J Sanderson, Joseph M Austen, Anthony McGregor, Jasmin A Strickland

{"title":"Probability and rate of reinforcement in negative prediction error learning.","authors":"David J Sanderson, Joseph M Austen, Anthony McGregor, Jasmin A Strickland","doi":"10.1037/xan0000396","DOIUrl":null,"url":null,"abstract":"<p><p>Trial-based theories of associative learning propose that learning is sensitive to the probability of reinforcement signaled by a conditioned stimulus (CS). Learning, however, is often sensitive to reinforcement rate rather than probability of reinforcement per trial, suggesting that temporal properties of cues may be more important than trial-based properties. In four experiments, the role of probability of reinforcement per trial was examined in appetitive Pavlovian conditioning in mice under conditions in which reinforcement rate was controlled. Experiments 1 and 2 examined the loss of conditioned responding caused by overexpectation of reinforcement. The probability of reinforcement per trial failed to affect acquisition and summation of conditioned responding and failed to affect overexpectation. It also failed to affect extinction of conditioned responding in Experiments 3 and 4. Experiments 2-4 contained nonreinforced trials in which responding at the offset of the CS could be measured. These probe trials did reveal an effect of probability of reinforcement per trial. Cues associated with 100% reinforcement elicited greater post-CS responding than cues associated with 50% reinforcement. The effect was also evident in summation trials (in Experiment 2) in which two 100% or 50% reinforced cues were presented in compound. The results show that mice learn about rate and probability information, but reinforcement rate determines anticipatory responding during the CS. The probability of reinforcement determines responding at the expected time of reinforcement. Thus, learning occurs continuously over the duration of experience and per episode of experience independent of duration. (PsycInfo Database Record (c) 2025 APA, all rights reserved).</p>","PeriodicalId":54259,"journal":{"name":"Journal of Experimental Psychology-Animal Learning and Cognition","volume":" ","pages":"126-146"},"PeriodicalIF":0.9000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12224701/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental Psychology-Animal Learning and Cognition","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1037/xan0000396","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/12 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Trial-based theories of associative learning propose that learning is sensitive to the probability of reinforcement signaled by a conditioned stimulus (CS). Learning, however, is often sensitive to reinforcement rate rather than probability of reinforcement per trial, suggesting that temporal properties of cues may be more important than trial-based properties. In four experiments, the role of probability of reinforcement per trial was examined in appetitive Pavlovian conditioning in mice under conditions in which reinforcement rate was controlled. Experiments 1 and 2 examined the loss of conditioned responding caused by overexpectation of reinforcement. The probability of reinforcement per trial failed to affect acquisition and summation of conditioned responding and failed to affect overexpectation. It also failed to affect extinction of conditioned responding in Experiments 3 and 4. Experiments 2-4 contained nonreinforced trials in which responding at the offset of the CS could be measured. These probe trials did reveal an effect of probability of reinforcement per trial. Cues associated with 100% reinforcement elicited greater post-CS responding than cues associated with 50% reinforcement. The effect was also evident in summation trials (in Experiment 2) in which two 100% or 50% reinforced cues were presented in compound. The results show that mice learn about rate and probability information, but reinforcement rate determines anticipatory responding during the CS. The probability of reinforcement determines responding at the expected time of reinforcement. Thus, learning occurs continuously over the duration of experience and per episode of experience independent of duration. (PsycInfo Database Record (c) 2025 APA, all rights reserved).

Abstract Image

查看原文本刊更多论文

负预测误差学习中的强化概率和强化率。

基于实验的联想学习理论提出，学习对条件刺激（CS）发出的强化信号的概率敏感。然而，学习通常对强化率而不是每次试验的强化概率敏感，这表明线索的时间特性可能比基于试验的特性更重要。在四个实验中，研究了每次强化概率在控制强化率条件下小鼠食欲巴甫洛夫条件反射中的作用。实验1和实验2考察了过度期望强化导致的条件反应丧失。每次强化的概率对条件反应的习得和求和没有影响，对过度期望没有影响。在实验3和实验4中，它也没有影响条件反应的消退。实验2-4包含非强化试验，其中可以测量CS偏移处的响应。这些探针试验确实揭示了每次试验强化概率的影响。与50%强化相关的线索相比，100%强化相关的线索引发了更大的后cs反应。在综合实验（实验2）中，两个100%或50%强化的线索以复合形式呈现，这种效应也很明显。结果表明，小鼠学习了速率和概率信息，但强化率决定了预期反应。强化的概率决定了在预期的强化时间响应。因此，学习在经历的持续时间内不断发生，每一段经历与持续时间无关。（PsycInfo Database Record (c) 2025 APA，版权所有）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Experimental Psychology-Animal Learning and Cognition Psychology-Experimental and Cognitive Psychology

CiteScore

2.90

自引率

23.10%

发文量

期刊介绍： The Journal of Experimental Psychology: Animal Learning and Cognition publishes experimental and theoretical studies concerning all aspects of animal behavior processes.