David J Sanderson, Joseph M Austen, Anthony McGregor, Jasmin A Strickland
{"title":"Probability and rate of reinforcement in negative prediction error learning.","authors":"David J Sanderson, Joseph M Austen, Anthony McGregor, Jasmin A Strickland","doi":"10.1037/xan0000396","DOIUrl":null,"url":null,"abstract":"<p><p>Trial-based theories of associative learning propose that learning is sensitive to the probability of reinforcement signaled by a conditioned stimulus (CS). Learning, however, is often sensitive to reinforcement rate rather than probability of reinforcement per trial, suggesting that temporal properties of cues may be more important than trial-based properties. In four experiments, the role of probability of reinforcement per trial was examined in appetitive Pavlovian conditioning in mice under conditions in which reinforcement rate was controlled. Experiments 1 and 2 examined the loss of conditioned responding caused by overexpectation of reinforcement. The probability of reinforcement per trial failed to affect acquisition and summation of conditioned responding and failed to affect overexpectation. It also failed to affect extinction of conditioned responding in Experiments 3 and 4. Experiments 2-4 contained nonreinforced trials in which responding at the offset of the CS could be measured. These probe trials did reveal an effect of probability of reinforcement per trial. Cues associated with 100% reinforcement elicited greater post-CS responding than cues associated with 50% reinforcement. The effect was also evident in summation trials (in Experiment 2) in which two 100% or 50% reinforced cues were presented in compound. The results show that mice learn about rate and probability information, but reinforcement rate determines anticipatory responding during the CS. The probability of reinforcement determines responding at the expected time of reinforcement. Thus, learning occurs continuously over the duration of experience and per episode of experience independent of duration. (PsycInfo Database Record (c) 2025 APA, all rights reserved).</p>","PeriodicalId":54259,"journal":{"name":"Journal of Experimental Psychology-Animal Learning and Cognition","volume":" ","pages":"126-146"},"PeriodicalIF":0.9000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12224701/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental Psychology-Animal Learning and Cognition","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1037/xan0000396","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/12 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Trial-based theories of associative learning propose that learning is sensitive to the probability of reinforcement signaled by a conditioned stimulus (CS). Learning, however, is often sensitive to reinforcement rate rather than probability of reinforcement per trial, suggesting that temporal properties of cues may be more important than trial-based properties. In four experiments, the role of probability of reinforcement per trial was examined in appetitive Pavlovian conditioning in mice under conditions in which reinforcement rate was controlled. Experiments 1 and 2 examined the loss of conditioned responding caused by overexpectation of reinforcement. The probability of reinforcement per trial failed to affect acquisition and summation of conditioned responding and failed to affect overexpectation. It also failed to affect extinction of conditioned responding in Experiments 3 and 4. Experiments 2-4 contained nonreinforced trials in which responding at the offset of the CS could be measured. These probe trials did reveal an effect of probability of reinforcement per trial. Cues associated with 100% reinforcement elicited greater post-CS responding than cues associated with 50% reinforcement. The effect was also evident in summation trials (in Experiment 2) in which two 100% or 50% reinforced cues were presented in compound. The results show that mice learn about rate and probability information, but reinforcement rate determines anticipatory responding during the CS. The probability of reinforcement determines responding at the expected time of reinforcement. Thus, learning occurs continuously over the duration of experience and per episode of experience independent of duration. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
期刊介绍:
The Journal of Experimental Psychology: Animal Learning and Cognition publishes experimental and theoretical studies concerning all aspects of animal behavior processes.