Influence of reinforcement and its omission on trial-by-trial changes of response bias in perceptual decision making

IF 1.4 3区心理学 Q4 BEHAVIORAL SCIENCES

Journal of the experimental analysis of behavior Pub Date : 2024-03-01 DOI:10.1002/jeab.908

Maik C. Stüttgen, Andrea Dietl, Vanya V. Stoilova Eckert, Luis de la Cuesta-Ferrer, Jan-Hendrik Blanke, Christina Koß, Frank Jäkel

{"title":"Influence of reinforcement and its omission on trial-by-trial changes of response bias in perceptual decision making","authors":"Maik C. Stüttgen, Andrea Dietl, Vanya V. Stoilova Eckert, Luis de la Cuesta-Ferrer, Jan-Hendrik Blanke, Christina Koß, Frank Jäkel","doi":"10.1002/jeab.908","DOIUrl":null,"url":null,"abstract":"Discrimination performance in perceptual choice tasks is known to reflect both sensory discriminability and nonsensory response bias. In the framework of signal detection theory, these aspects of discrimination performance are quantified through separate measures, sensitivity (d′) for sensory discriminability and decision criterion (c) for response bias. However, it is unknown how response bias (i.e., criterion) changes at the single-trial level as a consequence of reinforcement history. We subjected rats to a two-stimulus two-response conditional discrimination task with auditory stimuli and induced response bias through unequal reinforcement probabilities for the two responses. We compared three signal-detection-theory-based criterion learning models with respect to their ability to fit experimentally observed fluctuations of response bias on a trial-by-trial level. These models shift the criterion by a fixed step (1) after each reinforced response or (2) after each nonreinforced response or (3) after both. We find that all three models fail to capture essential aspects of the data. Prompted by the observation that steady-state criterion values conformed well to a behavioral model of signal detection based on the generalized matching law, we constructed a trial-based version of this model and find that it provides a superior account of response bias fluctuations under changing reinforcement contingencies.","PeriodicalId":17411,"journal":{"name":"Journal of the experimental analysis of behavior","volume":"121 3","pages":"294-313"},"PeriodicalIF":1.4000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jeab.908","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the experimental analysis of behavior","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jeab.908","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Discrimination performance in perceptual choice tasks is known to reflect both sensory discriminability and nonsensory response bias. In the framework of signal detection theory, these aspects of discrimination performance are quantified through separate measures, sensitivity (d′) for sensory discriminability and decision criterion (c) for response bias. However, it is unknown how response bias (i.e., criterion) changes at the single-trial level as a consequence of reinforcement history. We subjected rats to a two-stimulus two-response conditional discrimination task with auditory stimuli and induced response bias through unequal reinforcement probabilities for the two responses. We compared three signal-detection-theory-based criterion learning models with respect to their ability to fit experimentally observed fluctuations of response bias on a trial-by-trial level. These models shift the criterion by a fixed step (1) after each reinforced response or (2) after each nonreinforced response or (3) after both. We find that all three models fail to capture essential aspects of the data. Prompted by the observation that steady-state criterion values conformed well to a behavioral model of signal detection based on the generalized matching law, we constructed a trial-based version of this model and find that it provides a superior account of response bias fluctuations under changing reinforcement contingencies.

Abstract Image

查看原文本刊更多论文

强化和省略对知觉决策中逐次试验反应偏差变化的影响

众所周知，知觉选择任务中的辨别能力既反映了感官辨别能力，也反映了非感官反应偏差。在信号检测理论的框架下，分辨成绩的这两个方面是通过单独的测量来量化的，即感官可分辨性的灵敏度（d'）和反应偏差的决策标准（c）。然而，在单次试验水平上，反应偏差（即标准）是如何随强化历史而变化的，目前还不得而知。我们用听觉刺激对大鼠进行双刺激双反应条件辨别任务，并通过两种反应的不等强化概率诱导反应偏差。我们比较了三种基于信号检测理论的标准学习模型，看它们是否能够适应实验观察到的逐次试验水平的反应偏差波动。这些模型通过一个固定的步骤（1）在每次强化反应之后或（2）在每次非强化反应之后或（3）在两者之后移动标准。我们发现这三种模型都无法捕捉到数据的基本方面。在观察到稳态标准值与基于广义匹配定律的信号检测行为模型十分吻合后，我们构建了该模型的试验版本，并发现它能更好地解释强化条件变化下的反应偏差波动。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of the experimental analysis of behavior 医学-行为科学

CiteScore

3.90

自引率

14.80%

发文量

审稿时长

>12 weeks

期刊介绍： Journal of the Experimental Analysis of Behavior is primarily for the original publication of experiments relevant to the behavior of individual organisms.