Influence of reinforcement and its omission on trial-by-trial changes of response bias in perceptual decision making

IF 1.4 3区 心理学 Q4 BEHAVIORAL SCIENCES
Maik C. Stüttgen, Andrea Dietl, Vanya V. Stoilova Eckert, Luis de la Cuesta-Ferrer, Jan-Hendrik Blanke, Christina Koß, Frank Jäkel
{"title":"Influence of reinforcement and its omission on trial-by-trial changes of response bias in perceptual decision making","authors":"Maik C. Stüttgen,&nbsp;Andrea Dietl,&nbsp;Vanya V. Stoilova Eckert,&nbsp;Luis de la Cuesta-Ferrer,&nbsp;Jan-Hendrik Blanke,&nbsp;Christina Koß,&nbsp;Frank Jäkel","doi":"10.1002/jeab.908","DOIUrl":null,"url":null,"abstract":"<p>Discrimination performance in perceptual choice tasks is known to reflect both sensory discriminability and nonsensory response bias. In the framework of signal detection theory, these aspects of discrimination performance are quantified through separate measures, sensitivity (<i>d′</i>) for sensory discriminability and decision criterion (<i>c</i>) for response bias. However, it is unknown how response bias (i.e., criterion) changes at the single-trial level as a consequence of reinforcement history. We subjected rats to a two-stimulus two-response conditional discrimination task with auditory stimuli and induced response bias through unequal reinforcement probabilities for the two responses. We compared three signal-detection-theory-based criterion learning models with respect to their ability to fit experimentally observed fluctuations of response bias on a trial-by-trial level. These models shift the criterion by a fixed step (1) after each reinforced response or (2) after each nonreinforced response or (3) after both. We find that all three models fail to capture essential aspects of the data. Prompted by the observation that steady-state criterion values conformed well to a behavioral model of signal detection based on the generalized matching law, we constructed a trial-based version of this model and find that it provides a superior account of response bias fluctuations under changing reinforcement contingencies.</p>","PeriodicalId":17411,"journal":{"name":"Journal of the experimental analysis of behavior","volume":"121 3","pages":"294-313"},"PeriodicalIF":1.4000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jeab.908","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the experimental analysis of behavior","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jeab.908","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Discrimination performance in perceptual choice tasks is known to reflect both sensory discriminability and nonsensory response bias. In the framework of signal detection theory, these aspects of discrimination performance are quantified through separate measures, sensitivity (d′) for sensory discriminability and decision criterion (c) for response bias. However, it is unknown how response bias (i.e., criterion) changes at the single-trial level as a consequence of reinforcement history. We subjected rats to a two-stimulus two-response conditional discrimination task with auditory stimuli and induced response bias through unequal reinforcement probabilities for the two responses. We compared three signal-detection-theory-based criterion learning models with respect to their ability to fit experimentally observed fluctuations of response bias on a trial-by-trial level. These models shift the criterion by a fixed step (1) after each reinforced response or (2) after each nonreinforced response or (3) after both. We find that all three models fail to capture essential aspects of the data. Prompted by the observation that steady-state criterion values conformed well to a behavioral model of signal detection based on the generalized matching law, we constructed a trial-based version of this model and find that it provides a superior account of response bias fluctuations under changing reinforcement contingencies.

Abstract Image

强化和省略对知觉决策中逐次试验反应偏差变化的影响
众所周知,知觉选择任务中的辨别能力既反映了感官辨别能力,也反映了非感官反应偏差。在信号检测理论的框架下,分辨成绩的这两个方面是通过单独的测量来量化的,即感官可分辨性的灵敏度(d')和反应偏差的决策标准(c)。然而,在单次试验水平上,反应偏差(即标准)是如何随强化历史而变化的,目前还不得而知。我们用听觉刺激对大鼠进行双刺激双反应条件辨别任务,并通过两种反应的不等强化概率诱导反应偏差。我们比较了三种基于信号检测理论的标准学习模型,看它们是否能够适应实验观察到的逐次试验水平的反应偏差波动。这些模型通过一个固定的步骤(1)在每次强化反应之后或(2)在每次非强化反应之后或(3)在两者之后移动标准。我们发现这三种模型都无法捕捉到数据的基本方面。在观察到稳态标准值与基于广义匹配定律的信号检测行为模型十分吻合后,我们构建了该模型的试验版本,并发现它能更好地解释强化条件变化下的反应偏差波动。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.90
自引率
14.80%
发文量
83
审稿时长
>12 weeks
期刊介绍: Journal of the Experimental Analysis of Behavior is primarily for the original publication of experiments relevant to the behavior of individual organisms.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信