Input-dependent noise can explain magnitude-sensitivity in optimal value-based decision-making

IF 1.9 3区心理学 Q2 PSYCHOLOGY, MULTIDISCIPLINARY

Judgment and Decision Making Pub Date : 2021-09-01 DOI:10.1017/s1930297500008408

Angelo Pirrone, A. Reina, F. Gobet

{"title":"Input-dependent noise can explain magnitude-sensitivity in optimal value-based decision-making","authors":"Angelo Pirrone, A. Reina, F. Gobet","doi":"10.1017/s1930297500008408","DOIUrl":null,"url":null,"abstract":"Recent work has derived the optimal policy for two-alternative value-based decisions, in which decision-makers compare the subjective expected reward of two alternatives. Under specific task assumptions — such as linear utility, linear cost of time and constant processing noise — the optimal policy is implemented by a diffusion process in which parallel decision thresholds collapse over time as a function of prior knowledge about average reward across trials. This policy predicts that the decision dynamics of each trial are dominated by the difference in value between alternatives and are insensitive to the magnitude of the alternatives (i.e., their summed values). This prediction clashes with empirical evidence showing magnitude-sensitivity even in the case of equal alternatives, and with ecologically plausible accounts of decision making. Previous work has shown that relaxing assumptions about linear utility or linear time cost can give rise to optimal magnitude-sensitive policies. Here we question the assumption of constant processing noise, in favour of input-dependent noise. The neurally plausible assumption of input-dependent noise during evidence accumulation has received strong support from previous experimental and modelling work. We show that including input-dependent noise in the evidence accumulation process results in a magnitude-sensitive optimal policy for value-based decision-making, even in the case of a linear utility function and a linear cost of time, for both single (i.e., isolated) choices and sequences of choices in which decision-makers maximise reward rate. Compared to explanations that rely on non-linear utility functions and/or non-linear cost of time, our proposed account of magnitude-sensitive optimal decision-making provides a parsimonious explanation that bridges the gap between various task assumptions and between various types of decision making.","PeriodicalId":48045,"journal":{"name":"Judgment and Decision Making","volume":" ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Judgment and Decision Making","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1017/s1930297500008408","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 2

Abstract

Recent work has derived the optimal policy for two-alternative value-based decisions, in which decision-makers compare the subjective expected reward of two alternatives. Under specific task assumptions — such as linear utility, linear cost of time and constant processing noise — the optimal policy is implemented by a diffusion process in which parallel decision thresholds collapse over time as a function of prior knowledge about average reward across trials. This policy predicts that the decision dynamics of each trial are dominated by the difference in value between alternatives and are insensitive to the magnitude of the alternatives (i.e., their summed values). This prediction clashes with empirical evidence showing magnitude-sensitivity even in the case of equal alternatives, and with ecologically plausible accounts of decision making. Previous work has shown that relaxing assumptions about linear utility or linear time cost can give rise to optimal magnitude-sensitive policies. Here we question the assumption of constant processing noise, in favour of input-dependent noise. The neurally plausible assumption of input-dependent noise during evidence accumulation has received strong support from previous experimental and modelling work. We show that including input-dependent noise in the evidence accumulation process results in a magnitude-sensitive optimal policy for value-based decision-making, even in the case of a linear utility function and a linear cost of time, for both single (i.e., isolated) choices and sequences of choices in which decision-makers maximise reward rate. Compared to explanations that rely on non-linear utility functions and/or non-linear cost of time, our proposed account of magnitude-sensitive optimal decision-making provides a parsimonious explanation that bridges the gap between various task assumptions and between various types of decision making.

查看原文本刊更多论文

输入相关噪声可以解释基于最优值的决策中的幅度敏感性

最近的工作推导了两个基于价值的替代决策的最优策略，其中决策者比较了两个替代方案的主观预期回报。在特定的任务假设下，如线性效用、线性时间成本和恒定处理噪声，最优策略是通过扩散过程实现的，在扩散过程中，平行决策阈值随着时间的推移而崩溃，这是关于整个试验的平均报酬的先验知识的函数。该政策预测，每个试验的决策动态由备选方案之间的价值差异主导，并且对备选方案的大小（即其总值）不敏感。这一预测与实证证据相冲突，实证证据表明，即使在同等替代方案的情况下，也具有数量敏感性，并且与决策的生态合理性描述相冲突。先前的工作表明，放松对线性效用或线性时间成本的假设可以产生最优幅度敏感政策。在这里，我们质疑恒定处理噪声的假设，支持依赖于输入的噪声。在证据积累过程中，对输入相关噪声的神经上合理的假设得到了先前实验和建模工作的有力支持。我们表明，在证据积累过程中包括与输入相关的噪声，对于基于价值的决策，即使在线性效用函数和线性时间成本的情况下，对于单个（即孤立的）选择和决策者最大化回报率的选择序列，也会产生幅度敏感的最优策略。与依赖于非线性效用函数和/或非线性时间成本的解释相比，我们提出的对幅度敏感的最优决策的解释提供了一种简约的解释，弥补了各种任务假设之间和各种类型决策之间的差距。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊