选择前的最佳学习

T. Ke, J. M. Villas-Boas
{"title":"选择前的最佳学习","authors":"T. Ke, J. M. Villas-Boas","doi":"10.2139/ssrn.2844417","DOIUrl":null,"url":null,"abstract":"A Bayesian decision maker is choosing among two alternatives with uncertain payoffs and an outside option with known payoff. Before deciding which alternative to adopt, the decision maker can purchase sequentially multiple informative signals on each of the two alternatives. To maximize the expected payoff, the decision maker solves the problem of optimal dynamic allocation of learning efforts as well as optimal stopping of the learning process. We show that the decision maker considers an alternative for learning or adoption if and only if the expected payoff of the alternative is above a threshold. Given both alternatives in the decision maker's consideration set, we find that if the outside option is weak and the decision maker's beliefs about both alternatives are relatively low, it is optimal for the decision maker to learn information from the alternative that has a lower expected payoff and less uncertainty, given all other characteristics of the two alternatives being the same. If the decision maker subsequently receives enough positive informative signals, the decision maker will switch to learning the better alternative; otherwise the decision maker will rule out this alternative from consideration and adopt the currently more preferred alternative. We find that this strategy works because it minimizes the decision maker's learning efforts. We also characterize the optimal learning policy when the outside option is relatively high, and discuss several extensions.","PeriodicalId":376757,"journal":{"name":"Decision-Making in Operations Research eJournal","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"55","resultStr":"{\"title\":\"Optimal Learning Before Choice\",\"authors\":\"T. Ke, J. M. Villas-Boas\",\"doi\":\"10.2139/ssrn.2844417\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A Bayesian decision maker is choosing among two alternatives with uncertain payoffs and an outside option with known payoff. Before deciding which alternative to adopt, the decision maker can purchase sequentially multiple informative signals on each of the two alternatives. To maximize the expected payoff, the decision maker solves the problem of optimal dynamic allocation of learning efforts as well as optimal stopping of the learning process. We show that the decision maker considers an alternative for learning or adoption if and only if the expected payoff of the alternative is above a threshold. Given both alternatives in the decision maker's consideration set, we find that if the outside option is weak and the decision maker's beliefs about both alternatives are relatively low, it is optimal for the decision maker to learn information from the alternative that has a lower expected payoff and less uncertainty, given all other characteristics of the two alternatives being the same. If the decision maker subsequently receives enough positive informative signals, the decision maker will switch to learning the better alternative; otherwise the decision maker will rule out this alternative from consideration and adopt the currently more preferred alternative. We find that this strategy works because it minimizes the decision maker's learning efforts. We also characterize the optimal learning policy when the outside option is relatively high, and discuss several extensions.\",\"PeriodicalId\":376757,\"journal\":{\"name\":\"Decision-Making in Operations Research eJournal\",\"volume\":\"68 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"55\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Decision-Making in Operations Research eJournal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.2844417\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision-Making in Operations Research eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.2844417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 55

摘要

贝叶斯决策者在两个收益不确定的选项和一个收益已知的外部选项中进行选择。在决定采用哪个备选方案之前,决策者可以在两个备选方案中依次购买多个信息信号。为了使期望收益最大化,决策者解决了学习努力的最优动态分配和学习过程的最优停止问题。我们表明,当且仅当替代方案的预期收益高于阈值时,决策者才会考虑学习或采用替代方案。给定决策者的考虑集中的两个选择,我们发现如果外部选择较弱且决策者对两个选择的信念都相对较低,在两个选择的所有其他特征相同的情况下,决策者从期望收益较低且不确定性较小的选择中学习信息是最优的。如果决策者随后收到足够的积极信息信号,决策者将转向学习更好的选择;否则,决策者将从考虑中排除这一选择,而采用目前更受欢迎的选择。我们发现这个策略是有效的,因为它最小化了决策者的学习努力。我们还描述了当外部选项相对较高时的最优学习策略,并讨论了几个扩展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Optimal Learning Before Choice
A Bayesian decision maker is choosing among two alternatives with uncertain payoffs and an outside option with known payoff. Before deciding which alternative to adopt, the decision maker can purchase sequentially multiple informative signals on each of the two alternatives. To maximize the expected payoff, the decision maker solves the problem of optimal dynamic allocation of learning efforts as well as optimal stopping of the learning process. We show that the decision maker considers an alternative for learning or adoption if and only if the expected payoff of the alternative is above a threshold. Given both alternatives in the decision maker's consideration set, we find that if the outside option is weak and the decision maker's beliefs about both alternatives are relatively low, it is optimal for the decision maker to learn information from the alternative that has a lower expected payoff and less uncertainty, given all other characteristics of the two alternatives being the same. If the decision maker subsequently receives enough positive informative signals, the decision maker will switch to learning the better alternative; otherwise the decision maker will rule out this alternative from consideration and adopt the currently more preferred alternative. We find that this strategy works because it minimizes the decision maker's learning efforts. We also characterize the optimal learning policy when the outside option is relatively high, and discuss several extensions.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信