Vector-valued robust stochastic control

Igor Cialenco, Gabriela Kováčová
{"title":"Vector-valued robust stochastic control","authors":"Igor Cialenco, Gabriela Kováčová","doi":"arxiv-2407.00266","DOIUrl":null,"url":null,"abstract":"We study a dynamic stochastic control problem subject to Knightian\nuncertainty with multi-objective (vector-valued) criteria. Assuming the\npreferences across expected multi-loss vectors are represented by a given, yet\ngeneral, preorder, we address the model uncertainty by adopting a robust or\nminimax perspective, minimizing expected loss across the worst-case model. For\nloss functions taking real (or scalar) values, there is no ambiguity in\ninterpreting supremum and infimum. In contrast to the scalar case, major\nchallenges for multi-loss control problems include properly defining and\ninterpreting the notions of supremum and infimum, and addressing the\nnon-uniqueness of these suprema and infima. To deal with these, we employ the\nnotion of an ideal point vector-valued supremum for the robust part of the\nproblem, while we view the control part as a multi-objective (or vector)\noptimization problem. Using a set-valued framework, we derive both a weak and\nstrong version of the dynamic programming principle (DPP) or Bellman equations\nby taking the value function as the collection of all worst expected losses\nacross all feasible actions. The weak version of Bellman's principle is proved\nunder minimal assumptions. To establish a stronger version of DPP, we introduce\nthe rectangularity property with respect to a general preorder. We also further\nstudy a particular, but important, case of component-wise partial order of\nvectors, for which we additionally derive DPP under a different set-valued\nnotion for the value function, the so-called upper image of the multi-objective\nproblem. Finally, we provide illustrative examples motivated by financial\nproblems. These results will serve as a foundation for addressing time-inconsistent\nproblems subject to model uncertainty through the lens of a set-valued\nframework, as well as for studying multi-portfolio allocation problems under\nmodel uncertainty.","PeriodicalId":501045,"journal":{"name":"arXiv - QuantFin - Portfolio Management","volume":"81 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Portfolio Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.00266","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We study a dynamic stochastic control problem subject to Knightian uncertainty with multi-objective (vector-valued) criteria. Assuming the preferences across expected multi-loss vectors are represented by a given, yet general, preorder, we address the model uncertainty by adopting a robust or minimax perspective, minimizing expected loss across the worst-case model. For loss functions taking real (or scalar) values, there is no ambiguity in interpreting supremum and infimum. In contrast to the scalar case, major challenges for multi-loss control problems include properly defining and interpreting the notions of supremum and infimum, and addressing the non-uniqueness of these suprema and infima. To deal with these, we employ the notion of an ideal point vector-valued supremum for the robust part of the problem, while we view the control part as a multi-objective (or vector) optimization problem. Using a set-valued framework, we derive both a weak and strong version of the dynamic programming principle (DPP) or Bellman equations by taking the value function as the collection of all worst expected losses across all feasible actions. The weak version of Bellman's principle is proved under minimal assumptions. To establish a stronger version of DPP, we introduce the rectangularity property with respect to a general preorder. We also further study a particular, but important, case of component-wise partial order of vectors, for which we additionally derive DPP under a different set-valued notion for the value function, the so-called upper image of the multi-objective problem. Finally, we provide illustrative examples motivated by financial problems. These results will serve as a foundation for addressing time-inconsistent problems subject to model uncertainty through the lens of a set-valued framework, as well as for studying multi-portfolio allocation problems under model uncertainty.
矢量值稳健随机控制
我们研究的是一个动态随机控制问题,它受奈特不确定性和多目标(向量值)标准的制约。假设对预期多损失向量的偏好由一个给定但通用的前序表示,我们采用稳健或最小最大的视角来解决模型的不确定性,最小化最坏情况下模型的预期损失。对于取实值(或标量)的损失函数,在解释上确值和下确值时没有歧义。与标量情况相反,多重损失控制问题的主要挑战包括正确定义和解释上确值和下确值的概念,以及解决这些上确值和下确值的非唯一性问题。为了解决这些问题,我们对问题的鲁棒性部分采用了理想点矢量值上界的运动,而将控制部分视为多目标(或矢量)优化问题。利用集值框架,我们将价值函数视为所有可行行动的最坏预期损失的集合,从而推导出动态编程原理(DPP)或贝尔曼方程的弱版和强版。弱版贝尔曼原理是在最小的假设条件下证明的。为了建立更强版本的 DPP,我们引入了关于一般预序的矩形性属性。我们还进一步研究了一种特殊但重要的情况,即向量的分量偏序,并在不同的值函数集值概念下推导出了 DPP,即所谓的多目标问题的上图。最后,我们提供了以金融问题为动机的示例。这些结果将为通过集值框架的视角解决模型不确定性下的时间不一致问题,以及研究模型不确定性下的多投资组合分配问题奠定基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信