非平稳随机过程非凸动态规划中值函数的次微分

Q2 Mathematics

Communications on Stochastic Analysis Pub Date : 2019-09-01 DOI:10.31390/COSA.13.3.05

B. Mordukhovich, Nobusumi Sagara

{"title":"非平稳随机过程非凸动态规划中值函数的次微分","authors":"B. Mordukhovich, Nobusumi Sagara","doi":"10.31390/COSA.13.3.05","DOIUrl":null,"url":null,"abstract":"The main goal of this paper is to apply the machinery of variational analysis and generalized differentiation to study infinite horizon stochastic dynamic programming (DP) with discrete time in the Banach space setting without convexity assumptions. Unlike to standard stochastic DP with stationary Markov processes, we investigate here stochastic DP in $L^p$ spaces to deal with nonstationary stochastic processes, which describe a more flexible learning procedure for the decision-maker. Our main concern is to calculate generalized subgradients of the corresponding value function and to derive necessary conditions for optimality in terms of the stochastic Euler inclusion under appropriate Lipschitzian assumptions. The usage of the subdifferential formula for integral functionals on $L^p$ spaces allows us, in particular, to find verifiable conditions to ensure smoothness of the value function without any convexity and/or interiority assumptions.","PeriodicalId":53434,"journal":{"name":"Communications on Stochastic Analysis","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Subdifferentials of Value Functions in Nonconvex Dynamic Programming for Nonstationary Stochastic Processes\",\"authors\":\"B. Mordukhovich, Nobusumi Sagara\",\"doi\":\"10.31390/COSA.13.3.05\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The main goal of this paper is to apply the machinery of variational analysis and generalized differentiation to study infinite horizon stochastic dynamic programming (DP) with discrete time in the Banach space setting without convexity assumptions. Unlike to standard stochastic DP with stationary Markov processes, we investigate here stochastic DP in $L^p$ spaces to deal with nonstationary stochastic processes, which describe a more flexible learning procedure for the decision-maker. Our main concern is to calculate generalized subgradients of the corresponding value function and to derive necessary conditions for optimality in terms of the stochastic Euler inclusion under appropriate Lipschitzian assumptions. The usage of the subdifferential formula for integral functionals on $L^p$ spaces allows us, in particular, to find verifiable conditions to ensure smoothness of the value function without any convexity and/or interiority assumptions.\",\"PeriodicalId\":53434,\"journal\":{\"name\":\"Communications on Stochastic Analysis\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications on Stochastic Analysis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31390/COSA.13.3.05\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications on Stochastic Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31390/COSA.13.3.05","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 4

摘要

本文的主要目的是应用变分分析和广义微分机制，在没有凸性假设的Banach空间环境中研究具有离散时间的无限时域随机动态规划。与具有平稳马尔可夫过程的标准随机DP不同，我们研究了$L^p$空间中的随机DP来处理非平稳随机过程，这为决策者描述了一种更灵活的学习过程。我们主要关注的是计算相应值函数的广义次梯度，并在适当的Lipschitzian假设下，根据随机Euler包含导出最优性的必要条件。在$L^p$空间上使用积分泛函的次微分公式，特别允许我们找到可验证的条件，以确保值函数的光滑性，而不存在任何凸性和/或内在性假设。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Subdifferentials of Value Functions in Nonconvex Dynamic Programming for Nonstationary Stochastic Processes

The main goal of this paper is to apply the machinery of variational analysis and generalized differentiation to study infinite horizon stochastic dynamic programming (DP) with discrete time in the Banach space setting without convexity assumptions. Unlike to standard stochastic DP with stationary Markov processes, we investigate here stochastic DP in $L^p$ spaces to deal with nonstationary stochastic processes, which describe a more flexible learning procedure for the decision-maker. Our main concern is to calculate generalized subgradients of the corresponding value function and to derive necessary conditions for optimality in terms of the stochastic Euler inclusion under appropriate Lipschitzian assumptions. The usage of the subdifferential formula for integral functionals on $L^p$ spaces allows us, in particular, to find verifiable conditions to ensure smoothness of the value function without any convexity and/or interiority assumptions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Communications on Stochastic Analysis Mathematics-Statistics and Probability

CiteScore

2.40

自引率

0.00%

发文量

期刊介绍： The journal Communications on Stochastic Analysis (COSA) is published in four issues annually (March, June, September, December). It aims to present original research papers of high quality in stochastic analysis (both theory and applications) and emphasizes the global development of the scientific community. The journal welcomes articles of interdisciplinary nature. Expository articles of current interest will occasionally be published. COSAis indexed in Mathematical Reviews (MathSciNet), Zentralblatt für Mathematik, and SCOPUS