On fuzzy decision processes with discounted fuzzy rewards

Proceedings of 3rd International Symposium on Uncertainty Modeling and Analysis and Annual Conference of the North American Fuzzy Information Processing Society Pub Date : 1995-03-17 DOI:10.1109/ISUMA.1995.527705

Y. Yoshida

引用次数: 3

Abstract

Deals with a multi-stage decision process with fuzzy transitions, which is termed a 'fuzzy decision process'. We consider the fuzzy decision process, where both states and actions are assumed to be fuzzy, from the point of view of a dynamic fuzzy system which has been developed by the authors. The discounted total reward is described by a fuzzy number on a closed bounded interval. A partial order of convex fuzzy numbers, which is called a 'fuzzy max order', is used to discuss the optimization problem. We characterize the discounted total reward associated with an admissible stationary policy by a unique fixed point of the contractive mapping. Further, we estimate the fuzzy rewards by introducing a fuzzy expectation generated by a fuzzy goal.

查看原文本刊更多论文

模糊奖励折现的模糊决策过程

研究具有模糊过渡的多阶段决策过程，称为“模糊决策过程”。本文从作者提出的动态模糊系统的角度考虑状态和动作都是模糊的模糊决策过程。折现总奖励用有界封闭区间上的模糊数来描述。利用凸模糊数的一种偏阶，即模糊最大阶来讨论最优化问题。我们用收缩映射的唯一不动点来刻画与一个允许的平稳策略相关的贴现总奖励。进一步，我们通过引入由模糊目标产生的模糊期望来估计模糊奖励。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of 3rd International Symposium on Uncertainty Modeling and Analysis and Annual Conference of the North American Fuzzy Information Processing Society

自引率

0.00%

发文量