Exploring Markov Decision Processes: A Comprehensive Survey of Optimization Applications and Techniques

IgMin Research Pub Date : 2024-07-04 DOI:10.61927/igmin210

Khan Qazi Waqas

引用次数: 0

Abstract

Markov decision process is a dynamic programming algorithm that can be used to solve an optimization problem. It was used in applications like robotics, radar tracking, medical treatments, and decision-making. In the existing literature, the researcher only targets a few applications area of MDP. However, this work surveyed the Markov decision process’s application in various regions for solving optimization problems. In a survey, we compared optimization techniques based on MDP. We performed a comparative analysis of past work of other researchers in the last few years based on a few parameters. These parameters are focused on the proposed problem, the proposed methodology for solving an optimization problem, and the results and outcomes of the optimization technique in solving a specific problem. Reinforcement learning is an emerging machine learning domain based on the Markov decision process. In this work, we conclude that the MDP-based approach is most widely used when deciding on the current state in some environments to move to the next state.

查看原文本刊更多论文

探索马尔可夫决策过程：优化应用与技术综合概览

马尔可夫决策过程是一种动态编程算法，可用于解决优化问题。它被应用于机器人、雷达跟踪、医疗和决策等领域。在现有文献中，研究人员只针对 MDP 的几个应用领域进行了研究。然而，这项工作调查了马尔可夫决策过程在解决优化问题的各个领域中的应用。在调查中，我们比较了基于 MDP 的优化技术。我们根据几个参数对其他研究人员过去几年的工作进行了比较分析。这些参数主要集中在提出的问题、解决优化问题的方法以及优化技术解决特定问题的结果和成果。强化学习是基于马尔可夫决策过程的新兴机器学习领域。在这项工作中，我们得出结论，在某些环境中决定当前状态以进入下一状态时，基于马尔可夫决策过程的方法得到了最广泛的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IgMin Research

自引率

0.00%

发文量