Inverse Reinforcement Learning Approach for Elicitation of Preferences in Multi-objective Sequential Optimization

2018 IEEE International Conference on Agents (ICA) Pub Date : 2018-07-01 DOI:10.1109/AGENTS.2018.8460075

A. Ikenaga, S. Arai

引用次数: 8

Abstract

It is crucial to know which criterion should be focused on, in a multi-objective decision making context, to select the best alternative from the multiple Pareto optimal solutions. However, in general, it is hard for the decision maker to express his/her own preference order for each criterion. In this study, we propose a preference elicitation method to estimate relative importance in terms of weights for each criterion by observing his/her processes of decision making. This method would make expert's preference elicited, and contribute at an important decision making point, such as urban planning,

查看原文本刊更多论文

多目标序列优化中偏好激发的逆强化学习方法

在多目标决策环境下，从多个帕累托最优解中选择最佳方案时，知道应该关注哪个标准是至关重要的。然而，一般来说，决策者很难表达自己对每个标准的偏好顺序。在这项研究中，我们提出了一种偏好启发方法，通过观察他/她的决策过程来估计每个标准的权重相对重要性。这种方法可以引出专家的偏好，并有助于重要的决策点，如城市规划。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE International Conference on Agents (ICA)

自引率

0.00%

发文量