Proximal Policy Optimization for Explainable Recommended Systems

2022 4th International Conference on Data-driven Optimization of Complex Systems (DOCS) Pub Date : 2022-10-28 DOI:10.1109/DOCS55193.2022.9967709

Qian Feng, Geyang Xiao, Yuan Liang, Huifeng Zhang, Linlin Yan, Xiaoyu Yi

引用次数: 0

Abstract

In this paper, in order to intuitively improve the interpretability of the recommendation, we make full use of the knowledge graph and provide an ‘explicit’ recommendation path for the recommended result. Specifically, we embeds the entities and relations in the knowledge graph according to consumers’ behavior. Then a knowledge reasoning method based on reinforcement learning algorithm is proposed, which aims to find a reasonable path after two entities in the knowledge graph are given. The main contributions of this paper are as follows: The current State-of-the-art method in reinforcement learning, PPO is used for improvement. The soft-reward function after adjustment behaves better for the current problem while training. And a Q-value based path exploration method is innovated for testing. With extensive experiments on several real-world datasets provided by Amazon, our method behaves better for the current problem compared with the baseline.

查看原文本刊更多论文

可解释推荐系统的近端策略优化

在本文中，为了直观地提高推荐的可解释性，我们充分利用了知识图，为推荐结果提供了一条“显式”的推荐路径。具体来说，我们根据消费者的行为将实体和关系嵌入到知识图中。然后提出了一种基于强化学习算法的知识推理方法，其目的是在给定知识图中的两个实体后，找到一条合理的路径。本文的主要贡献如下:使用当前最先进的强化学习方法，PPO进行改进。在训练过程中，调整后的软奖励函数对当前问题的表现更好。创新了一种基于q值的路径探索方法。在Amazon提供的几个真实数据集上进行了大量的实验，与基线相比，我们的方法对当前问题的表现更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 4th International Conference on Data-driven Optimization of Complex Systems (DOCS)

自引率

0.00%

发文量