{"title":"Proximal Policy Optimization for Explainable Recommended Systems","authors":"Qian Feng, Geyang Xiao, Yuan Liang, Huifeng Zhang, Linlin Yan, Xiaoyu Yi","doi":"10.1109/DOCS55193.2022.9967709","DOIUrl":null,"url":null,"abstract":"In this paper, in order to intuitively improve the interpretability of the recommendation, we make full use of the knowledge graph and provide an ‘explicit’ recommendation path for the recommended result. Specifically, we embeds the entities and relations in the knowledge graph according to consumers’ behavior. Then a knowledge reasoning method based on reinforcement learning algorithm is proposed, which aims to find a reasonable path after two entities in the knowledge graph are given. The main contributions of this paper are as follows: The current State-of-the-art method in reinforcement learning, PPO is used for improvement. The soft-reward function after adjustment behaves better for the current problem while training. And a Q-value based path exploration method is innovated for testing. With extensive experiments on several real-world datasets provided by Amazon, our method behaves better for the current problem compared with the baseline.","PeriodicalId":348545,"journal":{"name":"2022 4th International Conference on Data-driven Optimization of Complex Systems (DOCS)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Data-driven Optimization of Complex Systems (DOCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DOCS55193.2022.9967709","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, in order to intuitively improve the interpretability of the recommendation, we make full use of the knowledge graph and provide an ‘explicit’ recommendation path for the recommended result. Specifically, we embeds the entities and relations in the knowledge graph according to consumers’ behavior. Then a knowledge reasoning method based on reinforcement learning algorithm is proposed, which aims to find a reasonable path after two entities in the knowledge graph are given. The main contributions of this paper are as follows: The current State-of-the-art method in reinforcement learning, PPO is used for improvement. The soft-reward function after adjustment behaves better for the current problem while training. And a Q-value based path exploration method is innovated for testing. With extensive experiments on several real-world datasets provided by Amazon, our method behaves better for the current problem compared with the baseline.