强化学习及其在推荐系统中的应用

International Journal of All Research Education and Scientific Methods Pub Date : 1900-01-01 DOI:10.56025/ijaresm.2023.11223278

Mohak Sharma, Neeraj Gandhi, Supreme Datta, Bhavani Annarapu, Krutika Arvind Tomanvar, Mayuresh Bhovardhan

{"title":"强化学习及其在推荐系统中的应用","authors":"Mohak Sharma, Neeraj Gandhi, Supreme Datta, Bhavani Annarapu, Krutika Arvind Tomanvar, Mayuresh Bhovardhan","doi":"10.56025/ijaresm.2023.11223278","DOIUrl":null,"url":null,"abstract":"Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents are made to take actions in an environment in order to maximize the total reward. RL works on Markov Decision Process which leads to Q-learning. MDP provides a mechanism to maximize the reward in a given environment. Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. DRL has applications in many fields like medicine, robotics, games, etc. Combining DL and RL leads to the formation of Deep QNetworks. Another application of RL and the focus of this seminar is personalized recommendation systems. Recommendation systems are trained on user-item interaction to predict the next item that a user can be interested in. In personalized recommendation systems, it is important to consider a lot of factors like user-item interactions, clicks, purchases, loss, etc. DRL does a very good job of reducing the loss in recommendation systems and recommending relevant items. I will explore a technique called self-supervised learning in which a model is made to predict data for itself for its use in recommendation systems. I also summarize 2 frameworks namely SelfSupervised Q-learning (SQN) and Self-Supervised Actor-Critic (SAC) based on self-supervised learning and try to understand how they work differently as compared to non-RL recommendation systems.","PeriodicalId":314707,"journal":{"name":"International Journal of All Research Education and Scientific Methods","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Learning and its application in making Recommendation System\",\"authors\":\"Mohak Sharma, Neeraj Gandhi, Supreme Datta, Bhavani Annarapu, Krutika Arvind Tomanvar, Mayuresh Bhovardhan\",\"doi\":\"10.56025/ijaresm.2023.11223278\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents are made to take actions in an environment in order to maximize the total reward. RL works on Markov Decision Process which leads to Q-learning. MDP provides a mechanism to maximize the reward in a given environment. Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. DRL has applications in many fields like medicine, robotics, games, etc. Combining DL and RL leads to the formation of Deep QNetworks. Another application of RL and the focus of this seminar is personalized recommendation systems. Recommendation systems are trained on user-item interaction to predict the next item that a user can be interested in. In personalized recommendation systems, it is important to consider a lot of factors like user-item interactions, clicks, purchases, loss, etc. DRL does a very good job of reducing the loss in recommendation systems and recommending relevant items. I will explore a technique called self-supervised learning in which a model is made to predict data for itself for its use in recommendation systems. I also summarize 2 frameworks namely SelfSupervised Q-learning (SQN) and Self-Supervised Actor-Critic (SAC) based on self-supervised learning and try to understand how they work differently as compared to non-RL recommendation systems.\",\"PeriodicalId\":314707,\"journal\":{\"name\":\"International Journal of All Research Education and Scientific Methods\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of All Research Education and Scientific Methods\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.56025/ijaresm.2023.11223278\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of All Research Education and Scientific Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56025/ijaresm.2023.11223278","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

强化学习(RL)是机器学习的一个领域，涉及如何使智能代理在环境中采取行动以最大化总奖励。强化学习研究马尔科夫决策过程，这导致了q学习。MDP提供了一种在给定环境中最大化奖励的机制。深度强化学习是强化学习(RL)和深度学习的结合。DRL在医学、机器人、游戏等许多领域都有应用。深度学习和强化学习的结合形成了深度QNetworks。RL的另一个应用，也是本次研讨会的重点是个性化推荐系统。推荐系统通过用户与物品的交互进行训练，以预测用户可能感兴趣的下一个物品。在个性化推荐系统中，重要的是要考虑许多因素，如用户与物品的交互、点击、购买、损失等。DRL在减少推荐系统的损失和推荐相关项目方面做得很好。我将探索一种称为自监督学习的技术，在这种技术中，模型可以为自己预测数据，以便在推荐系统中使用。我还总结了两种框架，即基于自我监督学习的自我监督Q-learning (SQN)和自我监督Actor-Critic (SAC)，并试图理解它们与非强化学习推荐系统相比有何不同。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Reinforcement Learning and its application in making Recommendation System

Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents are made to take actions in an environment in order to maximize the total reward. RL works on Markov Decision Process which leads to Q-learning. MDP provides a mechanism to maximize the reward in a given environment. Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. DRL has applications in many fields like medicine, robotics, games, etc. Combining DL and RL leads to the formation of Deep QNetworks. Another application of RL and the focus of this seminar is personalized recommendation systems. Recommendation systems are trained on user-item interaction to predict the next item that a user can be interested in. In personalized recommendation systems, it is important to consider a lot of factors like user-item interactions, clicks, purchases, loss, etc. DRL does a very good job of reducing the loss in recommendation systems and recommending relevant items. I will explore a technique called self-supervised learning in which a model is made to predict data for itself for its use in recommendation systems. I also summarize 2 frameworks namely SelfSupervised Q-learning (SQN) and Self-Supervised Actor-Critic (SAC) based on self-supervised learning and try to understand how they work differently as compared to non-RL recommendation systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of All Research Education and Scientific Methods

自引率

0.00%

发文量