{"title":"强化学习及其在推荐系统中的应用","authors":"Mohak Sharma, Neeraj Gandhi, Supreme Datta, Bhavani Annarapu, Krutika Arvind Tomanvar, Mayuresh Bhovardhan","doi":"10.56025/ijaresm.2023.11223278","DOIUrl":null,"url":null,"abstract":"Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents are made to take actions in an environment in order to maximize the total reward. RL works on Markov Decision Process which leads to Q-learning. MDP provides a mechanism to maximize the reward in a given environment. Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. DRL has applications in many fields like medicine, robotics, games, etc. Combining DL and RL leads to the formation of Deep QNetworks. Another application of RL and the focus of this seminar is personalized recommendation systems. Recommendation systems are trained on user-item interaction to predict the next item that a user can be interested in. In personalized recommendation systems, it is important to consider a lot of factors like user-item interactions, clicks, purchases, loss, etc. DRL does a very good job of reducing the loss in recommendation systems and recommending relevant items. I will explore a technique called self-supervised learning in which a model is made to predict data for itself for its use in recommendation systems. I also summarize 2 frameworks namely SelfSupervised Q-learning (SQN) and Self-Supervised Actor-Critic (SAC) based on self-supervised learning and try to understand how they work differently as compared to non-RL recommendation systems.","PeriodicalId":314707,"journal":{"name":"International Journal of All Research Education and Scientific Methods","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Learning and its application in making Recommendation System\",\"authors\":\"Mohak Sharma, Neeraj Gandhi, Supreme Datta, Bhavani Annarapu, Krutika Arvind Tomanvar, Mayuresh Bhovardhan\",\"doi\":\"10.56025/ijaresm.2023.11223278\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents are made to take actions in an environment in order to maximize the total reward. RL works on Markov Decision Process which leads to Q-learning. MDP provides a mechanism to maximize the reward in a given environment. Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. DRL has applications in many fields like medicine, robotics, games, etc. Combining DL and RL leads to the formation of Deep QNetworks. Another application of RL and the focus of this seminar is personalized recommendation systems. Recommendation systems are trained on user-item interaction to predict the next item that a user can be interested in. In personalized recommendation systems, it is important to consider a lot of factors like user-item interactions, clicks, purchases, loss, etc. DRL does a very good job of reducing the loss in recommendation systems and recommending relevant items. I will explore a technique called self-supervised learning in which a model is made to predict data for itself for its use in recommendation systems. I also summarize 2 frameworks namely SelfSupervised Q-learning (SQN) and Self-Supervised Actor-Critic (SAC) based on self-supervised learning and try to understand how they work differently as compared to non-RL recommendation systems.\",\"PeriodicalId\":314707,\"journal\":{\"name\":\"International Journal of All Research Education and Scientific Methods\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of All Research Education and Scientific Methods\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.56025/ijaresm.2023.11223278\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of All Research Education and Scientific Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56025/ijaresm.2023.11223278","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning and its application in making Recommendation System
Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents are made to take actions in an environment in order to maximize the total reward. RL works on Markov Decision Process which leads to Q-learning. MDP provides a mechanism to maximize the reward in a given environment. Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. DRL has applications in many fields like medicine, robotics, games, etc. Combining DL and RL leads to the formation of Deep QNetworks. Another application of RL and the focus of this seminar is personalized recommendation systems. Recommendation systems are trained on user-item interaction to predict the next item that a user can be interested in. In personalized recommendation systems, it is important to consider a lot of factors like user-item interactions, clicks, purchases, loss, etc. DRL does a very good job of reducing the loss in recommendation systems and recommending relevant items. I will explore a technique called self-supervised learning in which a model is made to predict data for itself for its use in recommendation systems. I also summarize 2 frameworks namely SelfSupervised Q-learning (SQN) and Self-Supervised Actor-Critic (SAC) based on self-supervised learning and try to understand how they work differently as compared to non-RL recommendation systems.