{"title":"使用深度强化学习和多头注意的顺序推荐","authors":"Raneem Sultan, Mervat Abu-Elkheir","doi":"10.1109/CISS53076.2022.9751174","DOIUrl":null,"url":null,"abstract":"Recommender Systems have become a crucial part of many of our online interactions. From shopping for clothes, planning a trip, or deciding what to watch, recommender systems are aiming to help users navigate the overwhelming amount of options available online. The problem with most of the existing recommender systems is that they treat the recommendation process as a static one and make recommendations according to a fixed greedy strategy. This is a problem because user preferences are dynamic. In this paper, we aim to address this problem by modeling the recommendation problem as a Markov Decision Process (MDP) and solving it using deep reinforcement learning. Furthermore, we use multi-head attention to improve the recommendations. We conduct extensive experiments using the MovieLens real-world dataset and achieve an improvement of 6% over the state-of-the-art approach results in terms of precision@20.","PeriodicalId":305918,"journal":{"name":"2022 56th Annual Conference on Information Sciences and Systems (CISS)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Sequential Recommendation Using Deep Reinforcement Learning and Multi-Head Attention\",\"authors\":\"Raneem Sultan, Mervat Abu-Elkheir\",\"doi\":\"10.1109/CISS53076.2022.9751174\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recommender Systems have become a crucial part of many of our online interactions. From shopping for clothes, planning a trip, or deciding what to watch, recommender systems are aiming to help users navigate the overwhelming amount of options available online. The problem with most of the existing recommender systems is that they treat the recommendation process as a static one and make recommendations according to a fixed greedy strategy. This is a problem because user preferences are dynamic. In this paper, we aim to address this problem by modeling the recommendation problem as a Markov Decision Process (MDP) and solving it using deep reinforcement learning. Furthermore, we use multi-head attention to improve the recommendations. We conduct extensive experiments using the MovieLens real-world dataset and achieve an improvement of 6% over the state-of-the-art approach results in terms of precision@20.\",\"PeriodicalId\":305918,\"journal\":{\"name\":\"2022 56th Annual Conference on Information Sciences and Systems (CISS)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 56th Annual Conference on Information Sciences and Systems (CISS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CISS53076.2022.9751174\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 56th Annual Conference on Information Sciences and Systems (CISS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISS53076.2022.9751174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Sequential Recommendation Using Deep Reinforcement Learning and Multi-Head Attention
Recommender Systems have become a crucial part of many of our online interactions. From shopping for clothes, planning a trip, or deciding what to watch, recommender systems are aiming to help users navigate the overwhelming amount of options available online. The problem with most of the existing recommender systems is that they treat the recommendation process as a static one and make recommendations according to a fixed greedy strategy. This is a problem because user preferences are dynamic. In this paper, we aim to address this problem by modeling the recommendation problem as a Markov Decision Process (MDP) and solving it using deep reinforcement learning. Furthermore, we use multi-head attention to improve the recommendations. We conduct extensive experiments using the MovieLens real-world dataset and achieve an improvement of 6% over the state-of-the-art approach results in terms of precision@20.