A Multi-Armed Bandit Recommender Algorithm Based on Conversation and KNN

Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence Pub Date : 2022-12-23 DOI:10.1145/3579654.3579714

Hao-dong Xia, Zhifeng Lu, Wenxing Hong

{"title":"A Multi-Armed Bandit Recommender Algorithm Based on Conversation and KNN","authors":"Hao-dong Xia, Zhifeng Lu, Wenxing Hong","doi":"10.1145/3579654.3579714","DOIUrl":null,"url":null,"abstract":"With the wide application of recommendation systems in various fields, in order to effectively solve the cold-start problem in recommendation systems, contextual bandit algorithm uses user feedback to update user preferences online, converting the cold-start problem of recommendation systems into an exploration and exploitation problem. However, traditional contextual bandit algorithm is slow to learn due to the extensive exploration required. With the development of conversational recommendation, conversational contextual bandit algorithm learns the user's preference for key-term through conversation thus accelerating the learning speed. However, it only considers user feedback on key-term and ignores the relevance of key-term to each other. To solve the problem, a multi-armed bandit based on conversation and KNN (K-Nearest Neighbors) algorithm is proposed by introducing a more refined collaboration (KNNConUCB). Experiments on Synthetic data, as well as real datasets from Movielens and Last.FM, demonstrate the efficacy of the KNNConUCB algorithm.","PeriodicalId":146783,"journal":{"name":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3579654.3579714","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

With the wide application of recommendation systems in various fields, in order to effectively solve the cold-start problem in recommendation systems, contextual bandit algorithm uses user feedback to update user preferences online, converting the cold-start problem of recommendation systems into an exploration and exploitation problem. However, traditional contextual bandit algorithm is slow to learn due to the extensive exploration required. With the development of conversational recommendation, conversational contextual bandit algorithm learns the user's preference for key-term through conversation thus accelerating the learning speed. However, it only considers user feedback on key-term and ignores the relevance of key-term to each other. To solve the problem, a multi-armed bandit based on conversation and KNN (K-Nearest Neighbors) algorithm is proposed by introducing a more refined collaboration (KNNConUCB). Experiments on Synthetic data, as well as real datasets from Movielens and Last.FM, demonstrate the efficacy of the KNNConUCB algorithm.

查看原文本刊更多论文

基于对话和KNN的多臂强盗推荐算法

随着推荐系统在各个领域的广泛应用，为了有效解决推荐系统的冷启动问题，上下文强盗算法利用用户反馈在线更新用户偏好，将推荐系统的冷启动问题转化为一个探索和开发问题。然而，传统的上下文强盗算法由于需要大量的探索，学习速度较慢。随着会话推荐的发展，会话上下文强盗算法通过会话学习用户对关键词的偏好，从而加快了学习速度。然而，它只考虑用户对关键词的反馈，而忽略了关键词之间的相关性。为了解决这一问题，通过引入更精细的协作(KNNConUCB)，提出了一种基于对话和KNN (K-Nearest Neighbors)算法的多臂强盗。合成数据的实验，以及来自Movielens和Last的真实数据集。FM，验证了KNNConUCB算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence

自引率

0.00%

发文量