A Multi-Armed Bandit Recommender Algorithm Based on Conversation and KNN

Hao-dong Xia, Zhifeng Lu, Wenxing Hong
{"title":"A Multi-Armed Bandit Recommender Algorithm Based on Conversation and KNN","authors":"Hao-dong Xia, Zhifeng Lu, Wenxing Hong","doi":"10.1145/3579654.3579714","DOIUrl":null,"url":null,"abstract":"With the wide application of recommendation systems in various fields, in order to effectively solve the cold-start problem in recommendation systems, contextual bandit algorithm uses user feedback to update user preferences online, converting the cold-start problem of recommendation systems into an exploration and exploitation problem. However, traditional contextual bandit algorithm is slow to learn due to the extensive exploration required. With the development of conversational recommendation, conversational contextual bandit algorithm learns the user's preference for key-term through conversation thus accelerating the learning speed. However, it only considers user feedback on key-term and ignores the relevance of key-term to each other. To solve the problem, a multi-armed bandit based on conversation and KNN (K-Nearest Neighbors) algorithm is proposed by introducing a more refined collaboration (KNNConUCB). Experiments on Synthetic data, as well as real datasets from Movielens and Last.FM, demonstrate the efficacy of the KNNConUCB algorithm.","PeriodicalId":146783,"journal":{"name":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3579654.3579714","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the wide application of recommendation systems in various fields, in order to effectively solve the cold-start problem in recommendation systems, contextual bandit algorithm uses user feedback to update user preferences online, converting the cold-start problem of recommendation systems into an exploration and exploitation problem. However, traditional contextual bandit algorithm is slow to learn due to the extensive exploration required. With the development of conversational recommendation, conversational contextual bandit algorithm learns the user's preference for key-term through conversation thus accelerating the learning speed. However, it only considers user feedback on key-term and ignores the relevance of key-term to each other. To solve the problem, a multi-armed bandit based on conversation and KNN (K-Nearest Neighbors) algorithm is proposed by introducing a more refined collaboration (KNNConUCB). Experiments on Synthetic data, as well as real datasets from Movielens and Last.FM, demonstrate the efficacy of the KNNConUCB algorithm.
基于对话和KNN的多臂强盗推荐算法
随着推荐系统在各个领域的广泛应用,为了有效解决推荐系统的冷启动问题,上下文强盗算法利用用户反馈在线更新用户偏好,将推荐系统的冷启动问题转化为一个探索和开发问题。然而,传统的上下文强盗算法由于需要大量的探索,学习速度较慢。随着会话推荐的发展,会话上下文强盗算法通过会话学习用户对关键词的偏好,从而加快了学习速度。然而,它只考虑用户对关键词的反馈,而忽略了关键词之间的相关性。为了解决这一问题,通过引入更精细的协作(KNNConUCB),提出了一种基于对话和KNN (K-Nearest Neighbors)算法的多臂强盗。合成数据的实验,以及来自Movielens和Last的真实数据集。FM,验证了KNNConUCB算法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信