Exploration in Recommender Systems

Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining Pub Date : 2022-02-11 DOI:10.1145/3488560.3510009

Minmin Chen

{"title":"Exploration in Recommender Systems","authors":"Minmin Chen","doi":"10.1145/3488560.3510009","DOIUrl":null,"url":null,"abstract":"In the era of increasing choices, recommender systems are becoming indispensable in helping users navigate the million or billion pieces of content on recommendation platforms. Most of the recommender systems are powered by ML models trained on a large amount of user-item interaction data. Such a setup however induces a strong feedback loop that creates the rich gets richer phenomenon where head contents are getting more and more exposure while tail and fresh contents are not discovered. At the same time, it pigeonholes users to contents they are already familiar with. We believe exploration is key to break away from the feedback loop and to optimize long term user experience on recommendation platforms. The exploration-exploitation tradeoff, being the foundation of bandits and RL research, has been extensively studied in RL. While effective exploration is believed to positively influence the user experience on the platform, the exact value of exploration in recommender systems has not been well established. In this talk, we examine the roles of exploration in recommender systems in three facets: 1) system exploration to surface fresh/tail recommendations based on users' known interests; 2) user exploration to identify unknown user interests or introduce users to new interests; and 3) online exploration to utilize real-time user feedback to reduce extrapolation errors in performing system and user exploration. We discuss the challenges in measurements and optimization in different types of exploration, and propose initial solutions. We showcase how each aspect of exploration contributes to the long term user experience through offline and live experiments on industrial recommendation platforms. We hope this talk can inspire more follow up work in understanding and improving exploration in recommender systems.","PeriodicalId":348686,"journal":{"name":"Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3488560.3510009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

In the era of increasing choices, recommender systems are becoming indispensable in helping users navigate the million or billion pieces of content on recommendation platforms. Most of the recommender systems are powered by ML models trained on a large amount of user-item interaction data. Such a setup however induces a strong feedback loop that creates the rich gets richer phenomenon where head contents are getting more and more exposure while tail and fresh contents are not discovered. At the same time, it pigeonholes users to contents they are already familiar with. We believe exploration is key to break away from the feedback loop and to optimize long term user experience on recommendation platforms. The exploration-exploitation tradeoff, being the foundation of bandits and RL research, has been extensively studied in RL. While effective exploration is believed to positively influence the user experience on the platform, the exact value of exploration in recommender systems has not been well established. In this talk, we examine the roles of exploration in recommender systems in three facets: 1) system exploration to surface fresh/tail recommendations based on users' known interests; 2) user exploration to identify unknown user interests or introduce users to new interests; and 3) online exploration to utilize real-time user feedback to reduce extrapolation errors in performing system and user exploration. We discuss the challenges in measurements and optimization in different types of exploration, and propose initial solutions. We showcase how each aspect of exploration contributes to the long term user experience through offline and live experiments on industrial recommendation platforms. We hope this talk can inspire more follow up work in understanding and improving exploration in recommender systems.

查看原文本刊更多论文

推荐系统的探索

在选择越来越多的时代，推荐系统在帮助用户浏览推荐平台上的数百万或数十亿条内容方面变得不可或缺。大多数推荐系统都是由经过大量用户-项目交互数据训练的ML模型提供支持的。然而，这样的设置诱导了一个强大的反馈循环，创造了“富得越来越富”的现象，即头部内容得到越来越多的曝光，而尾部和新鲜内容却没有被发现。与此同时，它将用户归类到他们已经熟悉的内容中。我们认为，探索是打破反馈循环、优化推荐平台长期用户体验的关键。勘探与开发的权衡是强盗和RL研究的基础，在RL中得到了广泛的研究。虽然有效的探索被认为对平台上的用户体验有积极的影响，但探索在推荐系统中的确切价值尚未得到很好的确立。在这次演讲中，我们从三个方面考察了探索在推荐系统中的作用:1)系统探索，根据用户已知的兴趣提供新鲜/尾推荐;2)用户探索，识别未知的用户兴趣或向用户介绍新的兴趣;3)在线探索利用实时用户反馈来减少执行系统和用户探索时的外推误差。我们讨论了在不同类型的勘探中测量和优化的挑战，并提出了初步的解决方案。我们通过行业推荐平台上的线下和现场实验，展示了探索的各个方面如何为长期用户体验做出贡献。我们希望这次演讲能够激发更多的后续工作来理解和改进推荐系统的探索。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining

自引率

0.00%

发文量