RePlay: a Recommendation Framework for Experimentation and Production Use

arXiv - CS - Software Engineering Pub Date : 2024-09-11 DOI:arxiv-2409.07272

Alexey Vasilev, Anna Volodkevich, Denis Kulandin, Tatiana Bysheva, Anton Klenitskiy

引用次数: 0

Abstract

Using a single tool to build and compare recommender systems significantly reduces the time to market for new models. In addition, the comparison results when using such tools look more consistent. This is why many different tools and libraries for researchers in the field of recommendations have recently appeared. Unfortunately, most of these frameworks are aimed primarily at researchers and require modification for use in production due to the inability to work on large datasets or an inappropriate architecture. In this demo, we present our open-source toolkit RePlay - a framework containing an end-to-end pipeline for building recommender systems, which is ready for production use. RePlay also allows you to use a suitable stack for the pipeline on each stage: Pandas, Polars, or Spark. This allows the library to scale computations and deploy to a cluster. Thus, RePlay allows data scientists to easily move from research mode to production mode using the same interfaces.

查看原文本刊更多论文

RePlay：用于实验和生产的推荐框架

使用单一工具来构建和比较推荐系统可以大大缩短新模型的上市时间。此外，使用此类工具得出的比较结果也更加一致。这就是为什么最近出现了许多不同的工具和库，供推荐领域的研究人员使用。遗憾的是，这些框架大多主要面向研究人员，由于无法处理大型数据集或架构不合适，在生产中使用时需要进行修改。在本演示中，我们将介绍我们的开源工具包 RePlay--一个包含用于构建推荐系统的端到端流水线的框架，可随时用于生产。这样，该库就可以扩展计算并部署到集群中。因此，RePlay 允许数据科学家使用相同的接口轻松地从研究模式转向生产模式。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Software Engineering

自引率

0.00%

发文量