Alexey Vasilev, Anna Volodkevich, Denis Kulandin, Tatiana Bysheva, Anton Klenitskiy
{"title":"RePlay: a Recommendation Framework for Experimentation and Production Use","authors":"Alexey Vasilev, Anna Volodkevich, Denis Kulandin, Tatiana Bysheva, Anton Klenitskiy","doi":"arxiv-2409.07272","DOIUrl":null,"url":null,"abstract":"Using a single tool to build and compare recommender systems significantly\nreduces the time to market for new models. In addition, the comparison results\nwhen using such tools look more consistent. This is why many different tools\nand libraries for researchers in the field of recommendations have recently\nappeared. Unfortunately, most of these frameworks are aimed primarily at\nresearchers and require modification for use in production due to the inability\nto work on large datasets or an inappropriate architecture. In this demo, we\npresent our open-source toolkit RePlay - a framework containing an end-to-end\npipeline for building recommender systems, which is ready for production use.\nRePlay also allows you to use a suitable stack for the pipeline on each stage:\nPandas, Polars, or Spark. This allows the library to scale computations and\ndeploy to a cluster. Thus, RePlay allows data scientists to easily move from\nresearch mode to production mode using the same interfaces.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"92 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07272","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Using a single tool to build and compare recommender systems significantly
reduces the time to market for new models. In addition, the comparison results
when using such tools look more consistent. This is why many different tools
and libraries for researchers in the field of recommendations have recently
appeared. Unfortunately, most of these frameworks are aimed primarily at
researchers and require modification for use in production due to the inability
to work on large datasets or an inappropriate architecture. In this demo, we
present our open-source toolkit RePlay - a framework containing an end-to-end
pipeline for building recommender systems, which is ready for production use.
RePlay also allows you to use a suitable stack for the pipeline on each stage:
Pandas, Polars, or Spark. This allows the library to scale computations and
deploy to a cluster. Thus, RePlay allows data scientists to easily move from
research mode to production mode using the same interfaces.