Scaling Up Recommendation Services in Many Dimensions

Proceedings of the 9th ACM Conference on Recommender Systems Pub Date : 2015-09-16 DOI:10.1145/2792838.2799499

B. Németh

{"title":"Scaling Up Recommendation Services in Many Dimensions","authors":"B. Németh","doi":"10.1145/2792838.2799499","DOIUrl":null,"url":null,"abstract":"Gravity R&D has been providing recommendation engines as SaaS solutions since 2009. The company has a strong research focus and recommendation quality has always been their primary differentiating factor. Widely used or open source recommendation algorithms are of little use to our technology team as a result of the superiority of our in-house developed, proprietary algorithms. Gravity R&D experienced many challenges while scaling up their services. The sheer quantity of data handled on a daily basis increased exponentially. This presentation will cover how overcoming these challenges permanently shaped our algorithms and system architecture used to generate these recommendations. Serving personalized recommendations requires real-time computation and data access for every single request. To generate responses in real-time, current user inputs have to be compared against their history in order to deliver accurate recommendations. We then combine this user information with specific details about available items as the next step in the recommendation process. It becomes more difficult to provide accurate recommendations as the number of transactions and items increase. It also becomes difficult because this type of analysis requires the combination of multiple heterogeneous algorithms that all require different inputs. Initially, the architecture was designed for MF based models and serving huge numbers of requests but with a limited number of items. Now, Gravity is using MF, neighborhood based models and metadata based models to generate recommendations for millions of items within their databases. This required a shift from a monolithic architecture with in-process caching to a more service oriented architecture with multi-layer caching. As a result of an increase in the number of components and number of clients, managing the infrastructure can be quite difficult. Even with these challenges, we don't believe that it is worthwhile to use a fully distributed system. It adds unneeded complexity, resources, and overhead to the system. We prefer an approach of firstly optimizing current algorithms and architecture and only moving to a distributed system when no other options are left.","PeriodicalId":325637,"journal":{"name":"Proceedings of the 9th ACM Conference on Recommender Systems","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th ACM Conference on Recommender Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2792838.2799499","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Gravity R&D has been providing recommendation engines as SaaS solutions since 2009. The company has a strong research focus and recommendation quality has always been their primary differentiating factor. Widely used or open source recommendation algorithms are of little use to our technology team as a result of the superiority of our in-house developed, proprietary algorithms. Gravity R&D experienced many challenges while scaling up their services. The sheer quantity of data handled on a daily basis increased exponentially. This presentation will cover how overcoming these challenges permanently shaped our algorithms and system architecture used to generate these recommendations. Serving personalized recommendations requires real-time computation and data access for every single request. To generate responses in real-time, current user inputs have to be compared against their history in order to deliver accurate recommendations. We then combine this user information with specific details about available items as the next step in the recommendation process. It becomes more difficult to provide accurate recommendations as the number of transactions and items increase. It also becomes difficult because this type of analysis requires the combination of multiple heterogeneous algorithms that all require different inputs. Initially, the architecture was designed for MF based models and serving huge numbers of requests but with a limited number of items. Now, Gravity is using MF, neighborhood based models and metadata based models to generate recommendations for millions of items within their databases. This required a shift from a monolithic architecture with in-process caching to a more service oriented architecture with multi-layer caching. As a result of an increase in the number of components and number of clients, managing the infrastructure can be quite difficult. Even with these challenges, we don't believe that it is worthwhile to use a fully distributed system. It adds unneeded complexity, resources, and overhead to the system. We prefer an approach of firstly optimizing current algorithms and architecture and only moving to a distributed system when no other options are left.

查看原文本刊更多论文

在多个维度上扩展推荐服务

Gravity R&D从2009年开始提供推荐引擎作为SaaS解决方案。该公司有很强的研究重点和推荐质量一直是他们的主要差异化因素。广泛使用的或开源的推荐算法对我们的技术团队来说用处不大，因为我们内部开发的专有算法具有优势。Gravity研发在扩大服务规模的过程中遇到了许多挑战。每天处理的数据量呈指数级增长。本演讲将介绍如何克服这些挑战永久地塑造我们用于生成这些建议的算法和系统架构。提供个性化推荐需要对每个请求进行实时计算和数据访问。为了实时生成响应，必须将当前用户的输入与他们的历史进行比较，以便提供准确的建议。然后，我们将这些用户信息与有关可用项目的具体细节结合起来，作为推荐过程的下一步。随着交易和项目数量的增加，提供准确的推荐变得更加困难。它也变得困难，因为这种类型的分析需要多个异构算法的组合，这些算法都需要不同的输入。最初，该架构是为基于MF的模型而设计的，可以处理大量的请求，但项目数量有限。现在，Gravity正在使用MF、基于邻域的模型和基于元数据的模型来为他们数据库中的数百万个项目生成推荐。这需要从具有进程内缓存的单片架构转变为具有多层缓存的更面向服务的架构。由于组件和客户机数量的增加，管理基础设施可能会变得相当困难。即使面对这些挑战，我们也不认为使用一个完全分布式的系统是值得的。它给系统增加了不必要的复杂性、资源和开销。我们更倾向于首先优化当前的算法和架构，只有在没有其他选择的情况下才转向分布式系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 9th ACM Conference on Recommender Systems

自引率

0.00%

发文量