{"title":"Regressing Towards Simpler Prediction Systems","authors":"Tushar Chandra","doi":"10.1145/2684822.2697045","DOIUrl":null,"url":null,"abstract":"This talk will focus on our experience in managing the complexity of Sibyl, a large scale machine learning system that is widely used within Google. We believe that a large fraction of the challenges faced by Sibyl are inherent to large scale production machine learning and that other production systems are likely to encounter them as well [1]. Thus, these challenges present interesting opportunities for future research. The Sibyl system is complex for a number of reasons. We have learnt that a complete end-to-end machine learning solution has to have subsystems to address a variety of different needs: data ingestion, data analysis, data verification, experimentation, model analysis, model serving, configuration, data transformations, support for different kinds of loss functions and modeling, machine learning algorithm implementations, etc. Machine learning algorithms themselves constitute a relatively small fraction of the overall system. Each subsystem consists of a number of distinct components to support the variety of product needs. For example, Sibyl supports more than 5 different model serving systems, each with its own idiosyncrasies and challenges. In addition, Sibyl configuration contains more lines of code than the core Sibyl learner itself. Finally existing solutions for some of the challenges don't feel adequate and we believe these challenges present opportunities for future research. Though the overall system is complex, our users need to be able to deploy solutions quickly. This is because a machine learning deployment is typically an iterative process of model improvements. At each iteration, our users experiment with new features, find those that improve the model's prediction capability, and then \"launch\" a new model with those improved features. A user may go through 10 or more such productive launches. Not only is speed of iteration crucial to our users, but they are often willing to sacrifice the improved prediction quality of a high quality but cumbersome system for the speed of iteration of a lower quality but nimble system. In this talk I will give an example of how simplification drives systems design and sometimes the design of novel algorithms.","PeriodicalId":179443,"journal":{"name":"Proceedings of the Eighth ACM International Conference on Web Search and Data Mining","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Eighth ACM International Conference on Web Search and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2684822.2697045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1
Abstract
This talk will focus on our experience managing the complexity of Sibyl, a large-scale machine learning system that is widely used within Google. We believe that a large fraction of the challenges Sibyl faces are inherent to large-scale production machine learning, and that other production systems are likely to encounter them as well [1]. These challenges therefore present interesting opportunities for future research. The Sibyl system is complex for a number of reasons. We have learned that a complete end-to-end machine learning solution needs subsystems that address a variety of needs: data ingestion, data analysis, data verification, experimentation, model analysis, model serving, configuration, data transformations, support for different kinds of loss functions and modeling, machine learning algorithm implementations, and so on. The machine learning algorithms themselves constitute a relatively small fraction of the overall system. Each subsystem consists of a number of distinct components that support the variety of product needs. For example, Sibyl supports more than five different model-serving systems, each with its own idiosyncrasies and challenges. In addition, Sibyl's configuration contains more lines of code than the core Sibyl learner itself. Finally, existing solutions to some of these challenges feel inadequate, and we believe they present opportunities for future research. Although the overall system is complex, our users need to be able to deploy solutions quickly, because a machine learning deployment is typically an iterative process of model improvement. At each iteration, our users experiment with new features, identify those that improve the model's prediction quality, and then "launch" a new model with those features. A user may go through ten or more such productive launches. Speed of iteration is so important to our users that they are often willing to trade the better prediction quality of a high-quality but cumbersome system for the faster iteration of a lower-quality but nimble system. In this talk I will give an example of how simplification drives system design and, sometimes, the design of novel algorithms.
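The iterative launch cycle described above can be made concrete with a small sketch. The snippet below is purely illustrative and assumes nothing about Sibyl's actual API: `Model`, `iterate_and_launch`, and the `evaluate` callback are hypothetical names standing in for a greedy "try a feature, keep it if it improves quality, then launch" loop.

```python
# Hypothetical sketch (not Sibyl's actual API): models the iterative
# "experiment with features, keep the ones that help, launch" cycle
# described in the abstract. All names here are illustrative assumptions.
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Model:
    features: List[str] = field(default_factory=list)
    quality: float = 0.0  # e.g. held-out metric for the launched model


def iterate_and_launch(
    base: Model,
    candidate_features: List[str],
    evaluate: Callable[[List[str]], float],
    max_launches: int = 10,
) -> Model:
    """Greedy stand-in for the launch cycle: try each candidate feature,
    keep it only if the evaluated quality improves, then 'launch'."""
    current = base
    launches = 0
    for feature in candidate_features:
        if launches >= max_launches:
            break
        trial_features = current.features + [feature]
        trial_quality = evaluate(trial_features)
        if trial_quality > current.quality:
            current = Model(trial_features, trial_quality)
            launches += 1  # a productive launch: the new feature helped
    return current


# Toy usage: quality here is simply the number of "useful" features included.
useful = {"query_length", "click_history"}
evaluate = lambda feats: float(len(useful.intersection(feats)))
final = iterate_and_launch(
    Model(), ["query_length", "page_rank", "click_history"], evaluate
)
print(final.features, final.quality)
```

The point of the sketch is the structure of the loop, not the evaluation logic: fast iteration through many such launches is what the abstract argues users value, sometimes even over raw prediction quality.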