Safely and Quickly Deploying New Features with a Staged Rollout Framework Using Sequential Test and Adaptive Experimental Design

2018 3rd International Conference on Computational Intelligence and Applications (ICCIA) Pub Date : 2018-07-01 DOI:10.1109/ICCIA.2018.00019

Zhenyu Zhao, Mandie Liu, Anirban Deb

{"title":"Safely and Quickly Deploying New Features with a Staged Rollout Framework Using Sequential Test and Adaptive Experimental Design","authors":"Zhenyu Zhao, Mandie Liu, Anirban Deb","doi":"10.1109/ICCIA.2018.00019","DOIUrl":null,"url":null,"abstract":"During the rapid development cycle for Internet products (websites and mobile apps), new features are developed and rolled out to users constantly. Features with code defects or design flaws can cause outages and significant degradation of user experience. The traditional method of code review and change management can be time-consuming and error-prone. In order to make the feature rollout process safe and fast, this paper proposes a methodology for rolling out features in an automated way using an adaptive experimental design. Under this framework, a feature is gradually ramped up from a small proportion of users to a larger population based on real-time evaluation of the performance of important metrics. If there are any regression detected during the ramp-up step, the ramp-up process stops and the feature developer is alerted. There are two main algorithm components powering this framework: 1) a continuous monitoring algorithm - using a variant of the sequential probability ratio test (SPRT) to monitor the feature performance metrics and alert feature developers when a metric degradation is detected, 2) an automated ramp-up algorithm - deciding when and how to ramp up to the next stage with larger sample size. This paper presents one monitoring algorithm and three ramping up algorithms including time-based, power-based, and risk-based (a Bayesian approach) schedules. These algorithms are evaluated and compared on both simulated data and real data. There are three benefits provided by this framework for feature rollout: 1) for defective features, it can detect the regression early and reduce negative effect, 2) for healthy features, it rolls out the feature quickly, 3) it reduces the need for manual intervention via the automation of the feature rollout process.","PeriodicalId":297098,"journal":{"name":"2018 3rd International Conference on Computational Intelligence and Applications (ICCIA)","volume":"112 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 3rd International Conference on Computational Intelligence and Applications (ICCIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIA.2018.00019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

During the rapid development cycle for Internet products (websites and mobile apps), new features are developed and rolled out to users constantly. Features with code defects or design flaws can cause outages and significant degradation of user experience. The traditional method of code review and change management can be time-consuming and error-prone. In order to make the feature rollout process safe and fast, this paper proposes a methodology for rolling out features in an automated way using an adaptive experimental design. Under this framework, a feature is gradually ramped up from a small proportion of users to a larger population based on real-time evaluation of the performance of important metrics. If there are any regression detected during the ramp-up step, the ramp-up process stops and the feature developer is alerted. There are two main algorithm components powering this framework: 1) a continuous monitoring algorithm - using a variant of the sequential probability ratio test (SPRT) to monitor the feature performance metrics and alert feature developers when a metric degradation is detected, 2) an automated ramp-up algorithm - deciding when and how to ramp up to the next stage with larger sample size. This paper presents one monitoring algorithm and three ramping up algorithms including time-based, power-based, and risk-based (a Bayesian approach) schedules. These algorithms are evaluated and compared on both simulated data and real data. There are three benefits provided by this framework for feature rollout: 1) for defective features, it can detect the regression early and reduce negative effect, 2) for healthy features, it rolls out the feature quickly, 3) it reduces the need for manual intervention via the automation of the feature rollout process.

查看原文本刊更多论文

使用顺序测试和自适应实验设计的分阶段推出框架安全快速地部署新功能

在互联网产品(网站和移动应用)的快速开发周期中，不断有新的特性被开发出来并推向用户。带有代码缺陷或设计缺陷的特性可能导致中断和显著降低用户体验。传统的代码审查和变更管理方法既耗时又容易出错。为了保证特征推出过程的安全性和快速性，本文提出了一种基于自适应实验设计的特征自动推出方法。在这个框架下，基于对重要指标性能的实时评估，一个功能从一小部分用户逐渐增加到更大的人群。如果在升级步骤中检测到任何回归，那么升级过程将停止，并向功能开发人员发出警报。这个框架有两个主要的算法组件:1)一个持续监控算法——使用序列概率比测试(SPRT)的一种变体来监控特征性能指标，并在检测到指标退化时向特征开发人员发出警报;2)一个自动提升算法——决定何时以及如何以更大的样本量提升到下一阶段。本文提出了一种监测算法和三种加速算法，包括基于时间的、基于功率的和基于风险的(贝叶斯方法)调度。在模拟数据和实际数据上对这些算法进行了评价和比较。该框架为功能推出提供了三个好处:1)对于有缺陷的功能，它可以及早发现回归并减少负面影响;2)对于健康的功能，它可以快速推出功能;3)通过功能推出过程的自动化减少了人工干预的需要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 3rd International Conference on Computational Intelligence and Applications (ICCIA)

自引率

0.00%

发文量