Response Time Characterization of Microservice-Based Systems

2018 IEEE 17th International Symposium on Network Computing and Applications (NCA) Pub Date : 2018-11-01 DOI:10.1109/NCA.2018.8548062

Jaime Correia, Fabio Ribeiro, R. Filipe, Filipe Araújo, J. Cardoso

{"title":"Response Time Characterization of Microservice-Based Systems","authors":"Jaime Correia, Fabio Ribeiro, R. Filipe, Filipe Araújo, J. Cardoso","doi":"10.1109/NCA.2018.8548062","DOIUrl":null,"url":null,"abstract":"In pursuit of faster development cycles, companies have favored small decoupled services over monoliths. Following this trend, distributed systems made of microservices have grown in scale and complexity, giving rise to a new set of operational problems. Even though this paradigm simplifies development, deployment, management of individual services, it hinders system observability. In particular, performance monitoring and analysis becomes more challenging, especially for critical production systems that have grown organically, operate continuously, and cannot afford the availability cost of online benchmarking. Additionally, these systems are often very large and expensive, thus being bad candidates for full-scale development replicas. Creating models of services and systems for characterization and formal analysis can alleviate the aforementioned issues. Since performance, namely response time, is the main interest of this work, we focused on bottleneck detection and optimal resource scheduling. We propose a method for modeling production services as queuing systems from request traces. Additionally, we provide analytical tools for response time characterization and optimal resource allocation. Our results show that a simple queuing system with a single queue and multiple homogeneous servers has a small parameter space that can be estimated in production. The resulting model can be used to accurately predict response time distribution and the necessary number of instances to maintain a desired service level, under a given load.","PeriodicalId":268662,"journal":{"name":"2018 IEEE 17th International Symposium on Network Computing and Applications (NCA)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 17th International Symposium on Network Computing and Applications (NCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCA.2018.8548062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

Abstract

In pursuit of faster development cycles, companies have favored small decoupled services over monoliths. Following this trend, distributed systems made of microservices have grown in scale and complexity, giving rise to a new set of operational problems. Even though this paradigm simplifies development, deployment, management of individual services, it hinders system observability. In particular, performance monitoring and analysis becomes more challenging, especially for critical production systems that have grown organically, operate continuously, and cannot afford the availability cost of online benchmarking. Additionally, these systems are often very large and expensive, thus being bad candidates for full-scale development replicas. Creating models of services and systems for characterization and formal analysis can alleviate the aforementioned issues. Since performance, namely response time, is the main interest of this work, we focused on bottleneck detection and optimal resource scheduling. We propose a method for modeling production services as queuing systems from request traces. Additionally, we provide analytical tools for response time characterization and optimal resource allocation. Our results show that a simple queuing system with a single queue and multiple homogeneous servers has a small parameter space that can be estimated in production. The resulting model can be used to accurately predict response time distribution and the necessary number of instances to maintain a desired service level, under a given load.

查看原文本刊更多论文

基于微服务的系统响应时间表征

为了追求更快的开发周期，公司更喜欢小型的解耦服务，而不是大型服务。遵循这一趋势，由微服务组成的分布式系统在规模和复杂性上都有所增长，从而产生了一系列新的操作问题。尽管这种范例简化了单个服务的开发、部署和管理，但它阻碍了系统的可观察性。特别是，性能监视和分析变得更具挑战性，特别是对于那些有机增长、持续运行且无法负担在线基准测试的可用性成本的关键生产系统。此外，这些系统通常非常大且昂贵，因此不适合全面开发副本。为特性描述和形式化分析创建服务和系统模型可以缓解上述问题。由于性能(即响应时间)是本工作的主要关注点，因此我们将重点放在瓶颈检测和最优资源调度上。我们提出了一种将生产服务建模为基于请求跟踪的排队系统的方法。此外，我们还提供响应时间表征和最佳资源分配的分析工具。我们的结果表明，具有单个队列和多个同构服务器的简单排队系统具有可以在生产中估计的小参数空间。得到的模型可用于准确预测响应时间分布，以及在给定负载下维持所需服务水平所需的实例数量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE 17th International Symposium on Network Computing and Applications (NCA)

自引率

0.00%

发文量