Automated Analysis of Load Test Results of Systems with Equilibrium or Transient Behavior: Invited Talk

Companion Publication for ACM/SPEC on International Conference on Performance Engineering Pub Date : 2016-03-12 DOI:10.1145/2859889.2880447

A. Bondi

{"title":"Automated Analysis of Load Test Results of Systems with Equilibrium or Transient Behavior: Invited Talk","authors":"A. Bondi","doi":"10.1145/2859889.2880447","DOIUrl":null,"url":null,"abstract":"Performance test data should be analyzed to determine if performance requirements are being met, to see if they reveal opportunities for performance improvement, and to see if they show signs of lurking performance issues or malfunctions. Automated analyses of the measurements can be useful when the number of resource usage measures and performance measures is large, when the number of nodes under test is large, or when the number of test cases is large. We shall examine this for cases in which the system under test is subjected to a constant load, as might be the case for an online transaction processing system, and for the case where the load is inherently bursty, as would be the case for an alarm or monitoring system that is receiving streams of notifications from many sensors at once. We expect a well behaved system under constant load to reach steady state shortly after the test load begins, and to ramp down to its previous state once the load is withdrawn. This corresponds to a system reaching equilibrium in stochastic terms. We also expect a well behaved system to return to steady state after a surge of traffic abates. Failure to achieve equilibrium under constant load is a sign of a problem that should be investigated. Automated analysis of voluminous test data facilitates the identification of intervals of steady operation and cases when steady operation has not occurred during performance tests. An embedded control or monitoring system, such as a building security system, might be subject to a sustained burst of message traffic in an emergency situation. Fire alarm systems might have to respond to at least one of these messages within seconds of the onset of the burst by triggering bells and sirens, closing doors, and automatically alerting emergency services. They might consist of only a few hosts. An automated tool could use statistical methods to identify phases of execution during an emergency by noting when each thread's processor consumption changes. This facilitates the identification of areas for performance improvement, especially if the system is implemented with so many processes or threads that visual identification of heavy consumers of processing power is difficult. We shall elaborate on these issues in this talk, and also discuss methods for performing automated analyses of load tests of systems whose loads are expected to be steady and of systems whose loads are expected to be intense for a transient period.","PeriodicalId":265808,"journal":{"name":"Companion Publication for ACM/SPEC on International Conference on Performance Engineering","volume":"147 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion Publication for ACM/SPEC on International Conference on Performance Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2859889.2880447","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Performance test data should be analyzed to determine if performance requirements are being met, to see if they reveal opportunities for performance improvement, and to see if they show signs of lurking performance issues or malfunctions. Automated analyses of the measurements can be useful when the number of resource usage measures and performance measures is large, when the number of nodes under test is large, or when the number of test cases is large. We shall examine this for cases in which the system under test is subjected to a constant load, as might be the case for an online transaction processing system, and for the case where the load is inherently bursty, as would be the case for an alarm or monitoring system that is receiving streams of notifications from many sensors at once. We expect a well behaved system under constant load to reach steady state shortly after the test load begins, and to ramp down to its previous state once the load is withdrawn. This corresponds to a system reaching equilibrium in stochastic terms. We also expect a well behaved system to return to steady state after a surge of traffic abates. Failure to achieve equilibrium under constant load is a sign of a problem that should be investigated. Automated analysis of voluminous test data facilitates the identification of intervals of steady operation and cases when steady operation has not occurred during performance tests. An embedded control or monitoring system, such as a building security system, might be subject to a sustained burst of message traffic in an emergency situation. Fire alarm systems might have to respond to at least one of these messages within seconds of the onset of the burst by triggering bells and sirens, closing doors, and automatically alerting emergency services. They might consist of only a few hosts. An automated tool could use statistical methods to identify phases of execution during an emergency by noting when each thread's processor consumption changes. This facilitates the identification of areas for performance improvement, especially if the system is implemented with so many processes or threads that visual identification of heavy consumers of processing power is difficult. We shall elaborate on these issues in this talk, and also discuss methods for performing automated analyses of load tests of systems whose loads are expected to be steady and of systems whose loads are expected to be intense for a transient period.

查看原文本刊更多论文

具有平衡或暂态行为的系统负载测试结果的自动分析:特邀演讲

应该分析性能测试数据，以确定是否满足性能需求，查看它们是否揭示了性能改进的机会，并查看它们是否显示了潜在性能问题或故障的迹象。当资源使用度量和性能度量的数量很大时，当被测节点的数量很大时，或者当测试用例的数量很大时，度量的自动化分析是有用的。我们将在被测系统承受恒定负载的情况下检查这一点，例如在线事务处理系统的情况，以及负载本身是突发的情况，例如同时接收来自许多传感器的通知流的报警或监控系统的情况。我们期望在恒定负载下性能良好的系统在测试负载开始后不久达到稳定状态，并且一旦负载被撤回就会下降到以前的状态。这对应于一个系统在随机条件下达到平衡。我们也期望一个良好的系统在流量激增后恢复到稳定状态。在恒定负荷下不能达到平衡是一个问题的标志，应该进行调查。大量测试数据的自动分析有助于识别稳定运行的间隔和在性能测试期间未发生稳定运行的情况。嵌入式控制或监控系统，例如建筑物安全系统，在紧急情况下可能会受到持续突发信息流量的影响。火灾报警系统可能必须在爆炸开始的几秒钟内对这些信息中的至少一个作出反应，触发警铃和警报器，关闭门，并自动通知紧急服务。它们可能只由几个宿主组成。自动化工具可以使用统计方法，通过记录每个线程的处理器消耗发生变化的时间，来识别紧急情况下的执行阶段。这有助于识别性能改进的领域，特别是如果系统是由如此多的进程或线程实现的，以至于很难直观地识别处理能力的大量消耗者。我们将在本次演讲中详细阐述这些问题，并讨论对预期负载稳定的系统和预期负载在短暂时期内强度较大的系统的负载测试进行自动化分析的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Companion Publication for ACM/SPEC on International Conference on Performance Engineering

自引率

0.00%

发文量