{"title":"Statistical Analysis of Latency Through Semantic Profiling","authors":"Jiamin Huang, Barzan Mozafari, T. Wenisch","doi":"10.1145/3064176.3064179","DOIUrl":null,"url":null,"abstract":"Most software profiling tools quantify average performance and rely on a program's control flow graph to organize and report results. However, in interactive server applications, performance predictability is often an equally important measure. Moreover, the end user is often concerned with the performance of a semantically defined interval of execution, such as a request or transaction, which may not directly map to any single function in the call graph, especially in high-performance applications that use asynchrony or event-based programming. It is difficult to distinguish functionality that lies on the critical path of a semantic interval from other activity (e.g., periodic logging or side operations) that may nevertheless appear prominent in a conventional profile. Existing profilers lack the ability to (i) aggregate results for a semantic interval and (ii) attribute its performance variance to individual functions. We propose a profiler called VProfiler that, given the source code of a software system and programmer annotations indicating the start and end of semantic intervals of interest, is able to identify the dominant sources of latency variance in a semantic context. Using a novel abstraction, called a variance tree, VProfiler analyzes the thread interleaving and deconstructs overall latency variance into variances and covariances of the execution time of individual functions. It then aggregates latency variance along a backwards path of dependence relationships among threads from the end of an interval to its start. We evaluate VProfiler's effectiveness on three popular open-source projects (MySQL, Postgres, and Apache Web Server). By identifying a few culprit functions in these complex code bases, VProfiler allows us to eliminate 27%--82% of the overall latency variance of these systems with a modest programming effort.","PeriodicalId":262089,"journal":{"name":"Proceedings of the Twelfth European Conference on Computer Systems","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Twelfth European Conference on Computer Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3064176.3064179","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 22
Abstract
Most software profiling tools quantify average performance and rely on a program's control flow graph to organize and report results. However, in interactive server applications, performance predictability is often an equally important measure. Moreover, the end user is often concerned with the performance of a semantically defined interval of execution, such as a request or transaction, which may not directly map to any single function in the call graph, especially in high-performance applications that use asynchrony or event-based programming. It is difficult to distinguish functionality that lies on the critical path of a semantic interval from other activity (e.g., periodic logging or side operations) that may nevertheless appear prominent in a conventional profile. Existing profilers lack the ability to (i) aggregate results for a semantic interval and (ii) attribute its performance variance to individual functions. We propose a profiler called VProfiler that, given the source code of a software system and programmer annotations indicating the start and end of semantic intervals of interest, is able to identify the dominant sources of latency variance in a semantic context. Using a novel abstraction, called a variance tree, VProfiler analyzes the thread interleaving and deconstructs overall latency variance into variances and covariances of the execution time of individual functions. It then aggregates latency variance along a backwards path of dependence relationships among threads from the end of an interval to its start. We evaluate VProfiler's effectiveness on three popular open-source projects (MySQL, Postgres, and Apache Web Server). By identifying a few culprit functions in these complex code bases, VProfiler allows us to eliminate 27%--82% of the overall latency variance of these systems with a modest programming effort.
大多数软件分析工具量化平均性能,并依赖于程序的控制流图来组织和报告结果。然而,在交互式服务器应用程序中,性能可预测性通常是同样重要的度量。此外,终端用户经常关心语义定义的执行间隔的性能,例如请求或事务,它们可能不会直接映射到调用图中的任何单个函数,特别是在使用异步或基于事件编程的高性能应用程序中。很难将位于语义间隔关键路径上的功能与其他活动(例如,周期性日志记录或侧操作)区分开来,这些活动可能在常规剖面中显得突出。现有的分析器缺乏以下能力:(i)对语义间隔的结果进行聚合,(ii)将其性能差异归因于单个函数。我们提出了一个名为VProfiler的分析器,给定软件系统的源代码和指示感兴趣的语义间隔的开始和结束的程序员注释,它能够识别语义上下文中延迟变化的主要来源。使用一种新颖的抽象,称为方差树,VProfiler分析线程交错,并将总体延迟方差分解为单个函数执行时间的方差和协方差。然后,它沿着线程之间从间隔结束到开始的依赖关系的反向路径聚合延迟方差。我们在三个流行的开源项目(MySQL、Postgres和Apache Web Server)上评估了VProfiler的有效性。通过识别这些复杂代码库中的几个罪魁祸首函数,VProfiler允许我们通过适度的编程工作消除这些系统的27%- 82%的总延迟方差。