Self-monitoring overhead of the Linux perf_ event performance counter interface

2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) Pub Date : 2015-03-29 DOI:10.1109/ISPASS.2015.7095789

Vincent M. Weaver

引用次数: 39

Abstract

Most modern CPUs include hardware performance counters: architectural registers that allow programmers to gain low-level insight into system performance. Low-overhead access to these counters is necessary for accurate performance analysis, making the operating system interface critical to providing low-latency performance data. We investigate the overhead of self-monitoring performance counter measurements on the Linux perf_event interface. We find that default code (such as that used by PAPI) implementing the perf_event self-monitoring interface can have large overhead: up to an order of magnitude larger than the previously used perfctr and perfmon2 performance counter implementations. We investigate the causes of this overhead and find that with proper coding this overhead can be greatly reduced on recent Linux kernels.

查看原文本刊更多论文

Linux perf_事件性能计数器接口的自监视开销

大多数现代cpu包括硬件性能计数器:架构寄存器，允许程序员获得对系统性能的低级洞察。对这些计数器的低开销访问对于准确的性能分析是必要的，这使得操作系统接口对于提供低延迟性能数据至关重要。我们研究了Linux perf_event接口上的自监视性能计数器测量的开销。我们发现实现perf_event自监视接口的默认代码(如PAPI使用的代码)可能会有很大的开销:比以前使用的perfctr和perfmon2性能计数器实现要大一个数量级。我们研究了这种开销的原因，发现在最新的Linux内核上，通过适当的编码可以大大减少这种开销。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

自引率

0.00%

发文量