Host Hypervisor Trace Mining for Virtual Machine Workload Characterization

2019 IEEE International Conference on Cloud Engineering (IC2E) Pub Date : 2019-06-01 DOI:10.1109/IC2E.2019.00024

Hani Nemati, S. V. Azhari, M. Dagenais

{"title":"Host Hypervisor Trace Mining for Virtual Machine Workload Characterization","authors":"Hani Nemati, S. V. Azhari, M. Dagenais","doi":"10.1109/IC2E.2019.00024","DOIUrl":null,"url":null,"abstract":"The efficient operation and resource management of multi-tenant data centers hosting thousands of services is a demanding task, that requires precise and detailed information regarding the behaviour of each and every virtual machine (VM). Often, coarse measures such as CPU, memory, disk and network usage by VMs are considered in grouping them onto the same physical server, as detailed measures would require access to the guest operating system (OS), which is not feasible in a multi-tenant setting. In this paper, we propose host-level hypervisor tracing as a non-intrusive means to extract useful features, that can provide for fine grain characterization of VM behaviour. In particular, we extract VM blocking periods as well as virtual interrupt injection rates to detect multiple levels of resource intensiveness. In addition, we consider the resource contention rate due to other VMs and the host, along with reasons for exit from non-root to root privileged mode, revealing useful information about the nature of the underlying VM workload. We also use tracing to get information about the rate of process and thread preemption in each VM, extracting process and thread contention as another feature set. We then employ various feature selection strategies and assess the quality of the resulting workload clustering. Notably, we adopt a two-stage feature selection approach in addition to a one shot clustering scheme. Moreover, we consider inter-cluster and intra-cluster similarity metrics, such as the silhouette score, to discover distinct groups of workloads as well as workload groups with significant overlap. This information can be used by 1) data center administrators to gain deeper visibility into the nature of various VMs running on their infrastructure, 2) performance engineers to assist root cause analysis of VM issues and 3) IaaS providers to help in resource management based on VM behavior.","PeriodicalId":226094,"journal":{"name":"2019 IEEE International Conference on Cloud Engineering (IC2E)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference on Cloud Engineering (IC2E)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC2E.2019.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

The efficient operation and resource management of multi-tenant data centers hosting thousands of services is a demanding task, that requires precise and detailed information regarding the behaviour of each and every virtual machine (VM). Often, coarse measures such as CPU, memory, disk and network usage by VMs are considered in grouping them onto the same physical server, as detailed measures would require access to the guest operating system (OS), which is not feasible in a multi-tenant setting. In this paper, we propose host-level hypervisor tracing as a non-intrusive means to extract useful features, that can provide for fine grain characterization of VM behaviour. In particular, we extract VM blocking periods as well as virtual interrupt injection rates to detect multiple levels of resource intensiveness. In addition, we consider the resource contention rate due to other VMs and the host, along with reasons for exit from non-root to root privileged mode, revealing useful information about the nature of the underlying VM workload. We also use tracing to get information about the rate of process and thread preemption in each VM, extracting process and thread contention as another feature set. We then employ various feature selection strategies and assess the quality of the resulting workload clustering. Notably, we adopt a two-stage feature selection approach in addition to a one shot clustering scheme. Moreover, we consider inter-cluster and intra-cluster similarity metrics, such as the silhouette score, to discover distinct groups of workloads as well as workload groups with significant overlap. This information can be used by 1) data center administrators to gain deeper visibility into the nature of various VMs running on their infrastructure, 2) performance engineers to assist root cause analysis of VM issues and 3) IaaS providers to help in resource management based on VM behavior.

查看原文本刊更多论文

主机管理程序跟踪挖掘虚拟机工作负载表征

承载数千个服务的多租户数据中心的高效运行和资源管理是一项要求很高的任务，这需要关于每个虚拟机(VM)行为的精确和详细的信息。通常，在将虚拟机分组到同一物理服务器时，会考虑诸如CPU、内存、磁盘和网络使用等粗略度量，因为详细度量需要访问客户机操作系统(OS)，这在多租户设置中是不可实现的。在本文中，我们提出主机级管理程序跟踪作为一种非侵入性的方法来提取有用的特征，可以提供VM行为的细粒度表征。特别是，我们提取虚拟机阻塞周期以及虚拟中断注入率来检测多个级别的资源密集度。此外，我们还考虑了由于其他VM和主机引起的资源争用率，以及从非根特权模式退出到根特权模式的原因，从而揭示了有关底层VM工作负载性质的有用信息。我们还使用跟踪来获取关于每个VM中进程和线程抢占率的信息，提取进程和线程争用作为另一个特性集。然后，我们采用各种特征选择策略并评估结果工作负载聚类的质量。值得注意的是，除了单次聚类方案外，我们还采用了两阶段特征选择方法。此外，我们考虑集群间和集群内的相似性指标，如轮廓评分，以发现不同的工作负载组以及具有显著重叠的工作负载组。这些信息可以被以下人员使用:1)数据中心管理员更深入地了解在其基础设施上运行的各种VM的性质;2)性能工程师协助VM问题的根本原因分析;3)IaaS提供商帮助基于VM行为进行资源管理。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE International Conference on Cloud Engineering (IC2E)

自引率

0.00%

发文量