Storage device performance prediction with CART models

Performance. Pub Date: 2004-06-01. DOI: 10.1145/1005686.1005743

Mengzhi Wang, Kinman Au, A. Ailamaki, A. Brockwell, C. Faloutsos, G. Ganger


Citations: 175

Abstract




Storage device performance prediction is a key element of self-managed storage systems. The paper explores the application of a machine learning tool, CART (classification and regression trees) models, to storage device modeling. Our approach predicts a device's performance as a function of input workloads, requiring no knowledge of the device internals. We propose two uses of CART models: one that predicts per-request response times (and then derives aggregate values); one that predicts aggregate values directly from workload characteristics. After being trained on the device in question, both provide accurate black-box models across a range of test traces from real environments. Experiments show that these models predict the average and 90th percentile response time with a relative error as low as 19%, when the training workloads are similar to the testing workloads, and interpolate well across different workloads.
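The request-level use described in the abstract (predict each request's response time, then derive aggregates) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the tiny regression tree is a simplified CART-style learner written from scratch, the two features (`size_kb`, `seek`) are hypothetical stand-ins for workload characteristics, and the "device" is a synthetic formula rather than a real trace.

```python
import math
import random

def sse(ys):
    """Sum of squared errors around the mean: the impurity a regression tree minimises."""
    mean = sum(ys) / len(ys)
    return sum((y - mean) ** 2 for y in ys)

def build_tree(rows, ys, depth=0, max_depth=4, min_leaf=5):
    """Grow a regression tree with greedy binary splits of the form feature <= threshold."""
    leaf = {"value": sum(ys) / len(ys)}
    if depth >= max_depth or len(ys) < 2 * min_leaf:
        return leaf
    best = None  # (score, feature index, threshold)
    for f in range(len(rows[0])):
        for t in sorted({r[f] for r in rows}):
            left = [y for r, y in zip(rows, ys) if r[f] <= t]
            right = [y for r, y in zip(rows, ys) if r[f] > t]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            score = sse(left) + sse(right)
            if best is None or score < best[0]:
                best = (score, f, t)
    if best is None:
        return leaf
    _, f, t = best
    go_left = [r[f] <= t for r in rows]

    def subset(keep):
        sub_rows = [r for r, g in zip(rows, go_left) if g == keep]
        sub_ys = [y for y, g in zip(ys, go_left) if g == keep]
        return build_tree(sub_rows, sub_ys, depth + 1, max_depth, min_leaf)

    return {"feature": f, "threshold": t, "left": subset(True), "right": subset(False)}

def predict(tree, row):
    """Walk from the root to a leaf and return the leaf's mean response time."""
    while "value" not in tree:
        side = "left" if row[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[side]
    return tree["value"]

def percentile(values, p):
    """Nearest-rank percentile."""
    ordered = sorted(values)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]

def synthetic_trace(n, rng):
    """Hypothetical device: fixed overhead + transfer time + a penalty for long seeks."""
    rows, times = [], []
    for _ in range(n):
        size_kb = rng.choice([4, 8, 16, 32, 64])
        seek = rng.uniform(0, 1000)  # distance from the previous request, arbitrary units
        ms = 1.0 + 0.05 * size_kb + (8.0 if seek > 500 else 0.0) + rng.gauss(0, 0.2)
        rows.append((size_kb, seek))
        times.append(ms)
    return rows, times

def rel_err(predicted, actual):
    return abs(predicted - actual) / actual

rng = random.Random(0)
train_rows, train_ms = synthetic_trace(300, rng)  # "training workload"
test_rows, test_ms = synthetic_trace(300, rng)    # similar "testing workload"

# Black-box model: trained only on (request features, observed response time) pairs.
tree = build_tree(train_rows, train_ms)
pred_ms = [predict(tree, r) for r in test_rows]

# Derive aggregate values from the per-request predictions.
mean_err = rel_err(sum(pred_ms) / len(pred_ms), sum(test_ms) / len(test_ms))
p90_err = rel_err(percentile(pred_ms, 90), percentile(test_ms, 90))
print(f"relative error: mean {mean_err:.1%}, 90th percentile {p90_err:.1%}")
```

The workload-level use would reuse the same tree code with one row per trace (or trace window) holding aggregate workload characteristics, and the aggregate response time as the target. Because a regression tree predicts leaf means, per-request predictions smooth out noise, which is one reason aggregates derived from them can differ from the measured percentiles.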

