仪器飞行:表征NCAR MSS-III工作负荷

Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems Pub Date : 1994-06-12 DOI:10.1109/MASS.1994.373030

J. L. Sloan

{"title":"仪器飞行:表征NCAR MSS-III工作负荷","authors":"J. L. Sloan","doi":"10.1109/MASS.1994.373030","DOIUrl":null,"url":null,"abstract":"The NCAR Mass Storage System, MSS-III, generates 10 megabytes a day of transaction log, containing information about its workload. Traditional metrics, such as the average amount of data stored and retrieved per hour, are useful but omit information regarding temporality, locality, and burstiness. This information is critical to characterizing and understanding the MSS workload. NCAR has begun to use metrics usually applied to virtual memories, hardware caches, and network traffic to analyze the MSS-III transaction logs. Current MSS-III workload characterization falls into three broad categories: parametric statistics (for example, mean and variance for various file and data metrics), trace-driven analysis (for example, working set size), and trace-driven simulation (for example, compulsory and capacity cache miss ratios). Results from all of these methods are presented. Graphs of MSS-III transactions across a range of time scales show a self-similarity or \"fractal burstiness\", typical of network traffic. This suggests that measurements of self-similarity (for example, the Hurst parameter) may be useful. Also, the lack of normal distribution suggests that application of nonparametric statistics might be fruitful.<<ETX>>","PeriodicalId":436281,"journal":{"name":"Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems","volume":"79 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Flying with instruments: characterizing the NCAR MSS-III workload\",\"authors\":\"J. L. Sloan\",\"doi\":\"10.1109/MASS.1994.373030\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The NCAR Mass Storage System, MSS-III, generates 10 megabytes a day of transaction log, containing information about its workload. Traditional metrics, such as the average amount of data stored and retrieved per hour, are useful but omit information regarding temporality, locality, and burstiness. This information is critical to characterizing and understanding the MSS workload. NCAR has begun to use metrics usually applied to virtual memories, hardware caches, and network traffic to analyze the MSS-III transaction logs. Current MSS-III workload characterization falls into three broad categories: parametric statistics (for example, mean and variance for various file and data metrics), trace-driven analysis (for example, working set size), and trace-driven simulation (for example, compulsory and capacity cache miss ratios). Results from all of these methods are presented. Graphs of MSS-III transactions across a range of time scales show a self-similarity or \\\"fractal burstiness\\\", typical of network traffic. This suggests that measurements of self-similarity (for example, the Hurst parameter) may be useful. Also, the lack of normal distribution suggests that application of nonparametric statistics might be fruitful.<<ETX>>\",\"PeriodicalId\":436281,\"journal\":{\"name\":\"Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems\",\"volume\":\"79 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1994-06-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MASS.1994.373030\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MASS.1994.373030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

NCAR大容量存储系统MSS-III每天生成10兆字节的事务日志，其中包含有关其工作负载的信息。传统的度量，比如每小时存储和检索的平均数据量，是有用的，但是忽略了关于时间性、局域性和突发性的信息。这些信息对于描述和理解MSS工作负载至关重要。NCAR已经开始使用通常应用于虚拟内存、硬件缓存和网络流量的指标来分析MSS-III事务日志。当前的MSS-III工作负载表征分为三大类:参数统计(例如，各种文件和数据度量的平均值和方差)、跟踪驱动的分析(例如，工作集大小)和跟踪驱动的模拟(例如，强制和容量缓存缺失率)。给出了所有这些方法的结果。跨时间尺度范围的MSS-III交易图显示了自相似性或“分形突发性”，这是网络流量的典型特征。这表明自相似性的测量(例如，Hurst参数)可能是有用的。此外，缺乏正态分布表明非参数统计的应用可能是富有成效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Flying with instruments: characterizing the NCAR MSS-III workload

The NCAR Mass Storage System, MSS-III, generates 10 megabytes a day of transaction log, containing information about its workload. Traditional metrics, such as the average amount of data stored and retrieved per hour, are useful but omit information regarding temporality, locality, and burstiness. This information is critical to characterizing and understanding the MSS workload. NCAR has begun to use metrics usually applied to virtual memories, hardware caches, and network traffic to analyze the MSS-III transaction logs. Current MSS-III workload characterization falls into three broad categories: parametric statistics (for example, mean and variance for various file and data metrics), trace-driven analysis (for example, working set size), and trace-driven simulation (for example, compulsory and capacity cache miss ratios). Results from all of these methods are presented. Graphs of MSS-III transactions across a range of time scales show a self-similarity or "fractal burstiness", typical of network traffic. This suggests that measurements of self-similarity (for example, the Hurst parameter) may be useful. Also, the lack of normal distribution suggests that application of nonparametric statistics might be fruitful.<>

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems

自引率

0.00%

发文量