{"title":"存储系统问题","authors":"J. Shiers","doi":"10.1109/MASS.1994.373029","DOIUrl":null,"url":null,"abstract":"The NCAR Mass Storage System, MSS-III, generates I O megabytes a day of transaction log containing information about its workload. Traditional metrics, such as the average amount of data stored and retrieved per hour, are useful but omit information regarding temporality, locality, and burstiness. This infomation is critical to characterizing and understanding the MSS workload. NCAR has begun to use metrics usually applied to virtual memories, hardware caches, and network trafic to analyze the MSS-III transaction logs. Current MSS-III workload characterization falls into three broad categories: parametric statistics (for example, mean and variance for various file and data metrics), trace-driven analysis (for example, working set size), and trace-driven simulation (for example, compulsory and capacity cache miss ratios). Resultsfrom all of these methods are presented. Graphs of MSS-III transactions across a range of time scales show a self-similarity or ‘Ifractal burstiness” typical of network trafic. This suggests that measurements of sevsimilarity (for example, the Hurst parameter) may be useful. Also, the lack of normal distribution suggests that application of nonparametric statistics might be fncitful.","PeriodicalId":436281,"journal":{"name":"Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems","volume":"95 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Storage System Issues\",\"authors\":\"J. Shiers\",\"doi\":\"10.1109/MASS.1994.373029\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The NCAR Mass Storage System, MSS-III, generates I O megabytes a day of transaction log containing information about its workload. Traditional metrics, such as the average amount of data stored and retrieved per hour, are useful but omit information regarding temporality, locality, and burstiness. This infomation is critical to characterizing and understanding the MSS workload. NCAR has begun to use metrics usually applied to virtual memories, hardware caches, and network trafic to analyze the MSS-III transaction logs. Current MSS-III workload characterization falls into three broad categories: parametric statistics (for example, mean and variance for various file and data metrics), trace-driven analysis (for example, working set size), and trace-driven simulation (for example, compulsory and capacity cache miss ratios). Resultsfrom all of these methods are presented. Graphs of MSS-III transactions across a range of time scales show a self-similarity or ‘Ifractal burstiness” typical of network trafic. This suggests that measurements of sevsimilarity (for example, the Hurst parameter) may be useful. Also, the lack of normal distribution suggests that application of nonparametric statistics might be fncitful.\",\"PeriodicalId\":436281,\"journal\":{\"name\":\"Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems\",\"volume\":\"95 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1994-06-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MASS.1994.373029\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Thirteenth IEEE Symposium on Mass Storage Systems. Toward Distributed Storage and Data Management Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MASS.1994.373029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The NCAR Mass Storage System, MSS-III, generates I O megabytes a day of transaction log containing information about its workload. Traditional metrics, such as the average amount of data stored and retrieved per hour, are useful but omit information regarding temporality, locality, and burstiness. This infomation is critical to characterizing and understanding the MSS workload. NCAR has begun to use metrics usually applied to virtual memories, hardware caches, and network trafic to analyze the MSS-III transaction logs. Current MSS-III workload characterization falls into three broad categories: parametric statistics (for example, mean and variance for various file and data metrics), trace-driven analysis (for example, working set size), and trace-driven simulation (for example, compulsory and capacity cache miss ratios). Resultsfrom all of these methods are presented. Graphs of MSS-III transactions across a range of time scales show a self-similarity or ‘Ifractal burstiness” typical of network trafic. This suggests that measurements of sevsimilarity (for example, the Hurst parameter) may be useful. Also, the lack of normal distribution suggests that application of nonparametric statistics might be fncitful.