{"title":"Storage System Trace Characterization, Compression, and Synthesis using Machine Learning – An Extended Abstract","authors":"Pratik Poudel","doi":"10.1145/3573900.3593632","DOIUrl":null,"url":null,"abstract":"This study addresses the knowledge gap in request-level storage trace analysis by incorporating workload characterization, compression, and synthesis. The aim is to better understand workload behavior and provide unique workloads for storage system testing under different scenarios. Machine learning techniques like K-means clustering and PCA analysis are employed to understand trace properties and reduce manual workload selection. By generating synthetic workloads, the proposed method facilitates simulation and modeling-based studies of storage systems, especially for emerging technologies like Storage Class Memory (SCM) with limited workload availability.","PeriodicalId":246048,"journal":{"name":"Proceedings of the 2023 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation","volume":"85 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3573900.3593632","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study addresses the knowledge gap in request-level storage trace analysis by incorporating workload characterization, compression, and synthesis. The aim is to better understand workload behavior and provide unique workloads for storage system testing under different scenarios. Machine learning techniques like K-means clustering and PCA analysis are employed to understand trace properties and reduce manual workload selection. By generating synthetic workloads, the proposed method facilitates simulation and modeling-based studies of storage systems, especially for emerging technologies like Storage Class Memory (SCM) with limited workload availability.