{"title":"Cloud Storage Workload Characterization: An Approach with Time-Series Analysis","authors":"Abiola Adegboyega","doi":"10.1109/CCNC51664.2024.10454778","DOIUrl":null,"url":null,"abstract":"The cloud hosts diverse applications with different workload characteristics. Public cloud traces provide opportunities for analysis to gain insights informing autoscaling, forecasting among other operations. This paper presents the statistical analysis of a recent Alibaba cloud storage workload. The isolation & aggregation of all read/write time-series per recorded workload was done. Application of statistical methods yielded novel distributions from which forecasting solutions integrating time-varying variance captured workload burstiness. A 25% improvement in forecasting accuracy over current methods was achieved. The set of workload time-series has been made available online for further analysis by the research community.","PeriodicalId":518411,"journal":{"name":"2024 IEEE 21st Consumer Communications & Networking Conference (CCNC)","volume":"43 9","pages":"1090-1091"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 IEEE 21st Consumer Communications & Networking Conference (CCNC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCNC51664.2024.10454778","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The cloud hosts diverse applications with different workload characteristics. Public cloud traces provide opportunities for analysis to gain insights informing autoscaling, forecasting among other operations. This paper presents the statistical analysis of a recent Alibaba cloud storage workload. The isolation & aggregation of all read/write time-series per recorded workload was done. Application of statistical methods yielded novel distributions from which forecasting solutions integrating time-varying variance captured workload burstiness. A 25% improvement in forecasting accuracy over current methods was achieved. The set of workload time-series has been made available online for further analysis by the research community.