{"title":"Server-Side Workload Identification for HPC I/O Requests","authors":"Lu Pang, K. Kant","doi":"10.1145/3526063.3535350","DOIUrl":null,"url":null,"abstract":"In this paper, we develop a method to identify High Performance Computing (HPC) workloads from a stream of incoming I/O requests. This characterization of workloads could then be used to intelligently schedule the I/O requests in the parallel file system (PFS) that most HPC systems use. We use a deep learning model for this purpose that is designed to pick up changes in the workload as they occur. We show that our method accurately determines the workload characteristics when evaluated on publicly available server-side HPC traces. We also show that the I/O scheduling based on such a characterization can substantially increase the available I/O bandwidth and thus reduce the latencies for the HPC workloads.","PeriodicalId":244248,"journal":{"name":"Proceedings of the 2nd Workshop on Performance EngineeRing, Modelling, Analysis, and VisualizatiOn Strategy","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd Workshop on Performance EngineeRing, Modelling, Analysis, and VisualizatiOn Strategy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3526063.3535350","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we develop a method to identify High Performance Computing (HPC) workloads from a stream of incoming I/O requests. This characterization of workloads could then be used to intelligently schedule the I/O requests in the parallel file system (PFS) that most HPC systems use. We use a deep learning model for this purpose that is designed to pick up changes in the workload as they occur. We show that our method accurately determines the workload characteristics when evaluated on publicly available server-side HPC traces. We also show that the I/O scheduling based on such a characterization can substantially increase the available I/O bandwidth and thus reduce the latencies for the HPC workloads.