Zekun Sun, John Panneerselvam, Lu Liu, Yao Lu, Wan Tang
{"title":"大规模数据中心跟踪日志的单元间资源使用分析","authors":"Zekun Sun, John Panneerselvam, Lu Liu, Yao Lu, Wan Tang","doi":"10.1109/UCC56403.2022.00054","DOIUrl":null,"url":null,"abstract":"In recent years, modern cluster management systems are facing large volumes of workloads with complex queries. Smart processing of such workloads is essential, which requires a comprehensive understanding of large-scare data centres and the way workloads are processed in them. Understanding of workload and datacentre characteristics are also expected to contribute to the development of prediction models. The recent publication of the 2019 Google trace logs is expected to provide useful insights to researchers of energy efficient datacentres. However, insights from this trace log still remains largely uncovered. This paper presents a comprehensive analysis on the distribution of machine resource utilisation across various cells encompassed in the 2019 Google cluster trace log, uncovering various essential workload behavioural insights such as execution duration, workload task composition, CPU/memory utilisation etc. along with a longitudinal comparative analysis with two other large datasets, namely the 2018 Alibaba and 2011 Google trace logs.","PeriodicalId":203244,"journal":{"name":"2022 IEEE/ACM 15th International Conference on Utility and Cloud Computing (UCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An inter-cell resource usage analysis of large-scale datacentre trace logs\",\"authors\":\"Zekun Sun, John Panneerselvam, Lu Liu, Yao Lu, Wan Tang\",\"doi\":\"10.1109/UCC56403.2022.00054\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, modern cluster management systems are facing large volumes of workloads with complex queries. Smart processing of such workloads is essential, which requires a comprehensive understanding of large-scare data centres and the way workloads are processed in them. Understanding of workload and datacentre characteristics are also expected to contribute to the development of prediction models. The recent publication of the 2019 Google trace logs is expected to provide useful insights to researchers of energy efficient datacentres. However, insights from this trace log still remains largely uncovered. This paper presents a comprehensive analysis on the distribution of machine resource utilisation across various cells encompassed in the 2019 Google cluster trace log, uncovering various essential workload behavioural insights such as execution duration, workload task composition, CPU/memory utilisation etc. along with a longitudinal comparative analysis with two other large datasets, namely the 2018 Alibaba and 2011 Google trace logs.\",\"PeriodicalId\":203244,\"journal\":{\"name\":\"2022 IEEE/ACM 15th International Conference on Utility and Cloud Computing (UCC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE/ACM 15th International Conference on Utility and Cloud Computing (UCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/UCC56403.2022.00054\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM 15th International Conference on Utility and Cloud Computing (UCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UCC56403.2022.00054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An inter-cell resource usage analysis of large-scale datacentre trace logs
In recent years, modern cluster management systems are facing large volumes of workloads with complex queries. Smart processing of such workloads is essential, which requires a comprehensive understanding of large-scare data centres and the way workloads are processed in them. Understanding of workload and datacentre characteristics are also expected to contribute to the development of prediction models. The recent publication of the 2019 Google trace logs is expected to provide useful insights to researchers of energy efficient datacentres. However, insights from this trace log still remains largely uncovered. This paper presents a comprehensive analysis on the distribution of machine resource utilisation across various cells encompassed in the 2019 Google cluster trace log, uncovering various essential workload behavioural insights such as execution duration, workload task composition, CPU/memory utilisation etc. along with a longitudinal comparative analysis with two other large datasets, namely the 2018 Alibaba and 2011 Google trace logs.