{"title":"面向底层资源分配的多容器MD仿真性能分析","authors":"Shingo Okuno, Akira Hirai, Naoto Fukumoto","doi":"10.1109/IPDPSW55747.2022.00162","DOIUrl":null,"url":null,"abstract":"This study discusses scheduling strategies to maximize ensemble throughput, which is the total throughput of multiple containers running simultaneously. Such a strategy is useful, for example, in ensemble runs of molecular dynamics (MD) simulations. To design the strategies, we need to tackle two major challenges: (1) how many containers and how many threads per container we should allocate, and (2) which low-level resources we should allocate to reflect workload characteristics. In particular, the latter challenge is important and inevitable for performance-sensitive applications because they effectively utilize low-level hardware such as simultaneous multi-threading (SMT) to maximize performance, while most container platforms do not handle the challenge. In this paper, as a preliminary experiment to implement scheduling strategies related to SMT, we examined whether ensemble throughput of MD simulations can be improved by deploying containers on separate logical cores even when they share the same physical cores. As a result, we obtained a 2.22-fold ensemble throughput compared with a one-container execution with 10 physical cores.","PeriodicalId":286968,"journal":{"name":"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","volume":"2 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Performance Analysis of Multi-Containerized MD Simulations for Low-Level Resource Allocation\",\"authors\":\"Shingo Okuno, Akira Hirai, Naoto Fukumoto\",\"doi\":\"10.1109/IPDPSW55747.2022.00162\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study discusses scheduling strategies to maximize ensemble throughput, which is the total throughput of multiple containers running simultaneously. Such a strategy is useful, for example, in ensemble runs of molecular dynamics (MD) simulations. To design the strategies, we need to tackle two major challenges: (1) how many containers and how many threads per container we should allocate, and (2) which low-level resources we should allocate to reflect workload characteristics. In particular, the latter challenge is important and inevitable for performance-sensitive applications because they effectively utilize low-level hardware such as simultaneous multi-threading (SMT) to maximize performance, while most container platforms do not handle the challenge. In this paper, as a preliminary experiment to implement scheduling strategies related to SMT, we examined whether ensemble throughput of MD simulations can be improved by deploying containers on separate logical cores even when they share the same physical cores. As a result, we obtained a 2.22-fold ensemble throughput compared with a one-container execution with 10 physical cores.\",\"PeriodicalId\":286968,\"journal\":{\"name\":\"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)\",\"volume\":\"2 2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IPDPSW55747.2022.00162\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW55747.2022.00162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance Analysis of Multi-Containerized MD Simulations for Low-Level Resource Allocation
This study discusses scheduling strategies to maximize ensemble throughput, which is the total throughput of multiple containers running simultaneously. Such a strategy is useful, for example, in ensemble runs of molecular dynamics (MD) simulations. To design the strategies, we need to tackle two major challenges: (1) how many containers and how many threads per container we should allocate, and (2) which low-level resources we should allocate to reflect workload characteristics. In particular, the latter challenge is important and inevitable for performance-sensitive applications because they effectively utilize low-level hardware such as simultaneous multi-threading (SMT) to maximize performance, while most container platforms do not handle the challenge. In this paper, as a preliminary experiment to implement scheduling strategies related to SMT, we examined whether ensemble throughput of MD simulations can be improved by deploying containers on separate logical cores even when they share the same physical cores. As a result, we obtained a 2.22-fold ensemble throughput compared with a one-container execution with 10 physical cores.