面向底层资源分配的多容器MD仿真性能分析

2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) Pub Date : 2022-05-01 DOI:10.1109/IPDPSW55747.2022.00162

Shingo Okuno, Akira Hirai, Naoto Fukumoto

{"title":"面向底层资源分配的多容器MD仿真性能分析","authors":"Shingo Okuno, Akira Hirai, Naoto Fukumoto","doi":"10.1109/IPDPSW55747.2022.00162","DOIUrl":null,"url":null,"abstract":"This study discusses scheduling strategies to maximize ensemble throughput, which is the total throughput of multiple containers running simultaneously. Such a strategy is useful, for example, in ensemble runs of molecular dynamics (MD) simulations. To design the strategies, we need to tackle two major challenges: (1) how many containers and how many threads per container we should allocate, and (2) which low-level resources we should allocate to reflect workload characteristics. In particular, the latter challenge is important and inevitable for performance-sensitive applications because they effectively utilize low-level hardware such as simultaneous multi-threading (SMT) to maximize performance, while most container platforms do not handle the challenge. In this paper, as a preliminary experiment to implement scheduling strategies related to SMT, we examined whether ensemble throughput of MD simulations can be improved by deploying containers on separate logical cores even when they share the same physical cores. As a result, we obtained a 2.22-fold ensemble throughput compared with a one-container execution with 10 physical cores.","PeriodicalId":286968,"journal":{"name":"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","volume":"2 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Performance Analysis of Multi-Containerized MD Simulations for Low-Level Resource Allocation\",\"authors\":\"Shingo Okuno, Akira Hirai, Naoto Fukumoto\",\"doi\":\"10.1109/IPDPSW55747.2022.00162\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study discusses scheduling strategies to maximize ensemble throughput, which is the total throughput of multiple containers running simultaneously. Such a strategy is useful, for example, in ensemble runs of molecular dynamics (MD) simulations. To design the strategies, we need to tackle two major challenges: (1) how many containers and how many threads per container we should allocate, and (2) which low-level resources we should allocate to reflect workload characteristics. In particular, the latter challenge is important and inevitable for performance-sensitive applications because they effectively utilize low-level hardware such as simultaneous multi-threading (SMT) to maximize performance, while most container platforms do not handle the challenge. In this paper, as a preliminary experiment to implement scheduling strategies related to SMT, we examined whether ensemble throughput of MD simulations can be improved by deploying containers on separate logical cores even when they share the same physical cores. As a result, we obtained a 2.22-fold ensemble throughput compared with a one-container execution with 10 physical cores.\",\"PeriodicalId\":286968,\"journal\":{\"name\":\"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)\",\"volume\":\"2 2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IPDPSW55747.2022.00162\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW55747.2022.00162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

本文讨论了最大化集成吞吐量的调度策略，集成吞吐量是指多个容器同时运行时的总吞吐量。这样的策略是有用的，例如，在分子动力学(MD)模拟的集合运行。为了设计策略，我们需要解决两个主要挑战:(1)我们应该分配多少容器和每个容器分配多少线程，以及(2)我们应该分配哪些低级资源来反映工作负载特征。特别是，后一种挑战对于性能敏感的应用程序来说是重要且不可避免的，因为它们有效地利用低级硬件(如同步多线程(SMT))来最大化性能，而大多数容器平台无法处理这种挑战。在本文中，作为实现与SMT相关的调度策略的初步实验，我们检查了是否可以通过在单独的逻辑内核上部署容器来提高MD模拟的集成吞吐量，即使它们共享相同的物理内核。因此，与使用10个物理内核的单容器执行相比，我们获得了2.22倍的集成吞吐量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Performance Analysis of Multi-Containerized MD Simulations for Low-Level Resource Allocation

This study discusses scheduling strategies to maximize ensemble throughput, which is the total throughput of multiple containers running simultaneously. Such a strategy is useful, for example, in ensemble runs of molecular dynamics (MD) simulations. To design the strategies, we need to tackle two major challenges: (1) how many containers and how many threads per container we should allocate, and (2) which low-level resources we should allocate to reflect workload characteristics. In particular, the latter challenge is important and inevitable for performance-sensitive applications because they effectively utilize low-level hardware such as simultaneous multi-threading (SMT) to maximize performance, while most container platforms do not handle the challenge. In this paper, as a preliminary experiment to implement scheduling strategies related to SMT, we examined whether ensemble throughput of MD simulations can be improved by deploying containers on separate logical cores even when they share the same physical cores. As a result, we obtained a 2.22-fold ensemble throughput compared with a one-container execution with 10 physical cores.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

自引率

0.00%

发文量