集群性能及其对分布式异构网格性能的影响

Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556) Pub Date : 2000-05-01 DOI:10.1109/HCW.2000.843749

Craig A. Lee, C. DeMatteis, J. Stepanek, John Wang

{"title":"集群性能及其对分布式异构网格性能的影响","authors":"Craig A. Lee, C. DeMatteis, J. Stepanek, John Wang","doi":"10.1109/HCW.2000.843749","DOIUrl":null,"url":null,"abstract":"Examines the issues surrounding efficient execution in heterogeneous grid environments. The performances of a Linux cluster and a parallel supercomputer are initially compared using both benchmarks and an application. With an understanding of how benchmark and application performance is affected by processor and interconnect speed, a comparison is made with the bandwidth and latencies available in a tested grid. Of significant concern is the fact that the available communication bandwidth and latencies have a dynamic range of 3 to 4 orders of magnitude, while processor speeds have a range of about one-half order of magnitude. Also, while both processor speed and network bandwidth are increasing very rapidly, simple propagation delay will become more significant in the network latencies seen by many grid applications. That is to say, the pipes in a grid will be getting fatter but not commensurately shorter. How are we to effectively utilize such an infrastructure? Clearly, an attractive approach is to require sufficient concurrency in the application such that a coarse-grain, data-driven model of execution can be used to hide latencies while hopefully keeping context-switching overheads low. If the \"spatial component\" of an application is understood, then runtime systems could also apply established techniques like caching, compression, estimation and speculative pre-fetching. Ideally, this low-level performance management should be encapsulated in an easy-to-use abstraction.","PeriodicalId":351836,"journal":{"name":"Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556)","volume":"242 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Cluster performance and the implications for distributed, heterogeneous grid performance\",\"authors\":\"Craig A. Lee, C. DeMatteis, J. Stepanek, John Wang\",\"doi\":\"10.1109/HCW.2000.843749\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Examines the issues surrounding efficient execution in heterogeneous grid environments. The performances of a Linux cluster and a parallel supercomputer are initially compared using both benchmarks and an application. With an understanding of how benchmark and application performance is affected by processor and interconnect speed, a comparison is made with the bandwidth and latencies available in a tested grid. Of significant concern is the fact that the available communication bandwidth and latencies have a dynamic range of 3 to 4 orders of magnitude, while processor speeds have a range of about one-half order of magnitude. Also, while both processor speed and network bandwidth are increasing very rapidly, simple propagation delay will become more significant in the network latencies seen by many grid applications. That is to say, the pipes in a grid will be getting fatter but not commensurately shorter. How are we to effectively utilize such an infrastructure? Clearly, an attractive approach is to require sufficient concurrency in the application such that a coarse-grain, data-driven model of execution can be used to hide latencies while hopefully keeping context-switching overheads low. If the \\\"spatial component\\\" of an application is understood, then runtime systems could also apply established techniques like caching, compression, estimation and speculative pre-fetching. Ideally, this low-level performance management should be encapsulated in an easy-to-use abstraction.\",\"PeriodicalId\":351836,\"journal\":{\"name\":\"Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556)\",\"volume\":\"242 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HCW.2000.843749\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HCW.2000.843749","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

摘要

研究围绕异构网格环境中高效执行的问题。首先使用基准测试和应用程序对Linux集群和并行超级计算机的性能进行比较。了解了处理器和互连速度如何影响基准测试和应用程序性能后，我们将与测试网格中的可用带宽和延迟进行比较。值得关注的事实是，可用的通信带宽和延迟具有3到4个数量级的动态范围，而处理器速度的范围约为一个半数量级。此外，虽然处理器速度和网络带宽都在快速增长，但在许多网格应用程序所看到的网络延迟中，简单的传播延迟将变得更加重要。也就是说，网格中的管道将变得越来越粗，但不会相应地变短。我们如何有效地利用这样的基础设施?显然，一种有吸引力的方法是在应用程序中要求足够的并发性，这样可以使用粗粒度、数据驱动的执行模型来隐藏延迟，同时希望保持较低的上下文切换开销。如果理解了应用程序的“空间组件”，那么运行时系统也可以应用现有的技术，如缓存、压缩、估计和推测预取。理想情况下，这种低级性能管理应该封装在易于使用的抽象中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Cluster performance and the implications for distributed, heterogeneous grid performance

Examines the issues surrounding efficient execution in heterogeneous grid environments. The performances of a Linux cluster and a parallel supercomputer are initially compared using both benchmarks and an application. With an understanding of how benchmark and application performance is affected by processor and interconnect speed, a comparison is made with the bandwidth and latencies available in a tested grid. Of significant concern is the fact that the available communication bandwidth and latencies have a dynamic range of 3 to 4 orders of magnitude, while processor speeds have a range of about one-half order of magnitude. Also, while both processor speed and network bandwidth are increasing very rapidly, simple propagation delay will become more significant in the network latencies seen by many grid applications. That is to say, the pipes in a grid will be getting fatter but not commensurately shorter. How are we to effectively utilize such an infrastructure? Clearly, an attractive approach is to require sufficient concurrency in the application such that a coarse-grain, data-driven model of execution can be used to hide latencies while hopefully keeping context-switching overheads low. If the "spatial component" of an application is understood, then runtime systems could also apply established techniques like caching, compression, estimation and speculative pre-fetching. Ideally, this low-level performance management should be encapsulated in an easy-to-use abstraction.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556)

自引率

0.00%

发文量