使用科学和工程应用程序对“Nehalem”集群进行早期性能评估

S. Saini, Andrey Naraikin, R. Biswas, D. Barkai, T. Sandstrom
{"title":"使用科学和工程应用程序对“Nehalem”集群进行早期性能评估","authors":"S. Saini, Andrey Naraikin, R. Biswas, D. Barkai, T. Sandstrom","doi":"10.1145/1654059.1654084","DOIUrl":null,"url":null,"abstract":"In this paper, we present an early performance evaluation of a 624-core cluster based on the Intel<sup>®</sup> Xeon<sup>®</sup> Processor 5560 (code named \"Nehalem-EP\", and referred to as Xeon 5560 in this paper)---the third-generation quad-core architecture from Intel. This is the first processor from Intel with a non-uniform memory access (NUMA) architecture managed by on-chip integrated memory controller. It employs a point-to-point interconnect called the Intel<sup>®</sup> QuickPath Interconnect (QPI) between processors and to the input/output (I/O) hub. It also introduces to a quad-core architecture both Intel's hyper-threading technology (or simultaneous multi-threading, \"SMT\") and Intel<sup>®</sup> Turbo Boost Technology (\"Turbo mode\") that automatically allow processor cores to run faster than the base operating frequency if the processor is operating below rated power, temperature, and current specification limits. It can be engaged with any number of cores or logical processors enabled and active. We critically evaluate these features using the High Performance Computing Challenge (HPCC) benchmarks, NAS Parallel Benchmarks (NPB), and four full-scale scientific applications. We compare and contrast the results of a cluster based on the Xeon 5560 with an SGI<sup>®</sup> Altix<sup>®</sup> ICE 8200EX cluster of quad-core Intel<sup>®</sup> Xeon<sup>®</sup> 5472 Processor (\"Xeon 5472\" from here on) and another cluster of Intel<sup>®</sup> Xeon<sup>®</sup> 5462 Processor (\"Xeon 5462\"; the Xeon 5400 Series Processors are previous generation quad-core Intel processors and were code named Harpertown).","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":"{\"title\":\"Early performance evaluation of a \\\"Nehalem\\\" cluster using scientific and engineering applications\",\"authors\":\"S. Saini, Andrey Naraikin, R. Biswas, D. Barkai, T. Sandstrom\",\"doi\":\"10.1145/1654059.1654084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present an early performance evaluation of a 624-core cluster based on the Intel<sup>®</sup> Xeon<sup>®</sup> Processor 5560 (code named \\\"Nehalem-EP\\\", and referred to as Xeon 5560 in this paper)---the third-generation quad-core architecture from Intel. This is the first processor from Intel with a non-uniform memory access (NUMA) architecture managed by on-chip integrated memory controller. It employs a point-to-point interconnect called the Intel<sup>®</sup> QuickPath Interconnect (QPI) between processors and to the input/output (I/O) hub. It also introduces to a quad-core architecture both Intel's hyper-threading technology (or simultaneous multi-threading, \\\"SMT\\\") and Intel<sup>®</sup> Turbo Boost Technology (\\\"Turbo mode\\\") that automatically allow processor cores to run faster than the base operating frequency if the processor is operating below rated power, temperature, and current specification limits. It can be engaged with any number of cores or logical processors enabled and active. We critically evaluate these features using the High Performance Computing Challenge (HPCC) benchmarks, NAS Parallel Benchmarks (NPB), and four full-scale scientific applications. We compare and contrast the results of a cluster based on the Xeon 5560 with an SGI<sup>®</sup> Altix<sup>®</sup> ICE 8200EX cluster of quad-core Intel<sup>®</sup> Xeon<sup>®</sup> 5472 Processor (\\\"Xeon 5472\\\" from here on) and another cluster of Intel<sup>®</sup> Xeon<sup>®</sup> 5462 Processor (\\\"Xeon 5462\\\"; the Xeon 5400 Series Processors are previous generation quad-core Intel processors and were code named Harpertown).\",\"PeriodicalId\":371415,\"journal\":{\"name\":\"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"20\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1654059.1654084\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1654059.1654084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 20

摘要

在本文中,我们提出了基于英特尔®至强®处理器5560(代号为“Nehalem-EP”,在本文中称为至强5560)的624核集群的早期性能评估-英特尔的第三代四核架构。这是英特尔第一个采用非统一内存访问(NUMA)架构的处理器,该架构由片上集成内存控制器管理。它在处理器和输入/输出(I/O)集线器之间采用点对点互连,称为英特尔®快速路径互连(QPI)。它还为四核架构引入了英特尔的超线程技术(或同时多线程,“SMT”)和英特尔®Turbo Boost技术(“Turbo模式”),如果处理器运行在低于额定功率,温度和当前规格限制的情况下,自动允许处理器内核运行速度超过基本工作频率。它可以与任意数量的内核或逻辑处理器一起使用。我们使用高性能计算挑战(HPCC)基准、NAS并行基准(NPB)和四个全面的科学应用程序对这些特性进行了批判性评估。我们比较和对比了基于Xeon 5560的集群与SGI®Altix®ICE 8200EX四核Intel®Xeon®5472处理器集群(“Xeon 5472”从这里开始)和另一个Intel®Xeon®5462处理器集群(“Xeon 5462”;至强5400系列处理器是上一代四核英特尔处理器,代号为Harpertown)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Early performance evaluation of a "Nehalem" cluster using scientific and engineering applications
In this paper, we present an early performance evaluation of a 624-core cluster based on the Intel® Xeon® Processor 5560 (code named "Nehalem-EP", and referred to as Xeon 5560 in this paper)---the third-generation quad-core architecture from Intel. This is the first processor from Intel with a non-uniform memory access (NUMA) architecture managed by on-chip integrated memory controller. It employs a point-to-point interconnect called the Intel® QuickPath Interconnect (QPI) between processors and to the input/output (I/O) hub. It also introduces to a quad-core architecture both Intel's hyper-threading technology (or simultaneous multi-threading, "SMT") and Intel® Turbo Boost Technology ("Turbo mode") that automatically allow processor cores to run faster than the base operating frequency if the processor is operating below rated power, temperature, and current specification limits. It can be engaged with any number of cores or logical processors enabled and active. We critically evaluate these features using the High Performance Computing Challenge (HPCC) benchmarks, NAS Parallel Benchmarks (NPB), and four full-scale scientific applications. We compare and contrast the results of a cluster based on the Xeon 5560 with an SGI® Altix® ICE 8200EX cluster of quad-core Intel® Xeon® 5472 Processor ("Xeon 5472" from here on) and another cluster of Intel® Xeon® 5462 Processor ("Xeon 5462"; the Xeon 5400 Series Processors are previous generation quad-core Intel processors and were code named Harpertown).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信