2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing最新文献

An Adaptive Data Prefetcher for High-Performance Processors 高性能处理器的自适应数据预取器

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing Pub Date : 2010-05-17 DOI: 10.1109/CCGRID.2010.61

Yong Chen, Huaiyu Zhu, Xian-He Sun

{"title":"An Adaptive Data Prefetcher for High-Performance Processors","authors":"Yong Chen, Huaiyu Zhu, Xian-He Sun","doi":"10.1109/CCGRID.2010.61","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.61","url":null,"abstract":"While computing speed continues increasing rapidly, data-access technology is lagging behind. Data-access delay, not the processor speed, becomes the leading performance bottleneck of high-end/high-performance computing. Prefetching is an effective solution to masking the gap between computing speed and data-access speed. Existing works of prefetching, however, are very conservative in general, due to the computing power consumption concern of the past. They suffer in effectiveness especially when applications' access pattern changes. In this study, we propose an Algorithm-level Feedback-controlled Adaptive (AFA) data prefetcher to address these issues. The AFA prefetcher is based on the Data-Access History Cache, a hardware structure that is specifically designed for data prefetching. It provides an algorithm-level adaptation and is capable of dynamically adapting to appropriate prefetching algorithms at runtime. We have conducted extensive simulation testing with Simple Scalar simulator to validate the design and to illustrate the performance gain. The simulation results show that AFA prefetcher is effective and achieves considerable IPC (Instructions Per Cycle) improvement in average.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"47 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120851160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 15

A Realistic Integrated Model of Parallel System Workloads 并行系统负载的现实集成模型

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing Pub Date : 2010-05-17 DOI: 10.1109/CCGRID.2010.32

T. Minh, L. Wolters, D. Epema

引用次数: 33

Designing Accelerator-Based Distributed Systems for High Performance 基于加速器的高性能分布式系统设计

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing Pub Date : 2010-05-17 DOI: 10.1109/CCGRID.2010.109

M. M. Rafique, A. Butt, Dimitrios S. Nikolopoulos

{"title":"Designing Accelerator-Based Distributed Systems for High Performance","authors":"M. M. Rafique, A. Butt, Dimitrios S. Nikolopoulos","doi":"10.1109/CCGRID.2010.109","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.109","url":null,"abstract":"Multi-core processors with accelerators are becoming commodity components for high-performance computing at scale. While accelerator-based processors have been studied in some detail, the design and management of clusters based on these processors have not received the same focus. In this paper, we present an exploration of four design and resource management alternatives, which can be used on large-scale asymmetric clusters with accelerators. Moreover, we adapt the popular MapReduce programming model to our proposed configurations. We enhance MapReduce with new dynamic data streaming and workload scheduling capabilities, which enable application writers to use asymmetric accelerator-based clusters without being concerned with the capabilities of individual components. We present an evaluation of the presented designs in a physical setting and show that our designs can provide significant performance advantages. Compared to a standard static MapReduce design, we achieve 62.5%, 73.1%, and 82.2% performance improvement using accelerators with limited general-purpose resources, well-provisioned shared general-purpose resources, and well-provisioned dedicated general-purpose resources, respectively.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124500986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 15

A Heuristic Query Optimization Approach for Heterogeneous Environments 异构环境下的启发式查询优化方法

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing Pub Date : 2010-05-17 DOI: 10.1109/CCGRID.2010.65

P. Beran, W. Mach, R. Vigne, Juergen Mangler, E. Schikuta

引用次数: 3

Integration of Heterogeneous and Non-dedicated Environments for R 集成异构和非专用的R环境

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing Pub Date : 2010-05-17 DOI: 10.1109/CCGRID.2010.102

Gonzalo Vera, R. Suppi

{"title":"Integration of Heterogeneous and Non-dedicated Environments for R","authors":"Gonzalo Vera, R. Suppi","doi":"10.1109/CCGRID.2010.102","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.102","url":null,"abstract":"Parallel computing is becoming essential for nowadays data analysis in several disciplines. In order to profit from parallel processing of experimental data, specialized skills, software tools and suitable computing resources are required. Desktop grids and volunteer-based systems have proved themselves as powerful options where distributed idle resources from heterogeneous computers are aggregated to build powerful met computers. Software solutions are required to automate and assist the process of transformation and adaptation of current and new applications to run in these environments. Finally, it is desirable, for the same tool, to provide an efficient solution to orchestrate the execution of these programs using a diversity of dynamic environments. In this paper we describe an implementation of an integrated solution for the R language which allows the transformation and execution of parallel loops in heterogeneous and non-dedicated environments. The results obtained allow us to prove the feasibility of our proposal. Furthermore, several issues that tools like this must consider to improve their performance when integrating heterogeneous systems are described.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133423920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

ConnectX-2 InfiniBand Management Queues: First Investigation of the New Support for Network Offloaded Collective Operations ConnectX-2 InfiniBand管理队列:网络卸载集体操作新支持初探

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing Pub Date : 2010-05-17 DOI: 10.1109/CCGRID.2010.9

R. Graham, Steve Poole, Pavel Shamis, Gil Bloch, N. Bloch, H. Chapman, Michael Kagan, Ariel Shahar, Ishai Rabinovitz, G. Shainer

{"title":"ConnectX-2 InfiniBand Management Queues: First Investigation of the New Support for Network Offloaded Collective Operations","authors":"R. Graham, Steve Poole, Pavel Shamis, Gil Bloch, N. Bloch, H. Chapman, Michael Kagan, Ariel Shahar, Ishai Rabinovitz, G. Shainer","doi":"10.1109/CCGRID.2010.9","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.9","url":null,"abstract":"This paper introduces the newly developed Infini- Band (IB) Management Queue capability, used by the Host Channel Adapter (HCA) to manage network task data flow dependancies, and progress the communications associated with such flows. These tasks include sends, receives, and the newly supported wait task, and are scheduled by the HCA based on a data dependency description provided by the user. This functionality is supported by the ConnectX-2 HCA, and provides the means for delegating collective communication management and progress to the HCA, also known as collective communication offload. This provides a means for overlapping collective communications managed by the HCA and computation on the Central Processing Unit (CPU), thus making it possible to reduce the impact of system noise on parallel applications using collective operations. This paper further describes how this new capability can be used to implement scalable Message Passing Interface (MPI) collective operations, describing the high level details of how this new capability is used to implement the MPI Barrier collective operation, focusing on the latency sensitive performance aspects of this new capability. This paper concludes with small scale bench- mark experiments comparing implementations of the barrier collective operation, using the new network offload capabilities, with established point-to-point based implementations of these same algorithms, which manage the data flow using the central processing unit. These early results demonstrate the promise this new capability provides to improve the scalability of high- performance applications using collective communications. The latency of the HCA based implementation of the barrier is similar to that of the best performing point-to-point based implementation managed by the central processing unit, starting to outperform these as the number of processes involved in the collective operation increases.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"216 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124269307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 43

Using Cloud Constructs and Predictive Analysis to Enable Pre-Failure Process Migration in HPC Systems 使用云结构和预测分析在高性能计算系统中实现故障前流程迁移

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing Pub Date : 2010-05-17 DOI: 10.1109/CCGRID.2010.31

J. Brandt, Frank Chen, Vincent De Sapio, A. Gentile, J. Mayo, P. Pébay, D. Roe, D. Thompson, M. Wong

引用次数: 7

Methodology for Efficient Execution of SPMD Applications on Multicore Environments 在多核环境中有效执行SPMD应用程序的方法

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing Pub Date : 2010-05-17 DOI: 10.1109/CCGRID.2010.67

Ronal Muresano, Dolores Rexachs, E. Luque

引用次数: 10

Energy Efficient Resource Management in Virtualized Cloud Data Centers 虚拟化云数据中心的节能资源管理

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing Pub Date : 2010-05-17 DOI: 10.1109/CCGRID.2010.46

A. Beloglazov, R. Buyya

引用次数: 839

Sky Computing: When Multiple Clouds Become One 天空计算:当多云合二为一

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing Pub Date : 2010-05-17 DOI: 10.1109/CCGRID.2010.136

J. Fortes

{"title":"Sky Computing: When Multiple Clouds Become One","authors":"J. Fortes","doi":"10.1109/CCGRID.2010.136","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.136","url":null,"abstract":"Summary form only given. The growing number of announced commercial and scientific clouds strongly suggests that in the near future these providers will be differentiated according to the types of their services, their cost, availability and quality. Users will be able to use these and other criteria to determine which clouds best suit their needs, a plausible scenario being the case when users need to aggregate capabilities provided by different clouds. In such scenarios it will be essential to provide virtual networking technologies that enable providers to support cross-cloud communication and users to deploy cross-cloud applications. This talk will describe one such technology, its salient features and remaining challenges. It will also put forward the idea of virtual clouds, i.e. providers of computing services overlaid on more than one cloud. A virtual cloud spans across multiple cloud providers and presents the view of a single logical cloud. Virtual clouds would enable high-level computing services to be provided by third parties who do not own physical resources, could be short or long-lived and highly dynamic. Enabling technologies, challenges and examples of sky computing will be presented.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117301182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5