2010 22nd International Symposium on Computer Architecture and High Performance Computing: Latest Papers (Page 3)

BatchQueue: Fast and Memory-Thrifty Core to Core Communication

2010 22nd International Symposium on Computer Architecture and High Performance Computing Pub Date : 2010-10-27 DOI: 10.1109/SBAC-PAD.2010.34

Thomas Preud'homme, Julien Sopena, Gaël Thomas, B. Folliot

Citations: 15

An Analytical Model on the Execution of Transactional Memory

2010 22nd International Symposium on Computer Architecture and High Performance Computing Pub Date : 2010-10-27 DOI: 10.1109/SBAC-PAD.2010.29

Xiao Yu, Zhengyu He, Bo Hong

Citations: 5

Achieving Fault Tolerance on Grids with the CPPC Framework and the GridWay Metascheduler

2010 22nd International Symposium on Computer Architecture and High Performance Computing Pub Date : 2010-10-27 DOI: 10.1109/SBAC-PAD.2010.22

Iván Cores, Gabriel Rodríguez, María J. Martín, P. González

Citations: 3

Analyzing Cache Coherence Protocols for Server Consolidation

2010 22nd International Symposium on Computer Architecture and High Performance Computing Pub Date : 2010-10-27 DOI: 10.1109/SBAC-PAD.2010.31

Antonio García-Guirado, Ricardo Fernández Pascual, José M. García

Abstract: Server consolidation is commonly used today to make the most of all the cores of a chip multiprocessor by running several virtual machines (VMs) on it. Cache coherence protocols can be adapted to take advantage of such a scenario. Along these lines, Virtual Hierarchies (VHs) use two levels of cache coherence in a consolidated server. They isolate the coherence actions of each VM and improve performance by maximizing the number of memory accesses serviced by caches within the VM. In this paper we show that hierarchical protocols with no single ordering point for requests, such as VHs in the form currently proposed, are prone to deadlocks. Moreover, when memory deduplication is used, VHs cannot take advantage of it at the cache level, both because deduplicated data is reduplicated in cache and because accesses to deduplicated data often require access, by means of broadcast, to the cache tiles used by a different VM. We analyze these problems and propose solutions for them, showing the actual performance of these protocols and giving some insights for the future development of coherence protocols optimized for server consolidation.

Citations: 2

Tree Projection-Based Frequent Itemset Mining on Multicore CPUs and GPUs

2010 22nd International Symposium on Computer Architecture and High Performance Computing Pub Date : 2010-10-27 DOI: 10.1109/SBAC-PAD.2010.15

George Teodoro, Nathan Mariano, Wagner Meira Jr, R. Ferreira

Citations: 23

Simultaneous Evaluation of Multiple I/O Strategies

2010 22nd International Symposium on Computer Architecture and High Performance Computing Pub Date : 2010-10-27 DOI: 10.1109/SBAC-PAD.2010.30

Pilar González-Férez, J. Piernas, Toni Cortes

Citations: 5

Towards a Peer-to-Peer Framework for Parallel and Distributed Computing

2010 22nd International Symposium on Computer Architecture and High Performance Computing Pub Date : 2010-10-27 DOI: 10.1109/SBAC-PAD.2010.23

L. José Senger, Márcio Augusto de Souza, D. Foltran

Citations: 11

Performance Issues for Parallel Implementations of Bootstrap Simulation Algorithm

2010 22nd International Symposium on Computer Architecture and High Performance Computing Pub Date : 2010-10-27 DOI: 10.1109/SBAC-PAD.2010.28

R. Czekster, Paulo Fernandes, Afonso Sales, T. Webber

Abstract: The solution of state-based stochastic models is usually a demanding application, so it is a natural subject for high-performance techniques. We are particularly interested in the speedup of Bootstrap Simulation of structured Markovian models. This approach is a quite recent development in the performance evaluation area, and it brings a considerable improvement in the accuracy of results, despite the intrinsic effect of randomness in simulation experiments. Unfortunately, Bootstrap Simulation has a higher computational cost than other alternatives. We present experiments with different options to optimize the parallel solution of Bootstrap Simulation applied to three practical examples described in the Stochastic Automata Networks (SAN) formalism. This paper's contribution resides in the discussion of theoretical implementation issues, the obtained speedup, and the actual processing and communication times for all experiments. Additionally, we suggest future work to further improve the proposed solution and discuss some interesting insights for the parallelization of similar applications.

Citations: 1

A Clock Synchronization Strategy for Minimizing Clock Variance at Runtime in High-End Computing Environments

2010 22nd International Symposium on Computer Architecture and High Performance Computing Pub Date : 2010-10-27 DOI: 10.1109/SBAC-PAD.2010.33

T. Jones, G. Koenig

Citations: 14