Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238)最新文献_第4页

How to improve local load balancing policies by distorting load information 如何通过扭曲负载信息来改进本地负载均衡策略

Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238) Pub Date : 1998-12-17 DOI: 10.1109/HIPC.1998.738004

F. Zambonelli

引用次数: 5

Design alternatives for shared memory multiprocessors 为共享内存多处理器设计备选方案

Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238) Pub Date : 1998-12-17 DOI: 10.1109/HIPC.1998.737969

J. Carter, Chen-Chi Kuo, R. Kuramkote, M. Swanson

{"title":"Design alternatives for shared memory multiprocessors","authors":"J. Carter, Chen-Chi Kuo, R. Kuramkote, M. Swanson","doi":"10.1109/HIPC.1998.737969","DOIUrl":"https://doi.org/10.1109/HIPC.1998.737969","url":null,"abstract":"We consider the design alternatives available for building the next generation DSM machine (e.g., the choice of memory architecture, network technology, and amount and location of per-node remote data cache). To investigate this design space, we have simulated five applications on a wide variety of possible DSM architectures that employ significantly different caching techniques. We also examine the impact of using a special purpose system interconnect designed specifically to support low latency DSM operation versus using a powerful off the shelf system interconnect. We found that two architectures have the best combination of good average performance and reasonable worst case performance: CC-NUMA employing a moderate sized DRAM remote access cache (RAC) and a hybrid CC-NUMA/S-COMA architecture called AS-COMA or adaptive S-COMA. Both pure CC-NUMA and pure S-COMA have serious performance problems for some applications, while CC-NUMA employing an SRAM RAC does not perform as well as the two architectures that employ larger DRAM caches. The paper concludes with several recommendations to designers of next generation DSM machines, complete with a discussion of the issues that led to each recommendation so that designers can decide which ones are relevant to them given changes in technology and corporate priorities.","PeriodicalId":175528,"journal":{"name":"Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238)","volume":"311 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129639552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Testing concurrency and communication in distributed objects 测试分布式对象中的并发性和通信

Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238) Pub Date : 1998-12-17 DOI: 10.1109/HIPC.1998.738017

Adnan Bader, A. Sajeev, S. Ramakrishnan

引用次数: 14

Execution characteristics of object oriented programs on the UltraSPARC-II 面向对象程序在UltraSPARC-II上的执行特性

Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238) Pub Date : 1998-12-17 DOI: 10.1109/HIPC.1998.737990

R. Radhakrishnan, L. John

{"title":"Execution characteristics of object oriented programs on the UltraSPARC-II","authors":"R. Radhakrishnan, L. John","doi":"10.1109/HIPC.1998.737990","DOIUrl":"https://doi.org/10.1109/HIPC.1998.737990","url":null,"abstract":"It is widely accepted that object-oriented design improves code reusability, facilitates code maintainability and enables higher levels of abstraction. Although software developers and the software engineering community have embraced object-oriented programming for these benefits, there have been wide concerns about the performance overhead associated with this programming paradigm on modern processors. We characterize the performance of several C and C++ benchmarks on an UltraSPARC-II processor. Various architectural data related to execution behavior of the benchmarks are collected using on-chip performance monitoring counters. Factors including CPI, instruction and data cache misses, processor stalls due to instruction cache misses and branch misprediction, from real execution of several programs are measured and presented. While previous research evaluates the behavioral differences between C and C++ programs based on profiling and simulation, we measure execution behavior. Results show that the programs in the C++ suite incur a higher CPI, higher i-cache misses, and higher branch mispredictions than the programs in the C suite. A strong correlation was observed between CPI and branch mispredictions for the C++ application programs.","PeriodicalId":175528,"journal":{"name":"Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131268396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

A simple mechanism to deal with sequential code in dataflow architectures 在数据流体系结构中处理顺序代码的简单机制

Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238) Pub Date : 1998-12-17 DOI: 10.1109/HIPC.1998.737988

M. A. Cavenaghi, G. Travieso, Á. G. Neto

引用次数: 5

Java data parallel extensions with runtime system support 具有运行时系统支持的Java数据并行扩展

Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238) Pub Date : 1998-12-17 DOI: 10.1109/HIPC.1998.737978

Yuhong Wen, Bryan Carpenter, Geoffrey C. Fox, Guansong Zhang

引用次数: 1

Global reactive congestion control in multicomputer networks 多计算机网络中的全局响应拥塞控制

Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238) Pub Date : 1998-12-17 DOI: 10.1109/HIPC.1998.737987

Abdel-Halim Smai, L. Thorelli

引用次数: 48

On-line diagnosibility of baseline interconnection network 基线互联网络的在线诊断

Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238) Pub Date : 1998-12-17 DOI: 10.1109/HIPC.1998.737995

Sipra Das, A. Chaudhuri

引用次数: 0

Modulo-variable expansion sensitive scheduling 模变量扩展敏感调度

Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238) Pub Date : 1998-12-17 DOI: 10.1109/HIPC.1998.738006

M. Valluri, R. Govindarajan

引用次数: 2

PERL-a registerless architecture perl—无寄存器体系结构

Proceedings. Fifth International Conference on High Performance Computing (Cat. No. 98EX238) Pub Date : 1998-12-17 DOI: 10.1109/HIPC.1998.737968

P. Suresh, R. Moona

引用次数: 3