Proceedings of the 20th Annual International Symposium on Computer Architecture最新文献

Design Tradeoffs For Software-managed Tlbs 软件管理Tlbs的设计权衡

Proceedings of the 20th Annual International Symposium on Computer Architecture Pub Date : 1994-08-01 DOI: 10.1109/ISCA.1993.698543

D. Nagle, R. Uhlig, Timothy J. Stanley, S. Sechrest, T. Mudge, Richard B. Brown

引用次数: 142

Hierarchical Performance Modeling With MACS: A Case Study Of The Convex C-240 用MACS进行分层性能建模:凸型C-240的案例研究

Proceedings of the 20th Annual International Symposium on Computer Architecture Pub Date : 1993-05-01 DOI: 10.1109/ISCA.1993.698561

E. Boyd, E. Davidson

引用次数: 19

The Performance Of Cache-coherent Ring-based Multiprocessors 基于缓存相干环的多处理器性能研究

Proceedings of the 20th Annual International Symposium on Computer Architecture Pub Date : 1993-05-01 DOI: 10.1109/ISCA.1993.698567

L. Barroso, M. Dubois

引用次数: 69

Evaluation Of Release Consistent Software Distributed Shared Memory On Emerging Network Technology 新兴网络技术下发布一致性软件分布式共享内存的评价

Proceedings of the 20th Annual International Symposium on Computer Architecture Pub Date : 1993-05-01 DOI: 10.1145/165123.165150

S. Dwarkadas, P. Keleher, A. Cox, W. Zwaenepoel

{"title":"Evaluation Of Release Consistent Software Distributed Shared Memory On Emerging Network Technology","authors":"S. Dwarkadas, P. Keleher, A. Cox, W. Zwaenepoel","doi":"10.1145/165123.165150","DOIUrl":"https://doi.org/10.1145/165123.165150","url":null,"abstract":"We evaluate the effect of processor speed, network characteristics, and software overhead on the performance of release-consistent software distributed shared memory. We examine five different protocols for implementing release consistency: eager update, eager invalidate, lazy update, lazy invalidate, and a new protocol called lazy hybrid. This lazy hybrid protocol combines the benefits of both lazy update and lazy invalidate.\u0000Our simulations indicate that with the processors and networks that are becoming available, coarse-grained applications such as Jacobi and TSP perform well, more or less independent of the protocol used. Medium-grained applications, such as Water, can achieve good performance, but the choice of protocol is critical. For sixteen processors, the best protocol, lazy hybrid, performed more than three times better than the worst, the eager update. Fine-grained applications such as Cholesky achieve little speedup regardless of the protocol used because of the frequency of synchronization operations and the high latency involved.\u0000While the use of relaxed memory models, lazy implementations, and multiple-writer protocols has reduced the impact of false sharing, synchronization latency remains a serious problem for software distributed shared memory systems. These results suggest that the future work on software DSMs should concentrate on reducing the amount of synchronization or its effect.","PeriodicalId":410022,"journal":{"name":"Proceedings of the 20th Annual International Symposium on Computer Architecture","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127951813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 118

A Comparison Of Dynamic Branch Predictors That Use Two Levels Of Branch History 使用两层分支历史的动态分支预测器的比较

Proceedings of the 20th Annual International Symposium on Computer Architecture Pub Date : 1993-05-01 DOI: 10.1109/ISCA.1993.698566

Tse-Yu Yeh, Y. Patt

引用次数: 419

Improving AP1000 Parallel Computer Performance With Message Communication 利用消息通信提高AP1000并行计算机性能

Proceedings of the 20th Annual International Symposium on Computer Architecture Pub Date : 1993-05-01 DOI: 10.1145/165123.165168

T. Horie, K. Hayashi, T. Shimizu, H. Ishihata

引用次数: 17

The Chinese Remainder Theorem And The Prime Memory System 中国剩余定理与素数记忆系统

Proceedings of the 20th Annual International Symposium on Computer Architecture Pub Date : 1993-05-01 DOI: 10.1109/ISCA.1993.698573

Qing-Qiang Gao

引用次数: 43

The Architecture Of A Fault-tolerant Cached RAID Controller 容错缓存RAID控制器的结构

Proceedings of the 20th Annual International Symposium on Computer Architecture Pub Date : 1993-05-01 DOI: 10.1109/ISCA.1993.698547

J. Menon, Jim Cortney

引用次数: 96

Transactional Memory: Architectural Support For Lock-free Data Structures 事务性内存:无锁数据结构的体系结构支持

Proceedings of the 20th Annual International Symposium on Computer Architecture Pub Date : 1993-05-01 DOI: 10.1109/ISCA.1993.698569

Maurice Herlihy, J. E. B. Moss

引用次数: 2560

Adaptive Cache Coherency For Detecting Migratory Shared Data 自适应缓存一致性检测迁移共享数据

Proceedings of the 20th Annual International Symposium on Computer Architecture Pub Date : 1993-05-01 DOI: 10.1109/ISCA.1993.698549

A. Cox, R. Fowler

{"title":"Adaptive Cache Coherency For Detecting Migratory Shared Data","authors":"A. Cox, R. Fowler","doi":"10.1109/ISCA.1993.698549","DOIUrl":"https://doi.org/10.1109/ISCA.1993.698549","url":null,"abstract":"Parallel programs exhibit a small number of distinct data-sharing patterns. A common data-sharing pattern, migratory access, is characterized by exclusive read and write access by one processor at a time to a shared datum. We describe a family of adaptive cache coherency protocols that dynamically identify migratory shared data in order to reduce the cost of moving them. The protocols use a standard memory model and processor-cache interface. They do not require any compile-time or run-time software support. We describe implementations for bus-based multiprocessors and for shared-memory multiprocessors that use directory-based caches. These implementations are simple and would not significantly increase hardware cost. We use trace- and execution-driven simulation to compare the performance of the adaptive protocols to standard write-invalidate protocols. These simulations indicate that, compared to conventional protocols, the use of the adaptive protocol can almost halve the number of inter-node messages on some applications. Since cache coherency traffic represents a larger part of the total communication as cache size increases, the relative benefit of using the adaptive protocol also increases.","PeriodicalId":410022,"journal":{"name":"Proceedings of the 20th Annual International Symposium on Computer Architecture","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1993-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126756251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 194