ACM/IEEE SC 1999 Conference (SC'99): Latest Publications

Evaluating Titanium SPMD Programs on the Tera MTA
ACM/IEEE SC 1999 Conference (SC'99) Pub Date: 1999 DOI: 10.1145/331532.331575
Carleton Miyamoto, Chang Lin
While the common trend in building large-scale multiprocessors is to use commodity compute nodes that are increasingly powerful and have deep memory hierarchies, the Tera MTA uses a different design point, with a relatively flat memory system, no processor caches, and hardware support for light-weight multithreading, which is used to mask memory latency. In this paper we explore the implementation of Titanium, a language with coarse-grained SPMD parallelism, onto the MTA. The major concerns in obtaining high performance on the MTA are sufficient degrees of parallelism, good load balance, and low synchronization overhead. We show that by adding loop level parallelism, Titanium applications have sufficient parallelism for the MTA, and as expected, application writers do not need to orchestrate data layout. We evaluate multiple implementations of the Titanium synchronization constructs, which include barriers and monitors. We then explore several scheduling strategies, and find that the distinction between SPMD and loop level parallelism proves to be surprisingly useful. The two-level parallelism structure can be used to throttle thread migration, which lowers thread creation overhead and synchronization. We use a combination of micro-benchmarks and applications to demonstrate these results.
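The two-level structure the abstract describes, coarse-grained SPMD workers plus loop-level parallelism coordinated by barriers, can be pictured with a small sketch. The following is not Titanium or Tera MTA code; it is a hypothetical Python threading illustration in which each SPMD worker splits its local loop across inner threads and then meets the others at a shared barrier, the kind of synchronization construct whose implementations the paper evaluates.

```python
# Hypothetical illustration of two-level (SPMD + loop-level) parallelism with
# a barrier, loosely modeled on the structure described in the abstract.
# Plain Python threading, not Titanium or Tera MTA code.
import threading

NUM_SPMD = 4          # coarse-grained SPMD workers
INNER_THREADS = 2     # loop-level parallelism added within each worker
N = 1_000_000

data = list(range(N))
partials = [0] * NUM_SPMD
barrier = threading.Barrier(NUM_SPMD)   # all SPMD workers meet here between phases

def loop_chunk(lo, hi, out, idx):
    # inner loop-level work: sum one slice of the data
    out[idx] = sum(data[lo:hi])

def spmd_worker(rank):
    # phase 1: each SPMD worker parallelizes its own loop range
    lo = rank * N // NUM_SPMD
    hi = (rank + 1) * N // NUM_SPMD
    step = (hi - lo) // INNER_THREADS
    results = [0] * INNER_THREADS
    inner = [threading.Thread(target=loop_chunk,
                              args=(lo + i * step,
                                    hi if i == INNER_THREADS - 1 else lo + (i + 1) * step,
                                    results, i))
             for i in range(INNER_THREADS)]
    for t in inner: t.start()
    for t in inner: t.join()
    partials[rank] = sum(results)
    # phase 2: barrier, then one worker performs the reduction
    barrier.wait()
    if rank == 0:
        print("total =", sum(partials))

threads = [threading.Thread(target=spmd_worker, args=(r,)) for r in range(NUM_SPMD)]
for t in threads: t.start()
for t in threads: t.join()
```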
Citations: 3
Adaptive Performance Prediction for Distributed Data-Intensive Applications
ACM/IEEE SC 1999 Conference (SC'99) Pub Date: 1999 DOI: 10.1145/331532.331568
M. Faerman, Alan Su, R. Wolski, F. Berman
The computational grid is becoming the platform of choice for large-scale distributed data-intensive applications. Accurately predicting the transfer times of remote data files, a fundamental component of such applications, is critical to achieving application performance. In this paper, we introduce a performance prediction method, AdRM (Adaptive Regression Modeling), to determine file transfer times for network-bound distributed data-intensive applications. We demonstrate the effectiveness of the AdRM method on two distributed data applications, SARA (Synthetic Aperture Radar Atlas) and SRB (Storage Resource Broker), and discuss how it can be used for application scheduling. Our experiments use the Network Weather Service [36, 37], a resource performance measurement and forecasting facility, as a basis for the performance prediction model. Our initial findings indicate that the AdRM method can be effective in accurately predicting data transfer times in wide-area multi-user grid environments.
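The abstract does not give the exact AdRM regression form, so the sketch below only illustrates the general approach as an assumption: fit a simple linear model of transfer time against size divided by measured bandwidth, using past observations of the kind the Network Weather Service supplies, and predict from a fresh measurement. All numbers and the model form are made up.

```python
# Minimal regression-based transfer-time prediction in the spirit of AdRM.
# The actual AdRM model is not specified in the abstract; we simply assume
# transfer_time ~ a * (file_size / measured_bandwidth) + b and fit a, b by
# least squares over hypothetical NWS-style measurements.
import numpy as np

# hypothetical history: (file size in MB, measured bandwidth in MB/s, observed seconds)
history = [
    (100.0, 2.0, 55.0),
    (200.0, 2.5, 84.0),
    (150.0, 1.8, 90.0),
    (300.0, 3.0, 105.0),
    ( 50.0, 2.2, 26.0),
]

x = np.array([size / bw for size, bw, _ in history])   # "ideal" size/bandwidth time
y = np.array([obs for _, _, obs in history])           # observed transfer time

a, b = np.polyfit(x, y, 1)   # fit y ≈ a * x + b

def predict_transfer_time(size_mb, current_bandwidth_mbps):
    """Predict transfer time from a fresh bandwidth measurement."""
    return a * (size_mb / current_bandwidth_mbps) + b

print("predicted time for a 250 MB file at 2.4 MB/s:",
      round(predict_transfer_time(250.0, 2.4), 1), "s")
```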
Citations: 99
The Diesel Combustion Collaboratory: Combustion Researchers Collaborating over the Internet
ACM/IEEE SC 1999 Conference (SC'99) Pub Date: 1999 DOI: 10.1145/331532.331596
C. Pancerella, L. Rahn, Christine L. Yang
The Diesel Combustion Collaboratory (DCC) is a pilot project to develop and deploy collaborative technologies to combustion researchers distributed throughout the DOE national laboratories, academia, and industry. The result is a problem-solving environment for combustion research. Researchers collaborate over the Internet using DCC tools, which include: a distributed execution management system for running combustion models on widely distributed computers, including supercomputers; web-accessible data archiving capabilities for sharing graphical experimental or modeling data; electronic notebooks and shared workspaces for facilitating collaboration; visualization of combustion data; and video-conferencing and data-conferencing among researchers at remote sites. Security is a key aspect of the collaborative tools. In many cases, we have integrated these tools to allow data, including large combustion data sets, to flow seamlessly, for example, from modeling tools to data archives. In this paper the authors describe the work of a larger collaborative effort to design, implement and deploy the DCC.
Citations: 29
Terascale Spectral Element Algorithms and Implementations
ACM/IEEE SC 1999 Conference (SC'99) Pub Date: 1999 DOI: 10.1145/331532.331599
H. Tufo, P. Fischer
We describe the development and implementation of an efficient spectral element code for multimillion gridpoint simulations of incompressible flows in general two- and three-dimensional domains. Key to this effort has been the development of scalable solvers for elliptic problems and a stabilization scheme that admits full use of the method's high-order accuracy. We review these and other recently developed algorithmic underpinnings that have resulted in good parallel and vector performance on a broad range of architectures and that, with sustained performance of 319 GFLOPS on 2048 nodes of the Intel ASCI-Red machine at Sandia, readies us for the multithousand node terascale computing systems now coming on line at the DOE labs.
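As a rough illustration of the "scalable solvers for elliptic problems" the abstract mentions, the sketch below runs a generic conjugate gradient iteration on a small 1D Poisson system. It is not the authors' spectral element solver; it only shows, under simplified assumptions, the flavor of Krylov iteration that iterative elliptic solvers of this kind are typically built around.

```python
# Generic illustration of an iterative elliptic solve: unpreconditioned
# conjugate gradients on a 1D finite-difference Poisson problem. This is not
# the paper's spectral element solver.
import numpy as np

n = 64
h = 1.0 / (n + 1)
# standard second-order finite-difference Laplacian (SPD), Dirichlet BCs
A = (np.diag(2.0 * np.ones(n)) - np.diag(np.ones(n - 1), 1)
     - np.diag(np.ones(n - 1), -1)) / h**2
b = np.ones(n)   # right-hand side f(x) = 1

def conjugate_gradient(A, b, tol=1e-10, max_iter=1000):
    x = np.zeros_like(b)
    r = b - A @ x
    p = r.copy()
    rs_old = r @ r
    for _ in range(max_iter):
        Ap = A @ p
        alpha = rs_old / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:
            break
        p = r + (rs_new / rs_old) * p
        rs_old = rs_new
    return x

u = conjugate_gradient(A, b)
print("residual norm:", np.linalg.norm(b - A @ u))
```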
Citations: 137
Scheduling Constrained Dynamic Applications on Clusters
ACM/IEEE SC 1999 Conference (SC'99) Pub Date: 1999 DOI: 10.1145/331532.331578
K. Knobe, James M. Rehg, A. Chauhan, R. Nikhil, U. Ramachandran
There is an emerging class of computationally demanding multimedia applications involving vision, speech and interaction with the real world (e.g., CRL's Smart Kiosk). These applications are highly parallel and require low latencies for good performance. They are well-suited for implementation on clusters of SMPs, but they require efficient scheduling of application tasks. General purpose schedulers produce high latencies because they lack knowledge of the dependencies between tasks. Previous research in optimal scheduling has been limited to static problems. In contrast, our application is highly dynamic as the optimal schedule depends upon the behavior of the kiosk's customers. We observe that the dynamism of our application class is constrained, in that there are a small number of operating regimes which are determined by the state of the application. We present a framework for optimal scheduling of constrained dynamic applications. The results of an experimental comparison with a hand-tuned schedule are promising.
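The notion of constrained dynamism, a small number of operating regimes each with its own precomputed schedule, can be sketched as follows. This is not the paper's framework; the regimes, tasks, and node names are hypothetical, and regime detection is reduced to a single observable.

```python
# Minimal sketch of regime-based scheduling: schedules are computed offline,
# one per operating regime, and at run time we only select the schedule that
# matches the current application state. All names below are made up.
SCHEDULES = {
    "idle":       {"camera": ["node0"], "tracking": ["node0"], "speech": ["node0"]},
    "one_person": {"camera": ["node0"], "tracking": ["node1"], "speech": ["node2"]},
    "crowd":      {"camera": ["node0"], "tracking": ["node1", "node2"], "speech": ["node3"]},
}

def current_regime(num_people_detected):
    # the regime is determined by observable application state
    if num_people_detected == 0:
        return "idle"
    if num_people_detected == 1:
        return "one_person"
    return "crowd"

def dispatch(task, node):
    print(f"run {task} on {node}")

def schedule_frame(num_people_detected):
    regime = current_regime(num_people_detected)
    for task, nodes in SCHEDULES[regime].items():
        for node in nodes:
            dispatch(task, node)

schedule_frame(num_people_detected=3)   # selects the "crowd" schedule
```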
Citations: 14
H-RMC: A Hybrid Reliable Multicast Protocol for the Linux Kernel
ACM/IEEE SC 1999 Conference (SC'99) Pub Date: 1999 DOI: 10.1145/331532.331540
P. McKinley, R. Rao, R. F. Wright
This paper describes H-RMC, a reliable multicast protocol designed for implementation in the Linux kernel. H-RMC takes advantage of IP multicast and is primarily a NAK-based protocol. To accommodate low-loss environments, where feedback in the form of NAKs is scarce, H-RMC receivers return periodic update messages in the absence of other reverse traffic. H-RMC uses a combination of rate-based and window-based flow control. The sender maintains minimal information about each receiver so that buffered data is not released prematurely, and polls receivers in case it has not heard from them at the time of buffer release. Combined, these techniques produce a reliable multicast data stream with a relatively low rate of feedback. Performance results show that adequate kernel buffer space, combined with a two-stage rate control method and polling, are effective in minimizing feedback from receivers and thereby in maintaining reasonable throughputs.
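The NAK-based core of such a protocol, detecting sequence-number gaps and requesting only the missing packets rather than acknowledging every one, can be illustrated with a toy receiver. This is not the H-RMC wire format or its kernel implementation; the class and its behavior are a simplified, hypothetical sketch of loss detection only.

```python
# Toy NAK-based loss detection: the receiver tracks the next expected
# sequence number, buffers out-of-order packets, and reports only the
# missing ones. Not the H-RMC protocol itself.
class NakReceiver:
    def __init__(self):
        self.next_expected = 0      # next in-order sequence number
        self.buffered = {}          # out-of-order packets held until the gap fills

    def on_packet(self, seq, payload):
        """Handle one multicast data packet; return the sequence numbers to NAK."""
        naks = []
        if seq == self.next_expected:
            self.deliver(seq, payload)
            self.next_expected += 1
            # drain any buffered packets that are now in order
            while self.next_expected in self.buffered:
                self.deliver(self.next_expected, self.buffered.pop(self.next_expected))
                self.next_expected += 1
        elif seq > self.next_expected:
            # gap detected: request retransmission of everything missing
            naks = list(range(self.next_expected, seq))
            self.buffered[seq] = payload
        # seq < next_expected is a duplicate and is silently dropped
        return naks

    def deliver(self, seq, payload):
        print(f"deliver #{seq}: {payload}")

rx = NakReceiver()
print("NAKs:", rx.on_packet(0, "a"))    # in order -> no NAKs
print("NAKs:", rx.on_packet(3, "d"))    # packets 1, 2 lost -> NAK [1, 2]
print("NAKs:", rx.on_packet(1, "b"))
print("NAKs:", rx.on_packet(2, "c"))    # gap filled, 2 and 3 delivered in order
```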
Citations: 11
DeepView: A Channel for Distributed Microscopy and Informatics
ACM/IEEE SC 1999 Conference (SC'99) Pub Date: 1999 DOI: 10.1145/331532.331597
B. Parvin, John R. Taylor, G. Cong, M. O'Keefe, M. Barcellos-Hoff
This paper outlines the requirements, architecture, and design of a "Microscopy Channel" over the wide area network. A microscopy channel advertises a listing of available online microscopes, where users can seamlessly participate in an experiment, acquire expert opinions, collect and process data, and store this information in their electronic notebook. The proposed channel is a collaborative problem solving environment (CPSE) that allows for both synchronous and asynchronous collaboration. Our testbed includes several unique electron and optical microscopes with applications ranging from material science to cell biology. We have studied current commercial CORBA services and concluded that three basic services are needed to meet the extensibility and functionality constraints. These include: Instrument Services (IS), Exchange Services (ES), and Computational Services (CS). These services sit on top of CORBA and its enabling services (naming, trading, security, and notification). IS provide a layer of abstraction for controlling any type of microscope. ES provide a common set of utilities for information management and transaction. CS provide the analytical capabilities needed for online microscopy and PSE.
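The "layer of abstraction for controlling any type of microscope" that the Instrument Services provide is easiest to picture as a common interface with instrument-specific back ends. The real DeepView services are defined as CORBA interfaces; the Python rendering below, with made-up method names, only illustrates the abstraction-layer idea, not the actual IDL.

```python
# Hypothetical sketch of an instrument-service abstraction: clients program
# against a common Instrument interface, never against a concrete microscope.
# Method names and implementations are invented for illustration.
from abc import ABC, abstractmethod

class Instrument(ABC):
    """Common control interface, independent of the underlying microscope."""

    @abstractmethod
    def move_stage(self, x: float, y: float) -> None: ...

    @abstractmethod
    def acquire_image(self) -> bytes: ...

class OpticalMicroscope(Instrument):
    def move_stage(self, x, y):
        print(f"optical: moving stage to ({x}, {y})")

    def acquire_image(self):
        print("optical: acquiring frame from CCD")
        return b"\x00" * 16          # placeholder image data

class ElectronMicroscope(Instrument):
    def move_stage(self, x, y):
        print(f"electron: moving stage to ({x}, {y})")

    def acquire_image(self):
        print("electron: acquiring scan")
        return b"\xff" * 16

def run_experiment(scope: Instrument):
    # the experiment logic is the same regardless of the microscope type
    scope.move_stage(1.0, 2.0)
    return scope.acquire_image()

for scope in (OpticalMicroscope(), ElectronMicroscope()):
    run_experiment(scope)
```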
Citations: 15
Compiler-supported simulation of highly scalable parallel applications
ACM/IEEE SC 1999 Conference (SC'99) Pub Date: 1999 DOI: 10.1145/331532.331533
Vikram S. Adve, R. Bagrodia, E. Deelman, T. Phan, R. Sakellariou
In this paper, we propose and evaluate practical, automatic techniques that exploit compiler analysis to facilitate simulation of very large message-passing systems. We use a compiler-synthesized static task graph model to identify the control-flow and the subset of the computations that determine the parallelism, communication and synchronization of the code, and to generate symbolic estimates of sequential task execution times. This information allows us to avoid executing or simulating large portions of the computational code during the simulation. We have used these techniques to integrate the MPI-Sim parallel simulator at UCLA with the Rice dHPF compiler infrastructure. The integrated system can simulate unmodified High Performance Fortran (HPF) programs compiled to the Message-Passing Interface standard (MPI) by the dHPF compiler, and we expect to simulate MPI programs as well. We evaluate the accuracy and benefits of these techniques for three standard benchmarks on a wide range of problem and system sizes. Our results show that the optimized simulator has errors of less than 17% compared with direct program measurement in all the cases we studied, and typically much smaller errors. Furthermore, it requires factors of 5 to 2000 less memory and up to a factor of 10 less time to execute than the original simulator. These dramatic savings allow us to simulate systems and problem sizes 10 to 100 times larger than is possible with the original simulator.
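The central idea, advancing per-process virtual clocks by compiler-estimated task times instead of executing the computational code, and only modeling the communication events, can be shown in miniature. The event lists, cost estimates, and latency below are hypothetical and far simpler than the MPI-Sim/dHPF machinery.

```python
# Toy task-graph simulation: compute tasks are skipped and replaced by their
# estimated durations; sends and receives are matched to order the virtual
# clocks. All programs and numbers are made up for illustration.
COMM_LATENCY = 0.005   # assumed per-message network cost (seconds)

# Each process is a list of events from a (hypothetical) static task graph:
#   ("compute", estimated_seconds) | ("send", dest) | ("recv", src)
programs = {
    0: [("compute", 1.2), ("send", 1), ("compute", 0.3), ("recv", 1)],
    1: [("compute", 0.4), ("recv", 0), ("compute", 2.0), ("send", 0)],
}

def simulate(programs):
    clocks = {p: 0.0 for p in programs}
    msg_ready = {}                              # (src, dst) -> message arrival time
    pending = {p: list(evts) for p, evts in programs.items()}
    progress = True
    while progress:
        progress = False
        for p, evts in pending.items():
            while evts:
                kind, arg = evts[0]
                if kind == "compute":
                    clocks[p] += arg            # skip the real work, add its estimate
                elif kind == "send":
                    clocks[p] += COMM_LATENCY
                    msg_ready[(p, arg)] = clocks[p]
                elif kind == "recv":
                    if (arg, p) not in msg_ready:
                        break                   # block until the matching send is simulated
                    clocks[p] = max(clocks[p], msg_ready.pop((arg, p)))
                evts.pop(0)
                progress = True
    return clocks

print(simulate(programs))   # predicted completion time of each simulated process
```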
Citations: 21
Running EveryWare on the Computational Grid
ACM/IEEE SC 1999 Conference (SC'99) Pub Date: 1999 DOI: 10.1145/331532.331538
R. Wolski, J. Brevik, C. Krintz, Graziano Obertelli, N. Spring, Alan Su
The Computational Grid [10] has recently been proposed for the implementation of high-performance applications using widely dispersed computational resources. The goal of a Computational Grid is to aggregate ensembles of shared, heterogeneous, and distributed resources (potentially controlled by separate organizations) to provide computational "power" to an application program. In this paper, we provide a toolkit for the development of Grid applications. The toolkit, called EveryWare, enables an application to draw computational power transparently from the Grid. The toolkit consists of a portable set of processes and libraries that can be incorporated into an application so that a wide variety of dynamically changing distributed infrastructures and resources can be used together to achieve supercomputer-like performance. We provide our experiences gained while building the EveryWare toolkit prototype and the first true Grid application.
Citations: 41
A Personal Supercomputer for Climate Research
ACM/IEEE SC 1999 Conference (SC'99) Pub Date: 1999 DOI: 10.1145/331532.331591
J. Hoe, C. Hill, A. Adcroft
We describe and analyze the performance of a cluster of personal computers dedicated to coupled climate simulations. This climate modeling system performs comparably to state-of-the-art supercomputers and yet is affordable by individual research groups, thus enabling more spontaneous application of high-end numerical models to climate science. The cluster's novelty centers around the Arctic Switch Fabric and the StarT-X network interface, a system-area interconnect substrate developed at MIT. A significant fraction of the interconnect's hardware performance is made available to our climate model through an application-specific communication library. In addition to reporting the overall application performance of our cluster, we develop an analytical performance model of our application. Based on this model, we define a metric, Potential Floating-Point Performance, which we use to quantify the role of high-speed interconnects in determining application performance. Our results show that a high-performance interconnect, in conjunction with a light-weight application-specific library, provides efficient support for our fine-grain parallel application on an otherwise general-purpose commodity system.
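The paper's analytical model and the precise definition of its Potential Floating-Point Performance metric are not reproduced in the abstract, so the sketch below only gestures at the same kind of reasoning under stated assumptions: compare the delivered FLOP rate under a simple compute-plus-communication cost model with what the same nodes could deliver over an ideal interconnect. Every number is made up.

```python
# Back-of-the-envelope compute-plus-communication model, in the spirit of the
# analytical model the abstract mentions. Not the paper's actual metric or
# numbers; all quantities below are invented for illustration.
flops_per_step   = 4.0e9      # floating-point work per model time step
compute_time     = 0.80       # seconds spent computing per step
bytes_exchanged  = 64e6       # halo-exchange volume per step (bytes)
bandwidth        = 100e6      # interconnect bandwidth (bytes/s)
per_msg_overhead = 0.002      # software + latency cost per message (seconds)
messages         = 24

comm_time = messages * per_msg_overhead + bytes_exchanged / bandwidth
achieved  = flops_per_step / (compute_time + comm_time)   # delivered FLOP/s
potential = flops_per_step / compute_time                 # ideal, zero-cost interconnect

print(f"communication time per step: {comm_time:.3f} s")
print(f"achieved:  {achieved / 1e9:.2f} GFLOP/s")
print(f"potential: {potential / 1e9:.2f} GFLOP/s")
print(f"fraction of potential realized: {achieved / potential:.2%}")
```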
Citations: 3