CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.最新文献_第8页

Application-bypass broadcast in MPICH over GM 基于GM的MPICH应用旁路广播

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. Pub Date : 2003-05-12 DOI: 10.1109/CCGRID.2003.1199346

Darius Buntinas, D. Panda, R. Brightwell

{"title":"Application-bypass broadcast in MPICH over GM","authors":"Darius Buntinas, D. Panda, R. Brightwell","doi":"10.1109/CCGRID.2003.1199346","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199346","url":null,"abstract":"Processes of a parallel program can become unsynchronized, or skewed, during the course of running an application. Processes can become skewed as a result of unbalanced or asymmetric rode, or through the use of heterogeneous systems, where nodes in the system have different performance characteristics, as well as random, unpredictable effects such as the processes not being started at exactly the same time, or processors receiving interrupts during computation. Geographically distributed systems may have more severe skew because of variable communication times. Such skew can have a significant impact on the performance of collective communication operations which impose an implicit synchronization. The broadcast operation in MPICH is one such operation. An application-bypass broadcast operation is one which does not depend on the application running at a process to make progress. Such an operation would not be as sensitive to process skew. This paper describes the design and implementation of an application-bypass broadcast operation. We evaluated the implementation and find a factor of improvement of up to 16 for application-bypass broadcast compared to non-application-bypass broadcast when processes are skewed. Furthermore we see that as the system size increases, the effects of skew on non-application-bypass broadcast also increase. The application-bypass broadcast is much less sensitive to process skew which makes it more scalable than the non-application-bypass broadcast operation.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117168717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

An exposed approach to reliable multicast in heterogeneous logistical networks 异构物流网络中可靠组播的一种公开方法

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. Pub Date : 2003-05-12 DOI: 10.1109/CCGRID.2003.1199410

Micah Beck, Ying Ding, E. Fuentes, Sharmila Kancherl

引用次数: 14

Performance of cluster-enabled OpenMP for the SCASH software distributed shared memory system 在SCASH软件分布式共享内存系统中启用集群的OpenMP性能

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. Pub Date : 2003-05-12 DOI: 10.1109/CCGRID.2003.1199400

Y. Ojima, M. Sato, H. Harada, Y. Ishikawa

引用次数: 23

Noncontiguous I/O accesses through MPI-IO 通过MPI-IO进行不连续I/O访问

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. Pub Date : 2003-05-12 DOI: 10.1109/CCGRID.2003.1199358

A. Ching, A. Choudhary, Kenin Coloma, W. Liao, R. Ross, W. Gropp

引用次数: 83

An agent version of a cluster server 集群服务器的代理版本

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. Pub Date : 2003-05-12 DOI: 10.1109/CCGRID.2003.1199426

Andraz Bezek, M. Gams

引用次数: 1

Fault-tolerant distributed mass storage for LHC computing LHC计算的容错分布式海量存储

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. Pub Date : 2003-05-12 DOI: 10.1109/CCGRID.2003.1199377

A. Wiebalck, Peter T. Breuer, V. Lindenstruth, T. Steinbeck

引用次数: 13

Implementation of page management in Mome, a user-level DSM 在Mome中实现页面管理，一个用户级的DSM

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. Pub Date : 2003-05-12 DOI: 10.1109/CCGRID.2003.1199404

Y. Jégou

引用次数: 16

Using topology-aware communication services in grid environments 在网格环境中使用拓扑感知通信服务

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. Pub Date : 2003-05-12 DOI: 10.1109/CCGRID.2003.1199411

Craig A. Lee, E. Coe, B. S. Michel, J. Stepanek, I. Solis, J. Clark, Brooks Davis

{"title":"Using topology-aware communication services in grid environments","authors":"Craig A. Lee, E. Coe, B. S. Michel, J. Stepanek, I. Solis, J. Clark, Brooks Davis","doi":"10.1109/CCGRID.2003.1199411","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199411","url":null,"abstract":"This paper investigates the use of advanced communication services in grid environments. Such services can include augmented communication semantics (e.g., filtering), collective operations, content-based and policy-based routing, and managing communication scope to manage feasibility. These services could be implemented and deployed in a variety of ways, such as a traditional network of servers, or as a middleware forwarding and routing layer, or even in an active network. In any of these approaches, topology-awareness can play a major role in their performance and scalability. As a case study, we demonstrate here a communication service to support time management in distributed simulations that is managed using a grid computing toolkit. We also present emulation and simulation results to demonstrate the scalability that topology-awareness enables for services such as time management. Since the design space for communication services offers so many possibilities and alternatives, we argue for the definition of proper high-level models and APIs such that the underlying implementations and scope of deployment can be developed and improved with minimal impact on applications.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126784138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Magnetic resonance imaging (MRI) simulation on a grid computing architecture 基于网格计算架构的磁共振成像(MRI)仿真

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. Pub Date : 2003-05-12 DOI: 10.1109/CCGRID.2003.1199417

H. Benoit-Cattin, F. Bellet, J. Montagnat, C. Odet

引用次数: 24

Recovering internet symmetry in distributed computing 在分布式计算中恢复互联网对称性

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. Pub Date : 2003-05-12 DOI: 10.1109/CCGRID.2003.1199412

Se-Chang Son, M. Livny

{"title":"Recovering internet symmetry in distributed computing","authors":"Se-Chang Son, M. Livny","doi":"10.1109/CCGRID.2003.1199412","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199412","url":null,"abstract":"This paper describes two systems to recover the Internet connectivity impaired by private networks and firewalls. These devices cause asymmetry in the Internet, making peer-to-peer computing difficult or even impossible. The Condor system is one of those that are severely impaired by the asymmetry. Compared to normal peer-to-peer computing applications, Condor has stricter requirements, which are representative to any grid computing. To make Condor seamlessly work across private networks and over firewalls, we designed and implemented Dynamic Port Forwarding (DPF) and Generic Connection Brokering (GCB). Both DPF and GCB satisfy the representative requirements. Furthermore DPF supports dedicated large clusters very well because it is simple, efficient, and highly scalable. On the other hand, GCB perfectly supports non-dedicated or personal clusters because it is independent to private network or firewall technologies and does not require airy administrative power to deploy it. In this paper, we describe the implementations of DPF and GCB and analyze them with respect to performance, deployability, security, and scalability.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"146 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127602066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 48