{"title":"Constructing Gene Regulatory Networks on Clusters of Cell Processors","authors":"J. Zola, Abhinav Sarje, S. Aluru","doi":"10.1109/ICPP.2009.35","DOIUrl":"https://doi.org/10.1109/ICPP.2009.35","url":null,"abstract":"Constructing genome-wide gene regulatory networks from a large number of gene expression profile measurements is an important problem in systems biology. While several techniques have been developed, none of them is parallel, and they lack the capability to scale to the whole-genome level or incorporate the largest data sets, particularly with rigorous statistical testing. To address this problem, we recently developed a mutual information theory based parallel method for gene network reconstruction. In this paper, we extend this work to a cluster of Cell processors. We use parallelization across multiple Cells, multiple cores within each Cell, and vector units within the cores to develop a high performance implementation that effectively addresses the scaling problem. We present experimental results comparing the Cell implementation with a standard uniprocessor implementation and an implementation on a conventional supercomputer. Finally, we report the construction of a large 15,203 gene network of the plant Arabidopsis thaliana from 2,996 microarray experiments on an 8-node Cell blade cluster in 2 hours and 24 minutes.","PeriodicalId":169408,"journal":{"name":"2009 International Conference on Parallel Processing","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133213092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Integrated Performance Views in Charm++: Projections Meets TAU","authors":"Scott Biersdorff, Chee Wai Lee, A. Malony, L. Kalé","doi":"10.1109/ICPP.2009.49","DOIUrl":"https://doi.org/10.1109/ICPP.2009.49","url":null,"abstract":"The Charm++ parallel programming system provides a modular performance interface that can be used to extend its performance measurement and analysis capabilities. The interface exposes execution events of interest representing Charm++ scheduling operations, application methods/routines, and communication events for observation by alternative performance modules configured to implement different measurement features. The paper describes the Charm++'s performance interface and how the Charm++ Projections tool and the TAU Performance System can provide integrated trace-based and profile-based performance views. These two tools are complementary, providing the user with different performance perspectives on Charm++ applications based on performance data detail and temporal and spatial analysis. How the tools work in practice is demonstrated in a parallel performance analysis of NAMD, a scalable molecular dynamics code that applies many of Charm++'s unique features.","PeriodicalId":169408,"journal":{"name":"2009 International Conference on Parallel Processing","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128844679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On the Scalability of Parallel Verilog Simulation","authors":"S. Meraji, Wei Zhang, C. Tropper","doi":"10.1109/ICPP.2009.9","DOIUrl":"https://doi.org/10.1109/ICPP.2009.9","url":null,"abstract":"As a consequence of Moore’s law, the size of integrated circuits has grown extensively, resulting in simulation becoming the major bottleneck in the circuit design process. Consequently, parallel simulation has emerged as an approach which can be both fast and cost effective. In this paper, we examine the performance of a parallel Verilog simulator on four large, real designs. As previous work has made use of either relatively small benchmarks or synthetic circuits, the use of these circuits is far more realistic. We develop a parser for Verilog files enabling us to simulate in parallel all synthesizable Verilog circuits. We utilize four circuits as our test benches: the LEON processor with 200k gates, the OpenSparc T2 processor with 400k gates, and two Viterbi decoder circuits with 100k and 800k gates respectively. The simulator makes use of XTW and, to our knowledge, is the first Verilog simulator which can parse all synthesizable Verilog files. We observed 4,000,000 events per second on 32 processors for the Viterbi decoder with 800k gates. The simulator’s performance was shown to be scalable.","PeriodicalId":169408,"journal":{"name":"2009 International Conference on Parallel Processing","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115499562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cache-Efficient, Intranode, Large-Message MPI Communication with MPICH2-Nemesis","authors":"Darius Buntinas, Brice Goglin, David Goodell, Guillaume Mercier, Stéphanie Moreaud","doi":"10.1109/ICPP.2009.22","DOIUrl":"https://doi.org/10.1109/ICPP.2009.22","url":null,"abstract":"The emergence of multicore processors raises the need to efficiently transfer large amounts of data between local processes. MPICH2 is a highly portable MPI implementation whose large-message communication schemes suffer from high CPU utilization and cache pollution because of the use of a double-buffering strategy, common to many MPI implementations. We introduce two strategies offering a kernel-assisted, single-copy model with support for noncontiguous and asynchronous transfers. The first one uses the now widely available vmsplice Linux system call; the second one further improves performance thanks to a custom kernel module called KNEM. The latter also offers I/OAT copy offload, which is dynamically enabled depending on both hardware cache characteristics and message size. These new solutions outperform the standard transfer method in the MPICH2 implementation when no cache is shared between the processing cores or when very large messages are being transferred. Collective communication operations show a dramatic improvement, and the IS NAS parallel benchmark shows a 25% speedup and better cache efficiency.","PeriodicalId":169408,"journal":{"name":"2009 International Conference on Parallel Processing","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128169567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Distributed Three-hop Routing Protocol to Increase the Capacity of Hybrid Networks","authors":"Ze Li, Haiying Shen","doi":"10.1109/ICPP.2009.36","DOIUrl":"https://doi.org/10.1109/ICPP.2009.36","url":null,"abstract":"Hybrid wireless networks, which combine the advantages of both ad-hoc networks and infrastructure wireless networks, have been receiving increasing attention because of their ultra-high performance. An efficient data routing protocol is an important component in such networks for high capacity and scalability. However, most routing protocols for these networks simply combine an ad-hoc transmission mode with a cellular transmission mode, and thus fail to take advantage of the dual-feature architecture. This paper presents a distributed Three-hop Routing (DTR) protocol for hybrid wireless networks. DTR divides a message data stream into segments and transmits the segments in a distributed manner. It makes full spatial reuse of the system via the high-speed ad-hoc interface and alleviates mobile gateway congestion via the cellular interface. Furthermore, sending segments to a number of base stations simultaneously increases the throughput and makes full use of widespread base stations. In addition, DTR significantly reduces overhead due to short path length and eliminates route discovery and maintenance overhead. Theoretical analysis and simulation results show the superiority of DTR in comparison with other routing protocols in terms of throughput capacity, scalability and mobility resilience.","PeriodicalId":169408,"journal":{"name":"2009 International Conference on Parallel Processing","volume":"134 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127435513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimizing Communication Scheduling Using Dataflow Semantics","authors":"Adrian Soviani, J. Singh","doi":"10.1109/ICPP.2009.66","DOIUrl":"https://doi.org/10.1109/ICPP.2009.66","url":null,"abstract":"We show how coarse grain dataflow semantics (CGD) applied to SPMD algorithms makes application development and design space exploration simpler than message passing, while providing on-par performance. CGD applications are specified as dependencies between computation modules and data distributions. Communication and synchronization are added automatically and optimized for specific architectures, relieving programmers of this task. Many high level algorithm changes are easy to implement in CGD by redefining data distributions. These include exposing communication overlap by decreasing task grain, and aggregating communication by replicating data and computation. We briefly present a coordination language with dataflow semantics that implements the CGD model. Our implementation currently supports MPI, SHMEM, and pthreads. Results on the Altix 4700 show that our optimized CGD FT is 27% faster than the original NPB 2.3 MPI implementation, and the optimized CGD stencil has a 41% advantage over handwritten MPI.","PeriodicalId":169408,"journal":{"name":"2009 International Conference on Parallel Processing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122775042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Resource Optimized Remote-Memory-Access Architecture for Low-latency Communication","authors":"M. Nüssle, Martin Scherer, U. Brüning","doi":"10.1109/ICPP.2009.62","DOIUrl":"https://doi.org/10.1109/ICPP.2009.62","url":null,"abstract":"This paper introduces a new highly optimized architecture for remote memory access (RMA). RMA, using put and get operations, is a one-sided communication function which, among others, is important in current and upcoming Partitioned Global Address Space (PGAS) systems. In this work, a virtualized hardware unit is described which is resource optimized and exhibits high overlap, processor offload, and very good latency characteristics. To start an RMA operation, a single HyperTransport packet caused by one CPU instruction is sufficient, thus reducing latency to an absolute minimum. In addition to the basic architecture, an implementation in FPGA technology is presented together with an evaluation of the target ASIC implementation. The current system can sustain more than 4.9 million transactions per second on the FPGA and exhibits an end-to-end latency of 1.2 μs for an 8-byte put operation. Both values are limited by the FPGA technology used for the prototype implementation. An estimation of the performance reachable with ASIC technology suggests that application-to-application latencies of less than 500 ns are feasible.","PeriodicalId":169408,"journal":{"name":"2009 International Conference on Parallel Processing","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116540902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scalable Parallel Execution of an Event-Based Radio Signal Propagation Model for Cluttered 3D Terrains","authors":"S. Seal, K. Perumalla","doi":"10.1109/ICPP.2009.42","DOIUrl":"https://doi.org/10.1109/ICPP.2009.42","url":null,"abstract":"Estimation of radio signal strength is essential in many applications, including the design of military radio communications and industrial wireless installations. While classical approaches such as finite difference methods are well-known, new event-based models of radio signal propagation have been recently shown to deliver such estimates faster (via serial execution) when compared to other methods. For scenarios with large or richly-featured geographical volumes, however, parallel processing is required to meet the memory and computation time demands. Here, we present a scalable and efficient parallel execution of a recently-developed event-based radio signal propagation model. We demonstrate its scalability to thousands of processors, with parallel speedups over 1000x. The speed and scale achieved by our parallel execution allow for larger scenarios and faster execution than has ever been reported before.","PeriodicalId":169408,"journal":{"name":"2009 International Conference on Parallel Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115105746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"LeWI: A Runtime Balancing Algorithm for Nested Parallelism","authors":"Marta Garcia, J. Corbalán, J. Labarta","doi":"10.1109/ICPP.2009.56","DOIUrl":"https://doi.org/10.1109/ICPP.2009.56","url":null,"abstract":"We present LeWI, a novel load balancing algorithm that can balance applications with very different patterns of imbalance. Our algorithm can balance fine-grain imbalances, non-iterative applications, and applications with irregular imbalance. To achieve this, LeWI reassigns the computational resources of blocked processes to more heavily loaded processes. We have implemented LeWI within DLB, a Dynamic Load Balancing library we developed. DLB helps parallel programming models make the most of the computational power available with minimum effort. It solves the imbalance among processes in applications with two levels of parallelism using the malleability of the inner level. The performance evaluation shows that LeWI, together with DLB, improves the performance of a range of unbalanced applications, and that it does not introduce significant overhead when applied to well-balanced applications. We therefore present a mechanism that can be used with any hybrid application without requiring a programmer to analyze or modify the application.","PeriodicalId":169408,"journal":{"name":"2009 International Conference on Parallel Processing","volume":"651 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123347843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Group Operation Assembly Language - A Flexible Way to Express Collective Communication","authors":"T. Hoefler, Christian Siebert, A. Lumsdaine","doi":"10.1109/ICPP.2009.70","DOIUrl":"https://doi.org/10.1109/ICPP.2009.70","url":null,"abstract":"The implementation and optimization of collective communication operations is an important field of active research. Such operations directly influence application performance and need to map the communication requirements in an optimal way to steadily changing network architectures. In this work, we define an abstract domain-specific language to express arbitrary group communication operations. We show the universality of this language and how all existing collective operations can be implemented with it. By design, it readily lends itself to blocking and nonblocking execution, as well as to off-loaded execution of complex group communication operations. We also define several offline and online optimizations (compiler transformations and scheduling decisions, respectively) to improve the overall performance of the operation. Performance results show that the overhead to express current collective operations is negligible in comparison to the potential gains in a highly optimized implementation.","PeriodicalId":169408,"journal":{"name":"2009 International Conference on Parallel Processing","volume":"360 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121721591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}