{"title":"Multi-core acceleration of chemical kinetics for simulation and prediction","authors":"J. C. Linford, J. Michalakes, Manish Vachharajani, Adrian Sandu","doi":"10.1145/1654059.1654067","DOIUrl":"https://doi.org/10.1145/1654059.1654067","url":null,"abstract":"This work implements a computationally expensive chemical kinetics kernel from a large-scale community atmospheric model on three multi-core platforms: NVIDIA GPUs using CUDA, the Cell Broadband Engine, and Intel Quad-Core Xeon CPUs. A comparative performance analysis for each platform in double and single precision on coarse and fine grids is presented. Platform-specific design and optimization is discussed in a mechanism-agnostic way, permitting the optimization of many chemical mechanisms. The implementation of a three-stage Rosenbrock solver for SIMD architectures is discussed. When used as a template mechanism in the Kinetic PreProcessor, the multi-core implementation enables the automatic optimization and porting of many chemical mechanisms on a variety of multi-core platforms. Speedups of 5.5x in single precision and 2.7x in double precision are observed when compared to eight Xeon cores. Compared to the serial implementation, the maximum observed speedup is 41.1x in single precision.","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116487811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Improving GridFTP performance using the Phoebus session layer","authors":"E. Kissel, M. Swany, Aaron Brown","doi":"10.1145/1654059.1654094","DOIUrl":"https://doi.org/10.1145/1654059.1654094","url":null,"abstract":"Phoebus is an infrastructure for improving end-to-end throughput in high-bandwidth, long-distance networks by using a \"session layer\" protocol and \"gateways\" in the network. Phoebus has the ability to dynamically allocate network resources and to use segment-specific transport protocols between gateways, as well as to apply other performance-improving techniques on behalf of the user. One of the key data movement applications in high-performance and Grid computing is GridFTP from the Globus project. GridFTP features a modular library interface called XIO that allows it to use alternative transport mechanisms. To facilitate use of the Phoebus system, we have implemented a Globus XIO driver for Phoebus. This paper presents tests of the Phoebus-enabled GridFTP over a network testbed that allows us to modify latency and loss rates. We discuss use of various transport connections, both end-to-end and hop-by-hop, and evaluate the performance of a variety of cases. We demonstrate that Phoebus can improve performance in a diverse set of scenarios; in many instances it outperforms the state of the art.","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130691786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Terascale data organization for discovering multivariate climatic trends","authors":"W. Kendall, M. Glatter, Jian Huang, T. Peterka, R. Latham, R. Ross","doi":"10.1145/1654059.1654075","DOIUrl":"https://doi.org/10.1145/1654059.1654075","url":null,"abstract":"Current visualization tools lack the ability to perform full-range spatial and temporal analysis on terascale scientific datasets. Two key reasons exist for this shortcoming: I/O and postprocessing on these datasets are being performed in suboptimal manners, and the subsequent data extraction and analysis routines have not been studied in depth at large scales. We resolved these issues through advanced I/O techniques and improvements to current query-driven visualization methods. We show the efficiency of our approach by analyzing over a terabyte of multivariate satellite data and addressing two key issues in climate science: time-lag analysis and drought assessment. Our methods allowed us to reduce the end-to-end execution times on these problems to one minute on a Cray XT4 machine.","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123480748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems","authors":"Fengguang Song, A. YarKhan, J. Dongarra","doi":"10.1145/1654059.1654079","DOIUrl":"https://doi.org/10.1145/1654059.1654079","url":null,"abstract":"This paper presents a dynamic task scheduling approach to executing dense linear algebra algorithms on multicore systems (either shared-memory or distributed-memory). We use a task-based library to replace the existing linear algebra subroutines such as PBLAS to transparently provide the same interface and computational function as the ScaLAPACK library. Linear algebra programs are written with the task-based library and executed by a dynamic runtime system. We mainly focus our runtime system design on the metric of performance scalability. We propose a distributed algorithm to solve data dependences without process cooperation. We have implemented the runtime system and applied it to three linear algebra algorithms: Cholesky, LU, and QR factorizations. Our experiments on both shared-memory machines (16, 32 cores) and distributed-memory machines (1024 cores) demonstrate that our runtime system is able to achieve good scalability. Furthermore, we provide an analysis showing why the tiled algorithms are scalable and deriving the expected execution time.","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114393874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scalable work stealing","authors":"James Dinan, D. B. Larkins, P. Sadayappan, S. Krishnamoorthy, J. Nieplocha","doi":"10.1145/1654059.1654113","DOIUrl":"https://doi.org/10.1145/1654059.1654113","url":null,"abstract":"Irregular and dynamic parallel applications pose significant challenges to achieving scalable performance on large-scale multicore clusters. These applications often require ongoing, dynamic load balancing in order to maintain efficiency. Scalable dynamic load balancing on large clusters is a challenging problem which can be addressed with distributed dynamic load balancing systems. Work stealing is a popular approach to distributed dynamic load balancing; however its performance on large-scale clusters is not well understood. Prior work on work stealing has largely focused on shared memory machines. In this work we investigate the design and scalability of work stealing on modern distributed memory systems. We demonstrate high efficiency and low overhead when scaling to 8,192 processors for three benchmark codes: a producer-consumer benchmark, the unbalanced tree search benchmark, and a multiresolution analysis kernel.","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122999495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Leveraging 3D PCRAM technologies to reduce checkpoint overhead for future exascale systems","authors":"Xiangyu Dong, Naveen Muralimanohar, N. Jouppi, R. Kaufmann, Yuan Xie","doi":"10.1145/1654059.1654117","DOIUrl":"https://doi.org/10.1145/1654059.1654117","url":null,"abstract":"The scalability of future massively parallel processing (MPP) systems is challenged by high failure rates. Current hard disk drive (HDD) checkpointing results in overhead of 25% or more at the petascale. With a direct correlation between checkpoint frequencies and node counts, novel techniques that can take more frequent checkpoints with minimum overhead are critical to implement a reliable exascale system. In this work, we leverage the upcoming Phase-Change Random Access Memory (PCRAM) technology and propose a hybrid local/global checkpointing mechanism after a thorough analysis of MPP systems failure rates and failure sources. We propose three variants of PCRAM-based hybrid checkpointing schemes, DIMM+HDD, DIMM+DIMM, and 3D+3D, to reduce the checkpoint overhead and offer a smooth transition from the conventional pure HDD checkpoint to the ideal 3D PCRAM mechanism. The proposed pure 3D PCRAM-based mechanism can ultimately take checkpoints with overhead less than 4% on a projected exascale system.","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130609847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Auto-tuning 3-D FFT library for CUDA GPUs","authors":"Akira Nukada, S. Matsuoka","doi":"10.1145/1654059.1654090","DOIUrl":"https://doi.org/10.1145/1654059.1654090","url":null,"abstract":"Existing implementations of FFTs on GPUs are optimized for specific transform sizes like powers of two, and exhibit unstable, peaky performance, i.e., they do not perform well for other sizes that appear in practice. Our new auto-tuning 3-D FFT on CUDA generates high performance CUDA kernels for FFTs of varying transform sizes, alleviating this problem. Although auto-tuning has been implemented on GPUs for dense kernels such as DGEMM and stencils, this is the first instance that has been applied comprehensively to bandwidth intensive and complex kernels such as 3-D FFTs. Bandwidth intensive optimizations such as selecting the number of threads and inserting padding to avoid bank conflicts on shared memory are systematically applied. Our resulting autotuner is fast and results in performance that essentially beats all 3-D FFT implementations on a single processor to date, and moreover exhibits stable performance irrespective of problem sizes or the underlying GPU hardware.","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"8 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113942459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Router designs for elastic buffer on-chip networks","authors":"George Michelogiannakis, W. Dally","doi":"10.1145/1654059.1654062","DOIUrl":"https://doi.org/10.1145/1654059.1654062","url":null,"abstract":"This paper explores the design space of elastic buffer (EB) routers by evaluating three representative designs. We propose an enhanced two-stage EB router which maximizes throughput by achieving a 42% reduction in cycle time and 20% reduction in occupied area by using look-ahead routing and replacing the three-slot output EBs in the baseline router of [17] with two-slot EBs. We also propose a single-stage router which merges the two pipeline stages to avoid pipelining overhead. This design reduces zero-load latency by 24% compared to the enhanced two-stage router if both are operated at the same clock frequency; moreover, the single-stage router reduces the required energy per transferred bit and occupied area by 29% and 30% respectively, compared to the enhanced two-stage router. However, the cycle time of the enhanced two-stage router is 26% smaller than that of the single-stage router.","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127055205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Minimizing communication in sparse matrix solvers","authors":"M. Mohiyuddin, M. Hoemmen, J. Demmel, K. Yelick","doi":"10.1145/1654059.1654096","DOIUrl":"https://doi.org/10.1145/1654059.1654096","url":null,"abstract":"Data communication within the memory system of a single processor node and between multiple nodes in a system is the bottleneck in many iterative sparse matrix solvers like CG and GMRES. Here k iterations of a conventional implementation perform k sparse-matrix-vector-multiplications and Ω(k) vector operations like dot products, resulting in communication that grows by a factor of Ω(k) in both the memory and network. By reorganizing the sparse-matrix kernel to compute a set of matrix-vector products at once and reorganizing the rest of the algorithm accordingly, we can perform k iterations by sending O(log P) messages instead of O(k · log P) messages on a parallel machine, and reading the matrix A from DRAM to cache just once, instead of k times on a sequential machine. This reduces communication to the minimum possible. We combine these techniques to form a new variant of GMRES. Our shared-memory implementation on an 8-core Intel Clovertown gets speedups of up to 4.3x over standard GMRES, without sacrificing convergence rate or numerical stability.","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121078080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Liquid water: obtaining the right answer for the right reasons","authors":"E. Aprá, Alistair P. Rendell, R. Harrison, V. Tipparaju, W. A. Jong, S. Xantheas","doi":"10.1145/1654059.1654127","DOIUrl":"https://doi.org/10.1145/1654059.1654127","url":null,"abstract":"Water is ubiquitous on our planet and plays an essential role in several key chemical and biological processes. Accurate models for water are crucial in understanding, controlling and predicting the physical and chemical properties of complex aqueous systems. Over the last few years we have been developing a molecular-level based approach for a macroscopic model for water that is based on the explicit description of the underlying intermolecular interactions between molecules in water clusters. In the absence of detailed experimental data for small water clusters, highly-accurate theoretical results are required to validate and parameterize model potentials. As an example of the benchmarks needed for the development of accurate models for the interaction between water molecules, for the most stable structure of (H2O)20 we ran a coupled-cluster calculation on ORNL's Jaguar petaflop computer that used over 100 TB of memory for a sustained performance of 487 TFLOP/s (double precision) on 96,000 processors, lasting for 2 hours. By this summer we will have studied multiple structures of both (H2O)20 and (H2O)24 and completed basis set and other convergence studies, and we anticipate the sustained performance rising close to 1 PFLOP/s.","PeriodicalId":371415,"journal":{"name":"Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126030792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}