ACM/IEEE SC 2002 Conference (SC'02)最新文献_第2页

Asserting Performance Expectations 确立绩效预期

ACM/IEEE SC 2002 Conference (SC'02) Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10046

J. Vetter, P. Worley

引用次数: 58

Active Proxy-G: Optimizing the Query Execution Process in the Grid 主动代理- g:优化网格中的查询执行过程

ACM/IEEE SC 2002 Conference (SC'02) Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10031

H. Andrade, T. Kurç, A. Sussman, J. Saltz

引用次数: 35

Scalable Directory Services Using Proactivity 使用主动性的可伸缩目录服务

ACM/IEEE SC 2002 Conference (SC'02) Pub Date : 2002-11-16 DOI: 10.5555/762761.762786

F. Bustamante, Patrick M. Widener, K. Schwan

引用次数: 32

Performance Optimizations and Bounds for Sparse Matrix-Vector Multiply 稀疏矩阵向量乘法的性能优化与边界

ACM/IEEE SC 2002 Conference (SC'02) Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10025

R. Vuduc, J. Demmel, K. Yelick, S. Kamil, R. Nishtala, Benjamin C. Lee

{"title":"Performance Optimizations and Bounds for Sparse Matrix-Vector Multiply","authors":"R. Vuduc, J. Demmel, K. Yelick, S. Kamil, R. Nishtala, Benjamin C. Lee","doi":"10.1109/SC.2002.10025","DOIUrl":"https://doi.org/10.1109/SC.2002.10025","url":null,"abstract":"We consider performance tuning, by code and data structure reorganization, of sparse matrix-vector multiply (SpM×V), one of the most important computational kernels in scientific applications. This paper addresses the fundamental questions of what limits exist on such performance tuning, and how closely tuned code approaches these limits. Specifically, we develop upper and lower bounds on the performance (Mflop/s) of SpM×V when tuned using our previously proposed register blocking optimization. These bounds are based on the non-zero pattern in the matrix and the cost of basic memory operations, such as cache hits and misses. We evaluate our tuned implementations with respect to these bounds using hardware counter data on 4 different platforms and on test set of 44 sparse matrices. We find that we can often get within 20% of the upper bound, particularly on class of matrices from finite element modeling (FEM) problems; on non-FEM matrices, performance improvements of 2× are still possible. Lastly, we present new heuristic that selects optimal or near-optimal register block sizes (the key tuning parameters) more accurately than our previous heuristic. Using the new heuristic, we show improvements in SpM×V performance (Mflop/s) by as much as 2.5× over an untuned implementation. Collectively, our results suggest that future performance improvements, beyond those that we have already demonstrated for SpM×V, will come from two sources: (1) consideration of higher-level matrix structures (e.g. exploiting symmetry, matrix reordering, multiple register block sizes), and (2) optimizing kernels with more opportunity for data reuse (e.g. sparse matrix-multiple vector multiply, multiplication of AT A by a vector).","PeriodicalId":302800,"journal":{"name":"ACM/IEEE SC 2002 Conference (SC'02)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117330874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 152

Compact Application Signatures for Parallel and Distributed Scientific Codes 并行和分布式科学代码的紧凑应用签名

ACM/IEEE SC 2002 Conference (SC'02) Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10059

Charng-Da Lu, D. Reed

引用次数: 27

Executing Multiple Pipelined Data Analysis Operations in the Grid 在网格中执行多个流水线数据分析操作

ACM/IEEE SC 2002 Conference (SC'02) Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10015

M. Spencer, R. Ferreira, M. Beynon, T. Kurç, Ümit V. Çatalyürek, A. Sussman, J. Saltz

引用次数: 57

Interoperable Web Services for Computational Portals 用于计算门户的可互操作Web服务

ACM/IEEE SC 2002 Conference (SC'02) Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10030

M. Pierce, G. Fox, Choon-Han Youn, S. Mock, K. Mueller, Ozgur Balsoy

引用次数: 52

SIGMA: A Simulator Infrastructure to Guide Memory Analysis SIGMA:引导内存分析的模拟器基础结构

ACM/IEEE SC 2002 Conference (SC'02) Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10055

L. D. Rose, K. Ekanadham, J. Hollingsworth, S. Sbaraglia

引用次数: 82

A New Scheduling Algorithm for Parallel Sparse LU Factorization with Static Pivoting 一种新的静态旋转并行稀疏LU分解调度算法

ACM/IEEE SC 2002 Conference (SC'02) Pub Date : 2002-11-16 DOI: 10.1109/SC.2002.10032

L. Grigori, X. Li

引用次数: 13

Data Reservoir: Utilization of Multi-Gigabit Backbone Network for Data-Intensive Research 数据库:利用多千兆骨干网进行数据密集型研究

ACM/IEEE SC 2002 Conference (SC'02) Pub Date : 2002-11-16 DOI: 10.5555/762761.762826

K. Hiraki, M. Inaba, J. Tamatsukuri, Ryutaro Kurusu, Yukichi Ikuta, Hisashi Koga, A. Zinzaki

引用次数: 16