Performance of NIKE3D with PVSOLVE on vector and parallel computers

Computing Systems in Engineering, Pub Date: 1994-08-01, DOI: 10.1016/0956-0521(94)90018-3

Bradley N. Maker, Qin Jiangning, Duc T. Nguyen

Volume 5(4), Pages 363-368

Citations: 8

Abstract

The cost of large implicit finite element calculations is dominated by the linear equation solver. In this work, several versions of the direct linear equation solver PVSOLVE were implemented in the general purpose, nonlinear finite element code NIKE3D. Timing studies were performed on Sun and IBM workstations, CRAY Y/MP and C90 supercomputers, and the distributed memory MEIKO CS-2 parallel computer. Cost breakdowns and efficiency curves are presented for a benchmark problem with several mesh densities. The distributed memory implementation is shown to be most efficient for large problems, where MEIKO performance is comparable to that of a single CRAY processor.
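The efficiency curves mentioned in the abstract are not reproduced here. As a minimal sketch, assuming the conventional definitions of speedup, S = T_1 / T_p, and parallel efficiency, E = S / p (the abstract does not state which definitions the authors used), such a curve could be computed from timing data as follows. The function names and the timings are hypothetical and are not taken from the paper.

```python
def speedup(t_serial: float, t_parallel: float) -> float:
    """Speedup S = T_1 / T_p for one timing pair."""
    if t_serial <= 0 or t_parallel <= 0:
        raise ValueError("timings must be positive")
    return t_serial / t_parallel


def efficiency(t_serial: float, t_parallel: float, n_procs: int) -> float:
    """Parallel efficiency E = T_1 / (p * T_p)."""
    if n_procs < 1:
        raise ValueError("n_procs must be at least 1")
    return speedup(t_serial, t_parallel) / n_procs


def efficiency_curve(timings: dict[int, float]) -> dict[int, float]:
    """Map processor count to efficiency, using the 1-processor time as baseline."""
    t1 = timings[1]
    return {p: efficiency(t1, tp, p) for p, tp in sorted(timings.items())}


if __name__ == "__main__":
    # Hypothetical wall-clock times in seconds; NOT measurements from the paper.
    hypothetical = {1: 100.0, 2: 55.0, 4: 30.0, 8: 20.0}
    for p, e in efficiency_curve(hypothetical).items():
        print(f"p={p}: efficiency={e:.2f}")
```

Plotting one such curve per mesh density would show how efficiency changes with problem size, which is the kind of comparison the abstract describes for the benchmark problem.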
