NIKE3D与PVSOLVE在矢量和并行计算机上的性能

Bradley N. Maker , Qin Jiangning , Duc T. Nguyen
{"title":"NIKE3D与PVSOLVE在矢量和并行计算机上的性能","authors":"Bradley N. Maker ,&nbsp;Qin Jiangning ,&nbsp;Duc T. Nguyen","doi":"10.1016/0956-0521(94)90018-3","DOIUrl":null,"url":null,"abstract":"<div><p>The cost of large implicit finite element calculations is dominated by the linear equation solver. In this work, several versions of the direct linear equation solver PVSOLVE were implemented in the general purpose, nonlinear finite element code NIKE3D. Timing studies were performed on Sun and IBM workstations, CRAY Y/MP and C90 supercomputers, and the distributed memory MEIKO CS-2 parallel computer. Cost breakdowns and efficiency curves are presented for a benchmark problem with several mesh densities. The distributed memory implementation is shown to be most efficient for large problems, where MEIKO performance is comparable to that of a single CRAY processor.</p></div>","PeriodicalId":100325,"journal":{"name":"Computing Systems in Engineering","volume":"5 4","pages":"Pages 363-368"},"PeriodicalIF":0.0000,"publicationDate":"1994-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0956-0521(94)90018-3","citationCount":"8","resultStr":"{\"title\":\"Performance of NIKE3D with PVSOLVE on vector and parallel computers\",\"authors\":\"Bradley N. Maker ,&nbsp;Qin Jiangning ,&nbsp;Duc T. Nguyen\",\"doi\":\"10.1016/0956-0521(94)90018-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The cost of large implicit finite element calculations is dominated by the linear equation solver. In this work, several versions of the direct linear equation solver PVSOLVE were implemented in the general purpose, nonlinear finite element code NIKE3D. Timing studies were performed on Sun and IBM workstations, CRAY Y/MP and C90 supercomputers, and the distributed memory MEIKO CS-2 parallel computer. Cost breakdowns and efficiency curves are presented for a benchmark problem with several mesh densities. The distributed memory implementation is shown to be most efficient for large problems, where MEIKO performance is comparable to that of a single CRAY processor.</p></div>\",\"PeriodicalId\":100325,\"journal\":{\"name\":\"Computing Systems in Engineering\",\"volume\":\"5 4\",\"pages\":\"Pages 363-368\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1994-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1016/0956-0521(94)90018-3\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computing Systems in Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/0956052194900183\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computing Systems in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/0956052194900183","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

大型隐式有限元计算的成本主要由线性方程求解器控制。在这项工作中,在通用的非线性有限元代码NIKE3D中实现了几个版本的直接线性方程求解器PVSOLVE。在Sun和IBM工作站、CRAY Y/MP和C90超级计算机以及分布式内存MEIKO CS-2并行计算机上进行时序研究。给出了具有多种网格密度的基准问题的成本分解和效率曲线。分布式内存实现对于大型问题是最有效的,其中MEIKO的性能与单个CRAY处理器相当。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Performance of NIKE3D with PVSOLVE on vector and parallel computers

The cost of large implicit finite element calculations is dominated by the linear equation solver. In this work, several versions of the direct linear equation solver PVSOLVE were implemented in the general purpose, nonlinear finite element code NIKE3D. Timing studies were performed on Sun and IBM workstations, CRAY Y/MP and C90 supercomputers, and the distributed memory MEIKO CS-2 parallel computer. Cost breakdowns and efficiency curves are presented for a benchmark problem with several mesh densities. The distributed memory implementation is shown to be most efficient for large problems, where MEIKO performance is comparable to that of a single CRAY processor.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信