{"title":"Implementation of the conjugate gradient algorithm on a vector hypercube multiprocessor","authors":"C. Aykanat, F. Özgüner, D. Scott","doi":"10.1145/63047.63127","DOIUrl":null,"url":null,"abstract":"In this paper, parallel implementation and vectorization of the Scaled Conjugate Gradient (SCG) algorithm for the solution of large sparse linear system of equations, on a vector hypercube multiprocessor (iPSC- VX), is described. Computations in the SCG algorithm consist mainly of matrix operations that can be vectorized and are implemented on the Vector Processor on each node of the hypercube. The implementation described here achieves efficient parallelization by overlapping vectorized computations with inter-node communication. A speed-up of 58 over a µVax II is obtained for large finite element meshes.","PeriodicalId":299435,"journal":{"name":"Conference on Hypercube Concurrent Computers and Applications","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1989-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Hypercube Concurrent Computers and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/63047.63127","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this paper, parallel implementation and vectorization of the Scaled Conjugate Gradient (SCG) algorithm for the solution of large sparse linear system of equations, on a vector hypercube multiprocessor (iPSC- VX), is described. Computations in the SCG algorithm consist mainly of matrix operations that can be vectorized and are implemented on the Vector Processor on each node of the hypercube. The implementation described here achieves efficient parallelization by overlapping vectorized computations with inter-node communication. A speed-up of 58 over a µVax II is obtained for large finite element meshes.