Artem Mavliutov, Giovanni Isotton, Carlo Janna, Alessandro Celestini, Massimo Bernaschi
{"title":"Multi GPU Sparse Matrix by Sparse Matrix Multiplication","authors":"Artem Mavliutov, Giovanni Isotton, Carlo Janna, Alessandro Celestini, Massimo Bernaschi","doi":"10.1002/cpe.70313","DOIUrl":null,"url":null,"abstract":"<p>The paper focuses on the improvement of the existing <i>nsparse</i> Nagasaka et al. algorithm and its extension to the multi-GPU setting for the application of real engineering problems. In this work, we propose a distributed multi-GPU framework for <i>SpGEMM</i> that is designed specifically for the <i>nsparse</i> like algorithms. The results show ∼2 times speed-up for <i>nsparse</i> and close to ideal scalability of the multi-GPU extension with the number of GPUs. Finally, we test the proposed algorithm in the AMG setting by computing the double <i>SpGEMM</i> product.</p>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 25-26","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cpe.70313","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70313","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
The paper focuses on the improvement of the existing nsparse Nagasaka et al. algorithm and its extension to the multi-GPU setting for the application of real engineering problems. In this work, we propose a distributed multi-GPU framework for SpGEMM that is designed specifically for the nsparse like algorithms. The results show ∼2 times speed-up for nsparse and close to ideal scalability of the multi-GPU extension with the number of GPUs. Finally, we test the proposed algorithm in the AMG setting by computing the double SpGEMM product.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.