Rogeli Grima Torres, Pablo Vizcaíno, Filippo Mantovani, José Julio Gutiérrez Moreno
{"title":"在 RISC-V 向量架构上共同设计 ab initio 电子结构方法。","authors":"Rogeli Grima Torres, Pablo Vizcaíno, Filippo Mantovani, José Julio Gutiérrez Moreno","doi":"10.12688/openreseurope.18321.1","DOIUrl":null,"url":null,"abstract":"<p><p><i>Ab initio</i> electronic structure applications are among the most widely used in High-Performance Computing (HPC), and the eigenvalue problem is often their main computational bottleneck. This article presents our initial efforts in porting these codes to a RISC-V prototype platform leveraging a wide Vector Processing Unit (VPU). Our software tester is based on a mini-app extracted from the ELPA eigensolver library. The user-space emulator Vehave and a RISC-V vector architecture implemented on an FPGA were tested. Metrics from both systems and different vectorisation strategies were extracted, ranging from the simplest and most portable one (using autovectorisation and assisting this by fusing loops in the code) to the more complex one (using intrinsics). We observed a progressive reduction in the number of vectorised instructions, executed instructions and computing cycles with the different methodologies, which will lead to a substantial speed-up in the calculations. The obtained outcomes are crucial in advancing the porting of computational materials and molecular science codes to (post)-exascale architectures using RISC-V-based technologies fully developed within the EU. Our evaluation also provides valuable feedback for hardware designers, engineers and compiler developers, making this use case pivotal for co-design efforts.</p>","PeriodicalId":74359,"journal":{"name":"Open research Europe","volume":"4 ","pages":"165"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11358678/pdf/","citationCount":"0","resultStr":"{\"title\":\"Co-designing ab initio electronic structure methods on a RISC-V vector architecture.\",\"authors\":\"Rogeli Grima Torres, Pablo Vizcaíno, Filippo Mantovani, José Julio Gutiérrez Moreno\",\"doi\":\"10.12688/openreseurope.18321.1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><i>Ab initio</i> electronic structure applications are among the most widely used in High-Performance Computing (HPC), and the eigenvalue problem is often their main computational bottleneck. This article presents our initial efforts in porting these codes to a RISC-V prototype platform leveraging a wide Vector Processing Unit (VPU). Our software tester is based on a mini-app extracted from the ELPA eigensolver library. The user-space emulator Vehave and a RISC-V vector architecture implemented on an FPGA were tested. Metrics from both systems and different vectorisation strategies were extracted, ranging from the simplest and most portable one (using autovectorisation and assisting this by fusing loops in the code) to the more complex one (using intrinsics). We observed a progressive reduction in the number of vectorised instructions, executed instructions and computing cycles with the different methodologies, which will lead to a substantial speed-up in the calculations. The obtained outcomes are crucial in advancing the porting of computational materials and molecular science codes to (post)-exascale architectures using RISC-V-based technologies fully developed within the EU. Our evaluation also provides valuable feedback for hardware designers, engineers and compiler developers, making this use case pivotal for co-design efforts.</p>\",\"PeriodicalId\":74359,\"journal\":{\"name\":\"Open research Europe\",\"volume\":\"4 \",\"pages\":\"165\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11358678/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Open research Europe\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12688/openreseurope.18321.1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Open research Europe","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12688/openreseurope.18321.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
Ab initio 电子结构应用是高性能计算(HPC)中使用最广泛的应用之一,而特征值问题往往是其主要的计算瓶颈。本文介绍了我们利用宽矢量处理单元(VPU)将这些代码移植到 RISC-V 原型平台的初步工作。我们的软件测试器基于从 ELPA eigensolver 库中提取的迷你应用程序。用户空间 Vehave 和在 FPGA 上实现的 RISC-V 矢量架构都经过了测试。我们从这两个系统和不同的矢量化策略中提取了指标,从最简单、最便携的策略(使用自动矢量化,并通过融合代码中的循环来辅助矢量化)到更复杂的策略(使用内在函数)。我们发现,使用不同的方法,矢量指令、执行指令和计算周期的数量都在逐步减少,这将大大加快计算速度。所取得的成果对于将计算材料和分子科学代码移植到(后)超大规模架构至关重要,这些架构使用的是欧盟内部完全开发的基于 RISC-V 的技术。我们的评估还为硬件设计人员、工程师和编译器开发人员提供了有价值的反馈,使这一用例成为协同设计工作的关键。
Co-designing ab initio electronic structure methods on a RISC-V vector architecture.
Ab initio electronic structure applications are among the most widely used in High-Performance Computing (HPC), and the eigenvalue problem is often their main computational bottleneck. This article presents our initial efforts in porting these codes to a RISC-V prototype platform leveraging a wide Vector Processing Unit (VPU). Our software tester is based on a mini-app extracted from the ELPA eigensolver library. The user-space emulator Vehave and a RISC-V vector architecture implemented on an FPGA were tested. Metrics from both systems and different vectorisation strategies were extracted, ranging from the simplest and most portable one (using autovectorisation and assisting this by fusing loops in the code) to the more complex one (using intrinsics). We observed a progressive reduction in the number of vectorised instructions, executed instructions and computing cycles with the different methodologies, which will lead to a substantial speed-up in the calculations. The obtained outcomes are crucial in advancing the porting of computational materials and molecular science codes to (post)-exascale architectures using RISC-V-based technologies fully developed within the EU. Our evaluation also provides valuable feedback for hardware designers, engineers and compiler developers, making this use case pivotal for co-design efforts.