Spica: Exploring FPGA Optimizations to Enable an Efficient SpMV Implementation for Computations at Edge

2023 IEEE International Conference on Edge Computing and Communications (EDGE) Pub Date : 2023-07-01 DOI:10.1109/EDGE60047.2023.00018

Dheeraj Ramchandani, Bahar Asgari, Hyesoon Kim

{"title":"Spica: Exploring FPGA Optimizations to Enable an Efficient SpMV Implementation for Computations at Edge","authors":"Dheeraj Ramchandani, Bahar Asgari, Hyesoon Kim","doi":"10.1109/EDGE60047.2023.00018","DOIUrl":null,"url":null,"abstract":"With the emergence of FPGA boards equipped with high bandwidth memory (HBM), these boards have become more attractive for implementing memory-intensive computational kernels such as sparse matrix-vector multiplication (SpMV), with a wide range of applications in edge computations from deep learning to robotics. Specialized implementation of SpMV on FPGAs enables efficient utilization of the limited resources in edge systems. High-level synthesis (HLS) compilers, on the other hand, have eased the programming of FPGAs, leading to a faster development cycle. Even though the programming of FPGAs has become easier, obtaining maximum throughput even for the straightforward kernel of SpMV still requires careful optimizations. Therefore, this paper explores the impact of deploying various optimization techniques such as temporal parallelism, spatial parallelism, and memory alignment to help SpMV fully utilize the available memory bandwidth of HBM on a Xilinx FPGA board to achieve close-to-peak throughput without wasting the resources. We conclude the optimizations by suggesting Spica, a high-throughput tree-based SpMV implementation.","PeriodicalId":369407,"journal":{"name":"2023 IEEE International Conference on Edge Computing and Communications (EDGE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Edge Computing and Communications (EDGE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EDGE60047.2023.00018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

With the emergence of FPGA boards equipped with high bandwidth memory (HBM), these boards have become more attractive for implementing memory-intensive computational kernels such as sparse matrix-vector multiplication (SpMV), with a wide range of applications in edge computations from deep learning to robotics. Specialized implementation of SpMV on FPGAs enables efficient utilization of the limited resources in edge systems. High-level synthesis (HLS) compilers, on the other hand, have eased the programming of FPGAs, leading to a faster development cycle. Even though the programming of FPGAs has become easier, obtaining maximum throughput even for the straightforward kernel of SpMV still requires careful optimizations. Therefore, this paper explores the impact of deploying various optimization techniques such as temporal parallelism, spatial parallelism, and memory alignment to help SpMV fully utilize the available memory bandwidth of HBM on a Xilinx FPGA board to achieve close-to-peak throughput without wasting the resources. We conclude the optimizations by suggesting Spica, a high-throughput tree-based SpMV implementation.

查看原文本刊更多论文

Spica:探索FPGA优化以实现有效的边缘计算SpMV实现

随着配备高带宽存储器(HBM)的FPGA板的出现，这些板对于实现内存密集型计算内核(如稀疏矩阵向量乘法(SpMV))变得更具吸引力，在从深度学习到机器人技术的边缘计算中具有广泛的应用。SpMV在fpga上的专门实现使边缘系统中有限资源的有效利用成为可能。另一方面，高级综合(HLS)编译器简化了fpga的编程，从而加快了开发周期。尽管fpga的编程已经变得更加容易，但即使对于SpMV的直接内核，获得最大吞吐量仍然需要仔细的优化。因此，本文探讨了部署各种优化技术的影响，如时间并行、空间并行和内存对齐，以帮助SpMV充分利用Xilinx FPGA板上HBM的可用内存带宽，从而在不浪费资源的情况下实现接近峰值的吞吐量。我们通过推荐Spica来结束优化，Spica是一个高吞吐量的基于树的SpMV实现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE International Conference on Edge Computing and Communications (EDGE)

自引率

0.00%

发文量