RISC-V vector processor for acceleration of machine learning algorithms

Nikola Kovačević, Đorđe Mišeljić, Aleksa Stojković
{"title":"RISC-V vector processor for acceleration of machine learning algorithms","authors":"Nikola Kovačević, Đorđe Mišeljić, Aleksa Stojković","doi":"10.1109/TELFOR56187.2022.9983779","DOIUrl":null,"url":null,"abstract":"In this paper we present an RTL implementation of a 32-bit parametrizable vector processor for acceleration of algorithms working in fixed-point arithmetic. The processor uses the latest RISC-V vector extension ISA specification and is deployed and tested on a Zynq Soc using Avnet Zedboard. Our microarchitecture exploits the inherent parallelism in algorithms by splitting execution across multiple vector lanes and enabling chaining of vector instructions. To provide the required number of read/write ports for instruction chaining, the vector register bank uses the double-pumping technique in combination with an XOR-based approach. First, the microarchitecture of the system is explained in detail, and the results of the implementation on the Zedboard are presented for some different processor configurations. We then compared the performance of the implemented design with some different modern processor cores.","PeriodicalId":277553,"journal":{"name":"2022 30th Telecommunications Forum (TELFOR)","volume":"299 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 30th Telecommunications Forum (TELFOR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TELFOR56187.2022.9983779","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

In this paper we present an RTL implementation of a 32-bit parametrizable vector processor for acceleration of algorithms working in fixed-point arithmetic. The processor uses the latest RISC-V vector extension ISA specification and is deployed and tested on a Zynq Soc using Avnet Zedboard. Our microarchitecture exploits the inherent parallelism in algorithms by splitting execution across multiple vector lanes and enabling chaining of vector instructions. To provide the required number of read/write ports for instruction chaining, the vector register bank uses the double-pumping technique in combination with an XOR-based approach. First, the microarchitecture of the system is explained in detail, and the results of the implementation on the Zedboard are presented for some different processor configurations. We then compared the performance of the implemented design with some different modern processor cores.
RISC-V矢量处理器,用于加速机器学习算法
在本文中,我们提出了一个32位可参数化矢量处理器的RTL实现,用于加速定点算法。该处理器采用最新的RISC-V矢量扩展ISA规范,并使用安富利Zedboard在Zynq Soc上进行部署和测试。我们的微架构利用了算法的固有并行性,通过在多个矢量通道上分割执行并启用矢量指令链。为了提供指令链所需的读/写端口数量,矢量寄存器组将双泵送技术与基于xor的方法相结合。首先,详细介绍了系统的微体系结构,并给出了不同处理器配置下在Zedboard上的实现结果。然后,我们将实现的设计与一些不同的现代处理器内核的性能进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信