{"title":"基于RISC-V软核CPU的国产FPGA实现CNN异构方案","authors":"Hailong Wu, Jindong Li, Xiang Chen","doi":"10.1109/ICTA56932.2022.9963056","DOIUrl":null,"url":null,"abstract":"Field Programmable Gate Array (FPGA) has the characteristics of low power consumption, high performance and flexibility. Research on FPGA neural network acceleration is emerging, but most of the researches are based on foreign FPGA devices. In order to improve the current situation of domestic FPGA, a novel Convolutional neural networks (CNNs) accelerator for domestic FPGA equipped with lightweight RISC-V soft core is proposed. The peak performance of the proposed accelerator reaches 153.6 GOP/s, occupying only 14K LUTs (Look-Up-Table), 32 DRMs (Dedicated RAM Modules) and 208 APMs (Arithmetic Process Modules). The proposed accelerator has enough computing power for most of the Edge-AI applications and embedded systems, providing a possible AI inference acceleration solution for domestic FPGA.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Implementation of CNN Heterogeneous Scheme Based on Domestic FPGA with RISC-V Soft Core CPU\",\"authors\":\"Hailong Wu, Jindong Li, Xiang Chen\",\"doi\":\"10.1109/ICTA56932.2022.9963056\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Field Programmable Gate Array (FPGA) has the characteristics of low power consumption, high performance and flexibility. Research on FPGA neural network acceleration is emerging, but most of the researches are based on foreign FPGA devices. In order to improve the current situation of domestic FPGA, a novel Convolutional neural networks (CNNs) accelerator for domestic FPGA equipped with lightweight RISC-V soft core is proposed. The peak performance of the proposed accelerator reaches 153.6 GOP/s, occupying only 14K LUTs (Look-Up-Table), 32 DRMs (Dedicated RAM Modules) and 208 APMs (Arithmetic Process Modules). The proposed accelerator has enough computing power for most of the Edge-AI applications and embedded systems, providing a possible AI inference acceleration solution for domestic FPGA.\",\"PeriodicalId\":325602,\"journal\":{\"name\":\"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTA56932.2022.9963056\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTA56932.2022.9963056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Implementation of CNN Heterogeneous Scheme Based on Domestic FPGA with RISC-V Soft Core CPU
Field Programmable Gate Array (FPGA) has the characteristics of low power consumption, high performance and flexibility. Research on FPGA neural network acceleration is emerging, but most of the researches are based on foreign FPGA devices. In order to improve the current situation of domestic FPGA, a novel Convolutional neural networks (CNNs) accelerator for domestic FPGA equipped with lightweight RISC-V soft core is proposed. The peak performance of the proposed accelerator reaches 153.6 GOP/s, occupying only 14K LUTs (Look-Up-Table), 32 DRMs (Dedicated RAM Modules) and 208 APMs (Arithmetic Process Modules). The proposed accelerator has enough computing power for most of the Edge-AI applications and embedded systems, providing a possible AI inference acceleration solution for domestic FPGA.