Xiang Yu, Zhijie Yang, LingHui Peng, Bo Lin, Wenjing Yang, Lei Wang
{"title":"CNN Specific ISA Extensions Based on RISC-V Processors","authors":"Xiang Yu, Zhijie Yang, LingHui Peng, Bo Lin, Wenjing Yang, Lei Wang","doi":"10.1109/iccss55260.2022.9802445","DOIUrl":null,"url":null,"abstract":"The CNNs have achieved excellent performance in pattern recognition and target detection, which have a wide range of applications in industrial control, medical imaging, autonomous driving, and other fields. However, it is very inefficient to execute data-intensive CNN applications on edge devices with limited computing and power resources. It is necessary to add a domain-specific acceleration module on the edge devices to improve the performance when performing intensive calculations. In this work, we present ISA extensions based on the RISC-V ISA, including data operation instruction and data transfer instruction, aimed at boosting the computational efficiency of CNNs on edge devices. The microarchitecture supporting our proposed extensions is built on top of an open-source RISC-V core. In addition, extended instructions have been added to the GCC Binutils toolchain. To evaluate the effect of our extended instructions, we performed a set of workloads on the baseline and extended core, our proposed ISA extensions have a speed-up ratio of 1.5$\\times$ when executing a CNN, and reaches 2.48$\\times-2.82\\times$ when only performing convolution calculations. The results show that our proposed ISA extensions can effectively improve the performance of CNNs.","PeriodicalId":254992,"journal":{"name":"2022 5th International Conference on Circuits, Systems and Simulation (ICCSS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Circuits, Systems and Simulation (ICCSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iccss55260.2022.9802445","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The CNNs have achieved excellent performance in pattern recognition and target detection, which have a wide range of applications in industrial control, medical imaging, autonomous driving, and other fields. However, it is very inefficient to execute data-intensive CNN applications on edge devices with limited computing and power resources. It is necessary to add a domain-specific acceleration module on the edge devices to improve the performance when performing intensive calculations. In this work, we present ISA extensions based on the RISC-V ISA, including data operation instruction and data transfer instruction, aimed at boosting the computational efficiency of CNNs on edge devices. The microarchitecture supporting our proposed extensions is built on top of an open-source RISC-V core. In addition, extended instructions have been added to the GCC Binutils toolchain. To evaluate the effect of our extended instructions, we performed a set of workloads on the baseline and extended core, our proposed ISA extensions have a speed-up ratio of 1.5$\times$ when executing a CNN, and reaches 2.48$\times-2.82\times$ when only performing convolution calculations. The results show that our proposed ISA extensions can effectively improve the performance of CNNs.