{"title":"A Configurable Mixed-Precision Convolution Processing Unit Generator in Chisel","authors":"Jure Vreča, Anton Biasizzo","doi":"10.1109/DDECS57882.2023.10139758","DOIUrl":null,"url":null,"abstract":"We present a configurable implementation of a convolution processing unit suitable for computing mixed-precision quantized neural networks. The design is implemented as a hardware generator written in Chisel, which is a software framework for writing hardware circuit generators. Our generator is designed to use minimal hardware resources and is very flexible in regards to various aspects of the convolution operation, including: image size, kernel size, image bitwidth, kernel bitwidth, activation function, and more. The processing unit is configurable only at generation time, thus we don’t pay the price of using more general hardware, instead we can tailor it to the problem at hand.","PeriodicalId":220690,"journal":{"name":"2023 26th International Symposium on Design and Diagnostics of Electronic Circuits and Systems (DDECS)","volume":"120 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 26th International Symposium on Design and Diagnostics of Electronic Circuits and Systems (DDECS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DDECS57882.2023.10139758","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We present a configurable implementation of a convolution processing unit suitable for computing mixed-precision quantized neural networks. The design is implemented as a hardware generator written in Chisel, which is a software framework for writing hardware circuit generators. Our generator is designed to use minimal hardware resources and is very flexible in regards to various aspects of the convolution operation, including: image size, kernel size, image bitwidth, kernel bitwidth, activation function, and more. The processing unit is configurable only at generation time, thus we don’t pay the price of using more general hardware, instead we can tailor it to the problem at hand.