Yuta Masuda, Yasuhiro Nakahara, M. Amagasaki, M. Iida
{"title":"Automatic executable code generation for DNN accelerator ReNA","authors":"Yuta Masuda, Yasuhiro Nakahara, M. Amagasaki, M. Iida","doi":"10.1109/CANDARW53999.2021.00025","DOIUrl":null,"url":null,"abstract":"We have been developing ReNA as a DNN accelerator for the edge, which is controlled by directly specifying control signals for each circuit by microcode instructions. The current control method is not efficient because of its low readability and manual generation of execution code. In addition, it requires a large amount of instructions and large SRAM size to store the control signals. In this paper, we try to solve this problem by abstracting the microcode instructions and reducing the amount of instructions. We also improve efficiency of model implementation by enabling automatic generation of the microcode. As a result, we were able to reduce the required SRAM capacity by about 86% and halve the area of the SRAM for storing instructions.","PeriodicalId":325028,"journal":{"name":"2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CANDARW53999.2021.00025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We have been developing ReNA as a DNN accelerator for the edge, which is controlled by directly specifying control signals for each circuit by microcode instructions. The current control method is not efficient because of its low readability and manual generation of execution code. In addition, it requires a large amount of instructions and large SRAM size to store the control signals. In this paper, we try to solve this problem by abstracting the microcode instructions and reducing the amount of instructions. We also improve efficiency of model implementation by enabling automatic generation of the microcode. As a result, we were able to reduce the required SRAM capacity by about 86% and halve the area of the SRAM for storing instructions.