Yuta Masuda, Yasuhiro Nakahara, M. Amagasaki, M. Iida
{"title":"DNN加速器ReNA的自动执行代码生成","authors":"Yuta Masuda, Yasuhiro Nakahara, M. Amagasaki, M. Iida","doi":"10.1109/CANDARW53999.2021.00025","DOIUrl":null,"url":null,"abstract":"We have been developing ReNA as a DNN accelerator for the edge, which is controlled by directly specifying control signals for each circuit by microcode instructions. The current control method is not efficient because of its low readability and manual generation of execution code. In addition, it requires a large amount of instructions and large SRAM size to store the control signals. In this paper, we try to solve this problem by abstracting the microcode instructions and reducing the amount of instructions. We also improve efficiency of model implementation by enabling automatic generation of the microcode. As a result, we were able to reduce the required SRAM capacity by about 86% and halve the area of the SRAM for storing instructions.","PeriodicalId":325028,"journal":{"name":"2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatic executable code generation for DNN accelerator ReNA\",\"authors\":\"Yuta Masuda, Yasuhiro Nakahara, M. Amagasaki, M. Iida\",\"doi\":\"10.1109/CANDARW53999.2021.00025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We have been developing ReNA as a DNN accelerator for the edge, which is controlled by directly specifying control signals for each circuit by microcode instructions. The current control method is not efficient because of its low readability and manual generation of execution code. In addition, it requires a large amount of instructions and large SRAM size to store the control signals. In this paper, we try to solve this problem by abstracting the microcode instructions and reducing the amount of instructions. We also improve efficiency of model implementation by enabling automatic generation of the microcode. As a result, we were able to reduce the required SRAM capacity by about 86% and halve the area of the SRAM for storing instructions.\",\"PeriodicalId\":325028,\"journal\":{\"name\":\"2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CANDARW53999.2021.00025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CANDARW53999.2021.00025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic executable code generation for DNN accelerator ReNA
We have been developing ReNA as a DNN accelerator for the edge, which is controlled by directly specifying control signals for each circuit by microcode instructions. The current control method is not efficient because of its low readability and manual generation of execution code. In addition, it requires a large amount of instructions and large SRAM size to store the control signals. In this paper, we try to solve this problem by abstracting the microcode instructions and reducing the amount of instructions. We also improve efficiency of model implementation by enabling automatic generation of the microcode. As a result, we were able to reduce the required SRAM capacity by about 86% and halve the area of the SRAM for storing instructions.