DNN加速器ReNA的自动执行代码生成

2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW) Pub Date : 2021-11-01 DOI:10.1109/CANDARW53999.2021.00025

Yuta Masuda, Yasuhiro Nakahara, M. Amagasaki, M. Iida

{"title":"DNN加速器ReNA的自动执行代码生成","authors":"Yuta Masuda, Yasuhiro Nakahara, M. Amagasaki, M. Iida","doi":"10.1109/CANDARW53999.2021.00025","DOIUrl":null,"url":null,"abstract":"We have been developing ReNA as a DNN accelerator for the edge, which is controlled by directly specifying control signals for each circuit by microcode instructions. The current control method is not efficient because of its low readability and manual generation of execution code. In addition, it requires a large amount of instructions and large SRAM size to store the control signals. In this paper, we try to solve this problem by abstracting the microcode instructions and reducing the amount of instructions. We also improve efficiency of model implementation by enabling automatic generation of the microcode. As a result, we were able to reduce the required SRAM capacity by about 86% and halve the area of the SRAM for storing instructions.","PeriodicalId":325028,"journal":{"name":"2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatic executable code generation for DNN accelerator ReNA\",\"authors\":\"Yuta Masuda, Yasuhiro Nakahara, M. Amagasaki, M. Iida\",\"doi\":\"10.1109/CANDARW53999.2021.00025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We have been developing ReNA as a DNN accelerator for the edge, which is controlled by directly specifying control signals for each circuit by microcode instructions. The current control method is not efficient because of its low readability and manual generation of execution code. In addition, it requires a large amount of instructions and large SRAM size to store the control signals. In this paper, we try to solve this problem by abstracting the microcode instructions and reducing the amount of instructions. We also improve efficiency of model implementation by enabling automatic generation of the microcode. As a result, we were able to reduce the required SRAM capacity by about 86% and halve the area of the SRAM for storing instructions.\",\"PeriodicalId\":325028,\"journal\":{\"name\":\"2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CANDARW53999.2021.00025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CANDARW53999.2021.00025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

我们一直在开发ReNA作为边缘的DNN加速器，它通过微码指令直接指定每个电路的控制信号来控制。目前的控制方法可读性低，执行代码需要手工生成，效率不高。此外，它需要大量的指令和大的SRAM容量来存储控制信号。本文试图通过抽象微码指令和减少指令的数量来解决这一问题。我们还通过启用微码的自动生成来提高模型实现的效率。因此，我们能够将所需的SRAM容量减少约86%，并将用于存储指令的SRAM面积减半。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automatic executable code generation for DNN accelerator ReNA

We have been developing ReNA as a DNN accelerator for the edge, which is controlled by directly specifying control signals for each circuit by microcode instructions. The current control method is not efficient because of its low readability and manual generation of execution code. In addition, it requires a large amount of instructions and large SRAM size to store the control signals. In this paper, we try to solve this problem by abstracting the microcode instructions and reducing the amount of instructions. We also improve efficiency of model implementation by enabling automatic generation of the microcode. As a result, we were able to reduce the required SRAM capacity by about 86% and halve the area of the SRAM for storing instructions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 Ninth International Symposium on Computing and Networking Workshops (CANDARW)

自引率

0.00%

发文量