K. Mahmoud, W. E. Smith, Mark Fishkin, Timothy N. Miller
{"title":"加速人工神经网络前向传播的数据驱动逻辑合成器","authors":"K. Mahmoud, W. E. Smith, Mark Fishkin, Timothy N. Miller","doi":"10.1109/ICCD.2015.7357142","DOIUrl":null,"url":null,"abstract":"We present a tool for automatically generating efficient feed-forward logic for hardware acceleration of artificial neural networks (ANNs). It produces circuitry in the form of synthesizable Verilog code that is optimized based on analyzing training data to minimize the numbers of bits in weights and values, thereby minimizing the number of logic gates in ANN components such as adders and multipliers. For an optimized ANN, different implementation topologies can be generated, including fully pipelined and simple state machines. Additional insights about hardware acceleration for neural networks are also presented. We show the impact of reducing precision relative to floating point and present area, power, delay, throughput, and energy estimates by circuit synthesis.","PeriodicalId":129506,"journal":{"name":"2015 33rd IEEE International Conference on Computer Design (ICCD)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data-driven logic synthesizer for acceleration of Forward propagation in artificial neural networks\",\"authors\":\"K. Mahmoud, W. E. Smith, Mark Fishkin, Timothy N. Miller\",\"doi\":\"10.1109/ICCD.2015.7357142\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a tool for automatically generating efficient feed-forward logic for hardware acceleration of artificial neural networks (ANNs). It produces circuitry in the form of synthesizable Verilog code that is optimized based on analyzing training data to minimize the numbers of bits in weights and values, thereby minimizing the number of logic gates in ANN components such as adders and multipliers. For an optimized ANN, different implementation topologies can be generated, including fully pipelined and simple state machines. Additional insights about hardware acceleration for neural networks are also presented. We show the impact of reducing precision relative to floating point and present area, power, delay, throughput, and energy estimates by circuit synthesis.\",\"PeriodicalId\":129506,\"journal\":{\"name\":\"2015 33rd IEEE International Conference on Computer Design (ICCD)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 33rd IEEE International Conference on Computer Design (ICCD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCD.2015.7357142\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 33rd IEEE International Conference on Computer Design (ICCD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCD.2015.7357142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data-driven logic synthesizer for acceleration of forward propagation in artificial neural networks
We present a tool for automatically generating efficient feed-forward logic for hardware acceleration of artificial neural networks (ANNs). It produces circuitry in the form of synthesizable Verilog code that is optimized by analyzing training data to minimize the number of bits in weights and values, thereby minimizing the number of logic gates in ANN components such as adders and multipliers. For an optimized ANN, different implementation topologies can be generated, including fully pipelined designs and simple state machines. Additional insights about hardware acceleration for neural networks are also presented. We show the impact of reduced precision relative to floating point and present area, power, delay, throughput, and energy estimates obtained by circuit synthesis.
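The central idea in the abstract is data-driven bit-width minimization: analyzing trained weights and values to pick the narrowest fixed-point format, which in turn shrinks the adders and multipliers generated in Verilog. The Python sketch below is only an illustration of that idea under simple assumptions (it is not the paper's tool or algorithm); the function name, error tolerance, and bit-width search range are all hypothetical.

```python
# Illustrative sketch (not the authors' tool): given a trained layer's weights,
# estimate the smallest fixed-point format whose worst-case quantization error
# stays below an assumed tolerance. Narrower formats would mean smaller
# adders/multipliers in generated hardware.
import numpy as np

def minimal_fixed_point_bits(weights, max_error=1e-3, max_frac_bits=16):
    """Return (integer_bits, fraction_bits) keeping the worst-case
    quantization error of `weights` below `max_error`."""
    w = np.asarray(weights, dtype=np.float64)
    # Integer bits: enough to cover the largest magnitude, plus a sign bit.
    int_bits = max(1, int(np.ceil(np.log2(np.max(np.abs(w)) + 1e-12))) + 1)
    for frac_bits in range(0, max_frac_bits + 1):
        scale = 2.0 ** frac_bits
        quantized = np.round(w * scale) / scale
        if np.max(np.abs(quantized - w)) <= max_error:
            return int_bits, frac_bits
    return int_bits, max_frac_bits  # fall back to the widest allowed format

# Example: random weights stand in for a trained layer.
rng = np.random.default_rng(0)
layer_weights = rng.normal(scale=0.5, size=(64, 32))
print(minimal_fixed_point_bits(layer_weights, max_error=1e-3))
```

In practice the error criterion would likely be tied to the network's output accuracy on the training data rather than a fixed per-weight tolerance, but the per-weight bound keeps the sketch self-contained.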