{"title":"Folded and Deterministic Stochastic MAC for High Accuracy and Hardware Efficient Convolution Function","authors":"M. Wong, A. Do","doi":"10.1109/SOCC46988.2019.1570543844","DOIUrl":null,"url":null,"abstract":"A family of Stochastic Computing (SC) circuit using folded and deterministic sequence is presented in this work. The SC circuit produces shorter and higher progressive precision sequences, and without the use of PRNG (often implemented as LFSR in hardware). Using the new stochastic representation, a design of SC multiply-and-accumulate (MAC) function is proposed and implemented as hardware convolution accelerator on a Virtex-4 FPGA. Experimental results obtained from detailed test analysis has proven that the new SC-MAC unit has higher computational accuracy and achieved a better ratio of throughput performance over area cost (hardware efficiency) in comparison with the conventional SC approach.","PeriodicalId":253998,"journal":{"name":"2019 32nd IEEE International System-on-Chip Conference (SOCC)","volume":"207 9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 32nd IEEE International System-on-Chip Conference (SOCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SOCC46988.2019.1570543844","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A family of Stochastic Computing (SC) circuit using folded and deterministic sequence is presented in this work. The SC circuit produces shorter and higher progressive precision sequences, and without the use of PRNG (often implemented as LFSR in hardware). Using the new stochastic representation, a design of SC multiply-and-accumulate (MAC) function is proposed and implemented as hardware convolution accelerator on a Virtex-4 FPGA. Experimental results obtained from detailed test analysis has proven that the new SC-MAC unit has higher computational accuracy and achieved a better ratio of throughput performance over area cost (hardware efficiency) in comparison with the conventional SC approach.