AutoDSE: Enabling Software Programmers to Design Efficient FPGA Accelerators

ACM Transactions on Design Automation of Electronic Systems (TODAES) Pub Date : 2020-09-30 DOI:10.1145/3494534

Atefeh Sohrabizadeh, Cody Hao Yu, Min Gao, J. Cong

{"title":"AutoDSE: Enabling Software Programmers to Design Efficient FPGA Accelerators","authors":"Atefeh Sohrabizadeh, Cody Hao Yu, Min Gao, J. Cong","doi":"10.1145/3494534","DOIUrl":null,"url":null,"abstract":"Adopting FPGA as an accelerator in datacenters is becoming mainstream for customized computing, but the fact that FPGAs are hard to program creates a steep learning curve for software programmers. Even with the help of high-level synthesis (HLS), accelerator designers still have to manually perform code reconstruction and cumbersome parameter tuning to achieve optimal performance. While many learning models have been leveraged by existing work to automate the design of efficient accelerators, the unpredictability of modern HLS tools becomes a major obstacle for them to maintain high accuracy. To address this problem, we propose an automated DSE framework—AutoDSE—that leverages a bottleneck-guided coordinate optimizer to systematically find a better design point. AutoDSE detects the bottleneck of the design in each step and focuses on high-impact parameters to overcome it. The experimental results show that AutoDSE is able to identify the design point that achieves, on the geometric mean, 19.9× speedup over one CPU core for MachSuite and Rodinia benchmarks. Compared to the manually optimized HLS vision kernels in Xilinx Vitis libraries, AutoDSE can reduce their optimization pragmas by 26.38× while achieving similar performance. With less than one optimization pragma per design on average, we are making progress towards democratizing customizable computing by enabling software programmers to design efficient FPGA accelerators.","PeriodicalId":6933,"journal":{"name":"ACM Transactions on Design Automation of Electronic Systems (TODAES)","volume":"177 1","pages":"1 - 27"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Design Automation of Electronic Systems (TODAES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3494534","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 37

Abstract

Adopting FPGA as an accelerator in datacenters is becoming mainstream for customized computing, but the fact that FPGAs are hard to program creates a steep learning curve for software programmers. Even with the help of high-level synthesis (HLS), accelerator designers still have to manually perform code reconstruction and cumbersome parameter tuning to achieve optimal performance. While many learning models have been leveraged by existing work to automate the design of efficient accelerators, the unpredictability of modern HLS tools becomes a major obstacle for them to maintain high accuracy. To address this problem, we propose an automated DSE framework—AutoDSE—that leverages a bottleneck-guided coordinate optimizer to systematically find a better design point. AutoDSE detects the bottleneck of the design in each step and focuses on high-impact parameters to overcome it. The experimental results show that AutoDSE is able to identify the design point that achieves, on the geometric mean, 19.9× speedup over one CPU core for MachSuite and Rodinia benchmarks. Compared to the manually optimized HLS vision kernels in Xilinx Vitis libraries, AutoDSE can reduce their optimization pragmas by 26.38× while achieving similar performance. With less than one optimization pragma per design on average, we are making progress towards democratizing customizable computing by enabling software programmers to design efficient FPGA accelerators.

查看原文本刊更多论文

AutoDSE:使软件程序员能够设计高效的FPGA加速器

采用FPGA作为数据中心的加速器正在成为定制计算的主流，但是FPGA很难编程的事实给软件程序员带来了一个陡峭的学习曲线。即使在高级合成(HLS)的帮助下，加速器设计人员仍然必须手动执行代码重构和繁琐的参数调优，以实现最佳性能。虽然许多学习模型已经被现有的工作利用来自动化高效加速器的设计，但现代HLS工具的不可预测性成为它们保持高精度的主要障碍。为了解决这个问题，我们提出了一个自动化的DSE框架- autose -它利用瓶颈引导坐标优化器系统地找到更好的设计点。AutoDSE在每个步骤中检测设计的瓶颈，并专注于高影响参数来克服它。实验结果表明，AutoDSE能够在MachSuite和Rodinia基准测试中识别出在一个CPU内核上实现19.9倍几何平均加速的设计点。与Xilinx Vitis库中手动优化的HLS视觉内核相比，AutoDSE可以在达到相似性能的同时减少26.38倍的优化程序。由于每个设计平均不到一个优化pragma，我们正在通过使软件程序员能够设计高效的FPGA加速器来实现可定制计算的民主化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Design Automation of Electronic Systems (TODAES)

自引率

0.00%

发文量