{"title":"基于fpga的基于高级综合方法的核密度估计带宽选择","authors":"A. Gramacki, Marek Sawerwain, J. Gramacki","doi":"10.1515/BPASTS-2016-0091","DOIUrl":null,"url":null,"abstract":"FPGA technology can offer significantly hi\\-gher performance at much lower power consumption than is available from CPUs and GPUs in many computational problems. Unfortunately, programming for FPGA (using ha\\-rdware description languages, HDL) is a difficult and not-trivial task and is not intuitive for C/C++/Java programmers. To bring the gap between programming effectiveness and difficulty the High Level Synthesis (HLS) approach is promoting by main FPGA vendors. Nowadays, time-intensive calculations are mainly performed on GPU/CPU architectures, but can also be successfully performed using HLS approach. In the paper we implement a bandwidth selection algorithm for kernel density estimation (KDE) using HLS and show techniques which were used to optimize the final FPGA implementation. We are also going to show that FPGA speedups, comparing to highly optimized CPU and GPU implementations, are quite substantial. Moreover, power consumption for FPGA devices is usually much less than typical power consumption of the present CPUs and GPUs.","PeriodicalId":298801,"journal":{"name":"arXiv: Other Computer Science","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"FPGA-Based Bandwidth Selection for Kernel Density Estimation Using High Level Synthesis Approach\",\"authors\":\"A. Gramacki, Marek Sawerwain, J. Gramacki\",\"doi\":\"10.1515/BPASTS-2016-0091\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"FPGA technology can offer significantly hi\\\\-gher performance at much lower power consumption than is available from CPUs and GPUs in many computational problems. Unfortunately, programming for FPGA (using ha\\\\-rdware description languages, HDL) is a difficult and not-trivial task and is not intuitive for C/C++/Java programmers. To bring the gap between programming effectiveness and difficulty the High Level Synthesis (HLS) approach is promoting by main FPGA vendors. Nowadays, time-intensive calculations are mainly performed on GPU/CPU architectures, but can also be successfully performed using HLS approach. In the paper we implement a bandwidth selection algorithm for kernel density estimation (KDE) using HLS and show techniques which were used to optimize the final FPGA implementation. We are also going to show that FPGA speedups, comparing to highly optimized CPU and GPU implementations, are quite substantial. Moreover, power consumption for FPGA devices is usually much less than typical power consumption of the present CPUs and GPUs.\",\"PeriodicalId\":298801,\"journal\":{\"name\":\"arXiv: Other Computer Science\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv: Other Computer Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1515/BPASTS-2016-0091\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv: Other Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/BPASTS-2016-0091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
摘要
在许多计算问题中,FPGA技术可以以比cpu和gpu低得多的功耗提供显著更高的性能。不幸的是,为FPGA编程(使用ha\硬件描述语言,HDL)是一项困难且不平凡的任务,对于C/ c++ /Java程序员来说并不直观。为了消除编程效率和编程难度之间的差距,各大FPGA厂商都在推广高级综合(High Level Synthesis, HLS)方法。目前,时间密集型计算主要在GPU/CPU架构上执行,但也可以使用HLS方法成功执行。在本文中,我们使用HLS和show技术实现了一种用于核密度估计(KDE)的带宽选择算法,该算法用于优化最终的FPGA实现。我们还将展示,与高度优化的CPU和GPU实现相比,FPGA的加速是相当可观的。此外,FPGA器件的功耗通常比当前cpu和gpu的典型功耗低得多。
FPGA-Based Bandwidth Selection for Kernel Density Estimation Using High Level Synthesis Approach
FPGA technology can offer significantly hi\-gher performance at much lower power consumption than is available from CPUs and GPUs in many computational problems. Unfortunately, programming for FPGA (using ha\-rdware description languages, HDL) is a difficult and not-trivial task and is not intuitive for C/C++/Java programmers. To bring the gap between programming effectiveness and difficulty the High Level Synthesis (HLS) approach is promoting by main FPGA vendors. Nowadays, time-intensive calculations are mainly performed on GPU/CPU architectures, but can also be successfully performed using HLS approach. In the paper we implement a bandwidth selection algorithm for kernel density estimation (KDE) using HLS and show techniques which were used to optimize the final FPGA implementation. We are also going to show that FPGA speedups, comparing to highly optimized CPU and GPU implementations, are quite substantial. Moreover, power consumption for FPGA devices is usually much less than typical power consumption of the present CPUs and GPUs.