2018 International Conference on Field-Programmable Technology (FPT)最新文献_第7页

Scheduling Algorithms for High Performance Network Switching on FPGAs: A Survey 基于fpga的高性能网络交换调度算法综述

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI: 10.1109/FPT.2018.00033

Nadeen Gebara, Jiuxi Meng, W. Luk, Paolo Costa

引用次数: 5

Enabling Overclocking Through Algorithm-Level Error Detection 通过算法级错误检测使能超频

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI: 10.1109/FPT.2018.00034

T. Marty, Tomofumi Yuki, Steven Derrien

引用次数: 7

Mapping Estimator for OpenCL Heterogeneous Accelerators OpenCL异构加速器的映射估计器

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI: 10.1109/FPT.2018.00057

A. B. Perina, Vanderlei Bonato

引用次数: 1

Lens Distortion Self-Calibration Using the Hough Transform 基于霍夫变换的透镜畸变自校正

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI: 10.1109/FPT.2018.00080

D. Bailey, Yuan Chang, S. L. Moan

引用次数: 1

Checking for Electrical Level Security Threats in Bitstreams for Multi-tenant FPGAs 检查多租户fpga位流中的电级安全威胁

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI: 10.1109/FPT.2018.00055

Dennis R. E. Gnad, Sascha Rapp, Jonas Krautter, M. Tahoori

引用次数: 21

ReFiRe: Efficient Deployment of Remote Fine-Grained Reconfigurable Accelerators ReFiRe:远程细粒度可重构加速器的有效部署

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI: 10.1109/FPT.2018.00064

Emmanouil Pissadakis, Nikolaos S. Alachiotis, P. Skrimponis, D. Theodoropoulos, T. Korakis, D. Pnevmatikatos

{"title":"ReFiRe: Efficient Deployment of Remote Fine-Grained Reconfigurable Accelerators","authors":"Emmanouil Pissadakis, Nikolaos S. Alachiotis, P. Skrimponis, D. Theodoropoulos, T. Korakis, D. Pnevmatikatos","doi":"10.1109/FPT.2018.00064","DOIUrl":"https://doi.org/10.1109/FPT.2018.00064","url":null,"abstract":"The need for specialized hardware acceleration in today's computing platforms is well established, due to power and efficiency reasons. Broadening an accelerator's scope of application is highly desirable, but requires a finer-grained architecture with basic primitives, which inevitably exhibits increased communication and synchronization requirements. In disaggregated-computing environ-ments, where data transfers between remote nodes are realized via datacenter-wide packet exchanges, reducing communication and synchronization is a prerequisite for the effective employment of remote acceleration. To this end, we present ReFiRe (Remote Fine-grained Reconfigurable acceleration), a generic deployment framework with native support for partial reconfiguration that allows to considerably reduce communication needs between a processor and remote accelerators. This is achieved by shifting control flow, partial reconfiguration, and execution decisions to the remote side through arbitrarily long instructions that encapsulate complex sequences of operations and their re-spective synchronization requirements. ReFiRe outperforms an SDSoC-generated accelerator system that employs the same accelerator cores to boost performance of a genomics application that detects positive selection.","PeriodicalId":434541,"journal":{"name":"2018 International Conference on Field-Programmable Technology (FPT)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130214049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Enhancing Memory Bandwidth in a Single Stream Computation with Multiple FPGAs 用多个fpga增强单流计算中的内存带宽

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI: 10.1109/FPT.2018.00078

Antoniette Mondigo, K. Sano, H. Takizawa

引用次数: 1

Speed and Resource Optimization of BFGS Quasi-Newton Implementation on FPGA Using Inexact Line Search Method for Neural Network Training 基于非精确线搜索法的BFGS准牛顿实现在FPGA上的速度和资源优化

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI: 10.1109/FPT.2018.00074

Jia Liu, Qiang Liu

{"title":"Speed and Resource Optimization of BFGS Quasi-Newton Implementation on FPGA Using Inexact Line Search Method for Neural Network Training","authors":"Jia Liu, Qiang Liu","doi":"10.1109/FPT.2018.00074","DOIUrl":"https://doi.org/10.1109/FPT.2018.00074","url":null,"abstract":"Quasi-Newton (QN) method is one of the most effective Neural Network (NN) training methods. However, QN training often needs long time especially when the NN architecture is large. The BFGS-QN has been implemented on FPGA for accelerating the training process. The experimental results show that the line search module of BFGS-QN is the most timeconsuming module because of its frequent objective function evaluation. In order to solve the issue, an inexact line search method, Armijo-Goldstein (AG) method, is implemented to replace the original exact line search method-Golden Section (GS) method. For the highest training speed, an end-to-end FPGA version of BFGS using AG method is implemented. Moreover, the efficiency AG method makes it possible for hardware-software co-design. The objective function evalution unit in line search module which consumes the most computional resource is moved to CPU for a speed and resource tradeoff. The experimental results show that the end-to-end FPGA BFGS-AG implementation achieves up to 239 times speed up compared with software implementation. The FPGA+CPU BFGS-AG implementation is up to 153.1 times faster than the end-to-end software implementation and achieves up to 45% LUT, 29% FF and 64% DSP reduction.","PeriodicalId":434541,"journal":{"name":"2018 International Conference on Field-Programmable Technology (FPT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122686485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

High Throughput CNN Accelerator Design Based on FPGA 基于FPGA的高吞吐量CNN加速器设计

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI: 10.1109/FPT.2018.00052

Liang Xie, Xitian Fan, Wei Cao, Lingli Wang

引用次数: 7

LeFlow: Automatic Compilation of TensorFlow Machine Learning Applications to FPGAs 自动编译TensorFlow机器学习在fpga上的应用

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI: 10.1109/FPT.2018.00082

D. H. Noronha, Kahlan Gibson, B. Salehpour, S. Wilton

引用次数: 9