Title: Parallel Algorithms for FIR Computation Mapped to ESCA Architecture
Authors: Pan Chen, Kui Dai, Dan Wu, J. Rao, X. Zou
Venue: 2010 WASE International Conference on Information Engineering
Published: 2010-08-14
DOI: 10.1109/ICIE.2010.37 (https://doi.org/10.1109/ICIE.2010.37)
Citations: 4
Abstract
In this paper, we present a parallel algorithm for FIR (Finite Impulse Response) filter computation on the Engineering and Scientific Computation Accelerator (ESCA) system. ESCA is a heterogeneous multi-core architecture designed to accelerate compute-intensive parallel workloads in high-performance applications. By exploiting the SIMD processing elements (PEs) and the high-bandwidth, low-latency hierarchical on-chip networks inside ESCA, we obtain good parallel performance and derive an implementation of the FIR kernel. By recasting the FIR computation as a matrix-vector multiplication, we propose an improved implementation of the FIR algorithm that achieves higher performance.
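The abstract does not spell out how the FIR computation is recast as a matrix-vector product, so the following is only a minimal generic sketch of that idea, not the authors' ESCA mapping. It assumes the standard direct-form definition y[n] = sum over k of h[k] * x[n-k], with samples before the start of the signal taken as zero. The function names (`fir_direct`, `build_sample_matrix`, `matvec`, `fir_matvec`) are hypothetical and chosen for illustration.

```python
def fir_direct(x, h):
    """Direct-form FIR: y[n] = sum_k h[k] * x[n - k], with x[n] = 0 for n < 0."""
    y = []
    for n in range(len(x)):
        acc = 0.0
        for k in range(len(h)):
            if n - k >= 0:
                acc += h[k] * x[n - k]
        y.append(acc)
    return y


def build_sample_matrix(x, num_taps):
    """Build the matrix whose row n is [x[n], x[n-1], ..., x[n-num_taps+1]].

    Entries that would index before the start of the signal are zero.
    (Assumption: zero initial conditions; the paper may handle borders differently.)
    """
    return [
        [x[n - k] if n - k >= 0 else 0.0 for k in range(num_taps)]
        for n in range(len(x))
    ]


def matvec(matrix, vec):
    """Plain matrix-vector product; every row is an independent dot product."""
    return [sum(a * b for a, b in zip(row, vec)) for row in matrix]


def fir_matvec(x, h):
    """FIR expressed as (sample matrix) times (coefficient vector)."""
    return matvec(build_sample_matrix(x, len(h)), h)


if __name__ == "__main__":
    samples = [1.0, 2.0, 3.0, 4.0]
    taps = [0.5, 0.25]
    print(fir_direct(samples, taps))
    print(fir_matvec(samples, taps))
```

Because each output sample is an independent dot product of one matrix row with the coefficient vector, the rows can in principle be distributed across parallel lanes or processing elements; how the paper actually partitions the work over ESCA's PEs is not stated in the abstract.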