Jacobium处理元件中的多端口存储器和浮点Cordic管道

1998 IEEE Workshop on Signal Processing Systems. SIPS 98. Design and Implementation (Cat. No.98TH8374) Pub Date : 1998-10-08 DOI:10.1109/SIPS.1998.715803

Alco Looye, G. Hekstra, E. Deprettere

{"title":"Jacobium处理元件中的多端口存储器和浮点Cordic管道","authors":"Alco Looye, G. Hekstra, E. Deprettere","doi":"10.1109/SIPS.1998.715803","DOIUrl":null,"url":null,"abstract":"The Jacobium is a dataflow processor intended for high-speed execution of a set of algorithms that are akin to the so-called Jacobi method for reducing a symmetric matrix to diagonal form using Givens rotations. The design of this processor has been undertaken as one of two cases in a recently proposed method for the quantitative analysis of domain-specific dataflow architectures. The method presupposes that the exploration of the processor's design space starts out of a given architecture template whose free parameters are to be determined in such a way that the ultimate specification is in some sense optimal for a set of applications that are given from the outset. Two templates have been considered in the Jacobium case: one for the complete (multiple processor element) processor and one for a typical processor element (PE). A parametrized VHDL version of the latter has been designed as well. The architecture of such a typical PE is presented here. It is equipped with a deep floating point Cordic pipeline, on-chip multiport memory to buffer operands and results, and several high-speed communication buses for communication between processing elements and the host. This parametrized architecture serves two purposes: it can provide realistic estimates for the PE parameters at the level of the complete processor; and it can be used to validate the exploration results.","PeriodicalId":151031,"journal":{"name":"1998 IEEE Workshop on Signal Processing Systems. SIPS 98. Design and Implementation (Cat. No.98TH8374)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Multiport memory and floating point Cordic pipeline in Jacobium processing elements\",\"authors\":\"Alco Looye, G. Hekstra, E. Deprettere\",\"doi\":\"10.1109/SIPS.1998.715803\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Jacobium is a dataflow processor intended for high-speed execution of a set of algorithms that are akin to the so-called Jacobi method for reducing a symmetric matrix to diagonal form using Givens rotations. The design of this processor has been undertaken as one of two cases in a recently proposed method for the quantitative analysis of domain-specific dataflow architectures. The method presupposes that the exploration of the processor's design space starts out of a given architecture template whose free parameters are to be determined in such a way that the ultimate specification is in some sense optimal for a set of applications that are given from the outset. Two templates have been considered in the Jacobium case: one for the complete (multiple processor element) processor and one for a typical processor element (PE). A parametrized VHDL version of the latter has been designed as well. The architecture of such a typical PE is presented here. It is equipped with a deep floating point Cordic pipeline, on-chip multiport memory to buffer operands and results, and several high-speed communication buses for communication between processing elements and the host. This parametrized architecture serves two purposes: it can provide realistic estimates for the PE parameters at the level of the complete processor; and it can be used to validate the exploration results.\",\"PeriodicalId\":151031,\"journal\":{\"name\":\"1998 IEEE Workshop on Signal Processing Systems. SIPS 98. Design and Implementation (Cat. No.98TH8374)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"1998 IEEE Workshop on Signal Processing Systems. SIPS 98. Design and Implementation (Cat. No.98TH8374)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SIPS.1998.715803\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"1998 IEEE Workshop on Signal Processing Systems. SIPS 98. Design and Implementation (Cat. No.98TH8374)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIPS.1998.715803","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

摘要

Jacobium是一种数据流处理器，用于高速执行一组算法，这些算法类似于所谓的Jacobi方法，用于使用给定旋转将对称矩阵简化为对角线形式。该处理器的设计是最近提出的领域特定数据流体系结构定量分析方法中的两个案例之一。该方法假定对处理器设计空间的探索从给定的体系结构模板开始，该模板的自由参数将以这样一种方式确定，即最终规范在某种意义上对于从一开始就给定的一组应用程序是最优的。Jacobium案例中考虑了两个模板:一个用于完整的(多处理器元素)处理器，另一个用于典型的处理器元素(PE)。并设计了后者的参数化VHDL版本。本文给出了这种典型PE的体系结构。它配备了一个深度浮点Cordic管道，片上多端口存储器来缓冲操作数和结果，以及几条高速通信总线用于处理元件和主机之间的通信。这种参数化架构有两个目的:它可以提供完整处理器级别的PE参数的实际估计;并可用于验证勘探结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multiport memory and floating point Cordic pipeline in Jacobium processing elements

The Jacobium is a dataflow processor intended for high-speed execution of a set of algorithms that are akin to the so-called Jacobi method for reducing a symmetric matrix to diagonal form using Givens rotations. The design of this processor has been undertaken as one of two cases in a recently proposed method for the quantitative analysis of domain-specific dataflow architectures. The method presupposes that the exploration of the processor's design space starts out of a given architecture template whose free parameters are to be determined in such a way that the ultimate specification is in some sense optimal for a set of applications that are given from the outset. Two templates have been considered in the Jacobium case: one for the complete (multiple processor element) processor and one for a typical processor element (PE). A parametrized VHDL version of the latter has been designed as well. The architecture of such a typical PE is presented here. It is equipped with a deep floating point Cordic pipeline, on-chip multiport memory to buffer operands and results, and several high-speed communication buses for communication between processing elements and the host. This parametrized architecture serves two purposes: it can provide realistic estimates for the PE parameters at the level of the complete processor; and it can be used to validate the exploration results.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

1998 IEEE Workshop on Signal Processing Systems. SIPS 98. Design and Implementation (Cat. No.98TH8374)

自引率

0.00%

发文量