A. Genusov, Peter Feldman, R. Friedlander, V. Fruchter, R. Jaliff, A. Mohr, R. Shenhav
{"title":"一种新型的、高度并行的32位浮点DSP矢量信号处理器","authors":"A. Genusov, Peter Feldman, R. Friedlander, V. Fruchter, R. Jaliff, A. Mohr, R. Shenhav","doi":"10.1109/ICASSP.1988.197049","DOIUrl":null,"url":null,"abstract":"A new 32-bit floating point (IEEE standard) (digital signal processing) DSP vector signal processor architecture is described. The internal architecture is highly parallel. It is based on six well coordinated, independent machines. The ALU (arithmetic logic unit) has a pipeline structure optimized for the execution of DSP and matrix operations (FFT butterflies in particular). The highly flexible set of vectorized instructions allows for most efficient utilization of the internal assets. Together these features yield a high performance, high throughput processor with 31-Mflops computation power and very minimal overhead. A description is given of the architecture of the device, the different internal units and their coordination. The instruction set basic features are presented, and a few benchmarks of a single processor are given. A simple, minimal system architecture combining two processors sharing a single bus, doubling the throughput of a single processor system, is suggested.<<ETX>>","PeriodicalId":448544,"journal":{"name":"ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1988-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A new, highly parallel, 32 bit floating point DSP vector signal processor\",\"authors\":\"A. Genusov, Peter Feldman, R. Friedlander, V. Fruchter, R. Jaliff, A. Mohr, R. Shenhav\",\"doi\":\"10.1109/ICASSP.1988.197049\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A new 32-bit floating point (IEEE standard) (digital signal processing) DSP vector signal processor architecture is described. The internal architecture is highly parallel. It is based on six well coordinated, independent machines. The ALU (arithmetic logic unit) has a pipeline structure optimized for the execution of DSP and matrix operations (FFT butterflies in particular). The highly flexible set of vectorized instructions allows for most efficient utilization of the internal assets. Together these features yield a high performance, high throughput processor with 31-Mflops computation power and very minimal overhead. A description is given of the architecture of the device, the different internal units and their coordination. The instruction set basic features are presented, and a few benchmarks of a single processor are given. A simple, minimal system architecture combining two processors sharing a single bus, doubling the throughput of a single processor system, is suggested.<<ETX>>\",\"PeriodicalId\":448544,\"journal\":{\"name\":\"ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1988-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASSP.1988.197049\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1988.197049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A new, highly parallel, 32 bit floating point DSP vector signal processor
A new 32-bit floating point (IEEE standard) (digital signal processing) DSP vector signal processor architecture is described. The internal architecture is highly parallel. It is based on six well coordinated, independent machines. The ALU (arithmetic logic unit) has a pipeline structure optimized for the execution of DSP and matrix operations (FFT butterflies in particular). The highly flexible set of vectorized instructions allows for most efficient utilization of the internal assets. Together these features yield a high performance, high throughput processor with 31-Mflops computation power and very minimal overhead. A description is given of the architecture of the device, the different internal units and their coordination. The instruction set basic features are presented, and a few benchmarks of a single processor are given. A simple, minimal system architecture combining two processors sharing a single bus, doubling the throughput of a single processor system, is suggested.<>