Processor/memory/array size tradeoffs in the design of SIMD arrays for a spatially mapped workload

M. Herbordt, A. Anand, O. Kidwai, R. Sam, C. Weems
{"title":"Processor/memory/array size tradeoffs in the design of SIMD arrays for a spatially mapped workload","authors":"M. Herbordt, A. Anand, O. Kidwai, R. Sam, C. Weems","doi":"10.1109/CAMP.1997.631884","DOIUrl":null,"url":null,"abstract":"Though massively parallel SIMD arrays continue to be promising for many computer vision applications, they have undergone few systematic empirical studies. The problems include the size of the architecture space, the lack of portability of the test programs, and the inherent complexity of simulating up to hundreds of thousands of processing elements. The latter two issues have been addressed previously, here we describe how spreadsheets and tk/tcl are used to endow our simulator with the flexibility to model a large variety of designs. The utility of this approach is shown in the second half of the paper where results are presented as to the performance of a large number of array size, datapath, register file, and application code combinations. The conclusions derived include the utility of multiplier and floating point support, the cost of virtual PE emulation, likely datapath/memory combinations, and overall designs with the most promising performance/chip area ratios.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAMP.1997.631884","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Though massively parallel SIMD arrays continue to be promising for many computer vision applications, they have undergone few systematic empirical studies. The problems include the size of the architecture space, the lack of portability of the test programs, and the inherent complexity of simulating up to hundreds of thousands of processing elements. The latter two issues have been addressed previously, here we describe how spreadsheets and tk/tcl are used to endow our simulator with the flexibility to model a large variety of designs. The utility of this approach is shown in the second half of the paper where results are presented as to the performance of a large number of array size, datapath, register file, and application code combinations. The conclusions derived include the utility of multiplier and floating point support, the cost of virtual PE emulation, likely datapath/memory combinations, and overall designs with the most promising performance/chip area ratios.
为空间映射工作负载设计SIMD阵列时处理器/内存/阵列大小的权衡
尽管大规模并行SIMD阵列在许多计算机视觉应用中仍然很有前景,但它们很少经过系统的实证研究。这些问题包括体系结构空间的大小,测试程序的可移植性的缺乏,以及模拟多达数十万个处理元素的固有复杂性。后两个问题以前已经解决了,这里我们描述了如何使用电子表格和tk/tcl赋予我们的模拟器灵活性,以模拟各种各样的设计。本文的后半部分展示了这种方法的效用,其中给出了大量数组大小、数据路径、寄存器文件和应用程序代码组合的性能结果。得出的结论包括乘法器和浮点支持的效用、虚拟PE仿真的成本、可能的数据路径/内存组合以及具有最有希望的性能/芯片面积比的总体设计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信