Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97最新文献

筛选
英文 中文
A parallelizing method for implementing image processing tasks on SIMD linear processor arrays 一种在SIMD线性处理器阵列上实现图像处理任务的并行化方法
S. Kyo, S. Okazaki, Y. Fujita, N. Yamashita
{"title":"A parallelizing method for implementing image processing tasks on SIMD linear processor arrays","authors":"S. Kyo, S. Okazaki, Y. Fujita, N. Yamashita","doi":"10.1109/CAMP.1997.631944","DOIUrl":"https://doi.org/10.1109/CAMP.1997.631944","url":null,"abstract":"This paper describes Stack-Based Method (SBM), a general parallelizing method for implementing region parallel operations on Linear Processor Arrays (LPAs). By taking full advantage of the addressing autonomy of each PE. SBM proceeds a pixel-updating-wave (PUW) in a image content directed way across the the image which is separately located on local memories of PEs. Based on SBM, each PE can detect and operate only on pixel data which are ready to be operated in a parallel and autonomous way under the SIMD constraint. Enhancements of existing LPA architectures for full support of SBM is also discussed.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116958143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Three dimensional graphics algorithms on the Micro-Grain Array Processor. II 微颗粒阵列处理器上的三维图形算法。2
B. Bishop, Yan Zhang, K. Acken, M. J. Irwin, R. Owens
{"title":"Three dimensional graphics algorithms on the Micro-Grain Array Processor. II","authors":"B. Bishop, Yan Zhang, K. Acken, M. J. Irwin, R. Owens","doi":"10.1109/CAMP.1997.632000","DOIUrl":"https://doi.org/10.1109/CAMP.1997.632000","url":null,"abstract":"High performance graphics subsystems play a critical role in many computer systems, but result in high priced systems that often still fall short of the required graphics performance. The root of the problem lies in the complex graphics algorithms that require large amounts of object data to be manipulated with high throughput, such as what is required for ray tracing. This has resulted in complex, real-time 3D graphics to be limited to high-end systems. In this paper, we present a library of 3D graphics algorithms that have been mapped to the Micro-Grain Array Processor (MGAP), an inexpensive and versatile SIMD processing board capable of fitting in a typical workstation. Our results show that the MGAP can produce comparable data throughput as more costly graphics subsystems, while maintaining the flexibility of being a general purpose parallel machine.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126168702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
FPGA-based computing in computer vision 基于fpga的计算机视觉计算
N. Ratha, A. K. Jain
{"title":"FPGA-based computing in computer vision","authors":"N. Ratha, A. K. Jain","doi":"10.1109/CAMP.1997.631921","DOIUrl":"https://doi.org/10.1109/CAMP.1997.631921","url":null,"abstract":"Algorithms in computer vision are characterized by (i) complex and repetitive operations; (ii) large amount of data and (iii) a variety of data interaction (e.g., point operations, neighborhood operations, global operations). Based on the computation and communication complexity, vision algorithms have been characterized into three categories: (i) low-level, (ii) intermediate-level and (iii) high-level. In this paper, we describe the usage of custom computing approach to meet the computation and communication needs of computer vision algorithms. By customizing hardware architecture for every application at the instruction level, the optimal grain size needed for the problem at hand and the instruction granularity can be matched. Field Programmable Gate Array (FPGA) based processing elements (PEs) are being used to provide this facility. Using programmable communication resources, the diverse communication requirements can be met. A vision system needs to integrate hardware for the three levels. A custom computing approach alleviates the problem of achieving optimal granularity for different stages as the same hardware gets reconfigured at a software level for different levels of the application. We demonstrate the advantages of our approach using Splash 2-a Xilinx 4010-based custom computer.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128232239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Real-time hierarchical visual tracking using a configurable computing machine 使用可配置计算机的实时分层视觉跟踪
B. Pudipeddi, A. L. Abbott, P. Athanas
{"title":"Real-time hierarchical visual tracking using a configurable computing machine","authors":"B. Pudipeddi, A. L. Abbott, P. Athanas","doi":"10.1109/CAMP.1997.631931","DOIUrl":"https://doi.org/10.1109/CAMP.1997.631931","url":null,"abstract":"This paper describes a custom computing approach to real-time visual tracking. Traditionally, tracking systems require dedicated hardware to accommodate the computational demands and input/output rates imposed by real-time video sources. A radical alternative is represented by custom computing machines such as Splash 2, which use interconnected Field-Programmable Gate Arrays (FPGAs) to provide fine-grain parallelism and reconfigurability so that high-speed performance is possible for many different applications. The efficacy of such architectures to image-based computing is illustrated here through the implementation of a tracking system that consists of two parts: a Gaussian pyramid generator and a correlation-based tracker. The pyramid generator converts each input image to a hierarchy of images, each representing the original image at a different resolution. An object is tracked on successive frames by a coarse-to-fine search through this image hierarchy, using the sum of absolute differences as the matching criterion. Splash 2 performs these operations at rates of 15 or 30 frames per second. Its performance therefore rivals that of application-specific systems, although the architecture is inherently general-purpose in nature.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129356923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Hardware-software aspects of shift-register based NEWS networks for the focal plane 基于移位寄存器的焦平面新闻网络的硬件软件方面
R. Nguyen, D. Mercier, A. Jullian, T. Bernard
{"title":"Hardware-software aspects of shift-register based NEWS networks for the focal plane","authors":"R. Nguyen, D. Mercier, A. Jullian, T. Bernard","doi":"10.1109/CAMP.1997.631904","DOIUrl":"https://doi.org/10.1109/CAMP.1997.631904","url":null,"abstract":"Processing medium-size images in the chip where they are sensed has now become technologically possible. Such devices are called artificial retinas. In order to set up a whole programmable boolean array processor in the focal plane, a specific NEWS interconnection network is required that trades off between speed, silicon area, energy consumption, and controllability. Shift-register based solutions are considered here. A scalable design is presented in which silicon area only is minimized, as the major constraint. Its performances are then quantitatively analysed with respect to the three other criteria and shown to present some significant weaknesses. This leads to relax the area constraint and to propose a second solution that improves the overall trade-off at the expense of a small area increase.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"122 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115956835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Investigating real-time validation of real-time image processing ASICs 研究实时图像处理asic的实时验证
I. Kraljic, F. Verdier, G. Quénot, B. Zavidovique
{"title":"Investigating real-time validation of real-time image processing ASICs","authors":"I. Kraljic, F. Verdier, G. Quénot, B. Zavidovique","doi":"10.1109/CAMP.1997.631914","DOIUrl":"https://doi.org/10.1109/CAMP.1997.631914","url":null,"abstract":"The research presented in this paper aims at designing real-time image processing Application Specific Integrated Circuits (ASICs), with emphasis on the need for correct circuits. The methodology is based on a dedicated emulator, the Data-Flow Functional Computer (DFFC), whose peak capacity is 20 million gates operating at 25 MHz. Applications are firstly validated in their target environment (real time, real-world scenes) during emulation on the DFFC. Two integration methods have been implemented: derivation and synthesis. The derivation method optimizes the architecture validated on the emulator, while the synthesis approach is not constrained by the emulator architecture, and thus allows to generate other (optimized) architectures.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133978008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Vision chip architecture using general-purpose processing elements for 1 ms vision system 采用通用处理元件的视觉芯片架构,用于1ms视觉系统
T. Komuro, I. Ishii, M. Ishikawa
{"title":"Vision chip architecture using general-purpose processing elements for 1 ms vision system","authors":"T. Komuro, I. Ishii, M. Ishikawa","doi":"10.1109/CAMP.1997.632052","DOIUrl":"https://doi.org/10.1109/CAMP.1997.632052","url":null,"abstract":"This paper describes a vision chip architecture for high-speed vision systems that we propose. The chip has general-purpose processing elements (PEs) in massively parallel architecture, with each PE directly connected to photo-detectors. Control programs allow various visual processing applications and algorithms to be implemented. A sampling rate of 1 ms is enough to realize high-speed visual feedback for robot control. To integrate as many PEs as possible on a single chip a compact design is required, so we aim to create a very simple architecture. The sample design has been implemented into an FPGA chip; a full custom chip has also been designed and has been submitted for fabrication.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132860655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 58
The CC/IPP, an MIMD-SIMD architecture for image processing and pattern recognition CC/IPP,一种用于图像处理和模式识别的MIMD-SIMD体系结构
P. Jonker, J. Vogelbruch
{"title":"The CC/IPP, an MIMD-SIMD architecture for image processing and pattern recognition","authors":"P. Jonker, J. Vogelbruch","doi":"10.1109/CAMP.1997.631887","DOIUrl":"https://doi.org/10.1109/CAMP.1997.631887","url":null,"abstract":"This paper describes the design of a heterogeneous system for high speed image processing, based on commodity components. The architecture is built around MIMD processor boards linked by a high speed network. SIMD boards can be connected to MIMD nodes through the PCI bus. This paper describes the system, a framework to use this heterogeneous system as a single entity, and the work in progress on the creation of a homogeneous programming model for fast application building. To show the validity of the concept, the paper concludes with a performance analysis of a VLSI mask inspection application for an MIMD system with and without attached SIMD system.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"136 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116380760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Processor/memory/array size tradeoffs in the design of SIMD arrays for a spatially mapped workload 为空间映射工作负载设计SIMD阵列时处理器/内存/阵列大小的权衡
M. Herbordt, A. Anand, O. Kidwai, R. Sam, C. Weems
{"title":"Processor/memory/array size tradeoffs in the design of SIMD arrays for a spatially mapped workload","authors":"M. Herbordt, A. Anand, O. Kidwai, R. Sam, C. Weems","doi":"10.1109/CAMP.1997.631884","DOIUrl":"https://doi.org/10.1109/CAMP.1997.631884","url":null,"abstract":"Though massively parallel SIMD arrays continue to be promising for many computer vision applications, they have undergone few systematic empirical studies. The problems include the size of the architecture space, the lack of portability of the test programs, and the inherent complexity of simulating up to hundreds of thousands of processing elements. The latter two issues have been addressed previously, here we describe how spreadsheets and tk/tcl are used to endow our simulator with the flexibility to model a large variety of designs. The utility of this approach is shown in the second half of the paper where results are presented as to the performance of a large number of array size, datapath, register file, and application code combinations. The conclusions derived include the utility of multiplier and floating point support, the cost of virtual PE emulation, likely datapath/memory combinations, and overall designs with the most promising performance/chip area ratios.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123614403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
External loop unrolling of image processing programs: optimal register allocation for RISC architectures 图像处理程序的外部循环展开:RISC体系结构的最佳寄存器分配
N. Zingirian, M. Maresca
{"title":"External loop unrolling of image processing programs: optimal register allocation for RISC architectures","authors":"N. Zingirian, M. Maresca","doi":"10.1109/CAMP.1997.631891","DOIUrl":"https://doi.org/10.1109/CAMP.1997.631891","url":null,"abstract":"Most of today's image processing applications rely on the computing power delivered by RISC processors. RISC processors are load/store architectures in the sense that their instructions can process only operands present in CPU registers. Finding a register allocation that reduces or possibly minimizes the number of load/store instructions is one of the main concerns in the efficient implementation of image processing programs on load/store architectures. The execution speedups delivered by source program transformations and in particular by external loop unrolling transformation applied to image processing programs-largely experimented in our previous works-led us to undertake an analytical investigation on the register allocation delivered by such source program transformations. In this paper we present a proof that external loop unrolling asymptotically achieves an optimal register allocation for a large class of image processing programs.","PeriodicalId":274177,"journal":{"name":"Proceedings Fourth IEEE International Workshop on Computer Architecture for Machine Perception. CAMP'97","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132377047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信