P. Morrow, D. Crookes, T. Brown, G. McAleese, D. Roantree, I. Spence
{"title":"Efficient implementation of a portable parallel programming model for image processing","authors":"P. Morrow, D. Crookes, T. Brown, G. McAleese, D. Roantree, I. Spence","doi":"10.1002/(SICI)1096-9128(199909)11:11%3C671::AID-CPE450%3E3.0.CO;2-6","DOIUrl":null,"url":null,"abstract":"This paper describes a domain specific programming model for execution on parallel and distributed architectures. The model has initially been targeted at the application area of image processing, though the techniques developed may be more generally applicable to other domains where an algebraic or library-based approach is common. Efficiency is achieved by the concept of a self-optimising class library of primitive image processing operations, which allows programs to be written in a high level, algebraic notation and which is automatically parallelised (using an application-specific data parallel approach). The class library is extended automatically with optimised operations, generated by a transformation system, giving improved execution performance. The parallel implementation of the model described here is based on MPI and has been tested on a C40 processor network, a quad-processor Unix workstation, and a network of PCs running Linux. Timings are included to indicate the impact of the automatic optimisation facility (rather than the effect of parallelisation). Copyright © 1999 John Wiley & Sons, Ltd.","PeriodicalId":199059,"journal":{"name":"Concurr. Pract. Exp.","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"24","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurr. Pract. Exp.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/(SICI)1096-9128(199909)11:11%3C671::AID-CPE450%3E3.0.CO;2-6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 24
一种可移植的图像处理并行编程模型的高效实现
本文描述了一种在并行和分布式体系结构上执行的特定领域编程模型。该模型最初是针对图像处理的应用领域,尽管所开发的技术可能更普遍地适用于其他领域,其中代数或基于库的方法是常见的。效率是通过自优化原始图像处理操作类库的概念实现的,它允许程序以高级代数符号编写,并自动并行化(使用特定于应用程序的数据并行方法)。类库通过转换系统生成的优化操作自动扩展,从而提高了执行性能。本文描述的模型的并行实现基于MPI,并已在C40处理器网络、四处理器Unix工作站和运行Linux的pc网络上进行了测试。包括计时来指示自动优化设施的影响(而不是并行化的影响)。版权所有©1999 John Wiley & Sons, Ltd
本文章由计算机程序翻译,如有差异,请以英文原文为准。