{"title":"Computing parallel prefix and reduction using coterie structures","authors":"M. Herbordt, C. Weems","doi":"10.1109/FMPC.1992.234895","DOIUrl":null,"url":null,"abstract":"The efficient computation of region parameters in image understanding by a SIMD (single-instruction multiple-data) array requires that those regions be processed simultaneously. The difficulty is in orchestrating nonuniform data-dependent communication using only a single thread of control. The authors have found that, on reconfigurable broadcast meshes, coterie structures can be used to overcome this problem. They present a deterministic algorithm to compute parallel prefix in O(log N) communication steps for a number of real images and sketch a randomized reduction algorithm based on graph contraction that has O(log N) complexity for all images.<<ETX>>","PeriodicalId":117789,"journal":{"name":"[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1992-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FMPC.1992.234895","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The efficient computation of region parameters in image understanding by a SIMD (single-instruction multiple-data) array requires that those regions be processed simultaneously. The difficulty is in orchestrating nonuniform data-dependent communication using only a single thread of control. The authors have found that, on reconfigurable broadcast meshes, coterie structures can be used to overcome this problem. They present a deterministic algorithm to compute parallel prefix in O(log N) communication steps for a number of real images and sketch a randomized reduction algorithm based on graph contraction that has O(log N) complexity for all images.<>