{"title":"Mapping schemes of image recognition tasks onto highly parallel SIMD/MIMD processors","authors":"S. Kyo, Shouhei Nomoto, S. Okazaki","doi":"10.1109/ICDSC.2009.5289350","DOIUrl":null,"url":null,"abstract":"Smart camera applications based on image recognition techniques require significant levels of computation and must operate within limited power budgets. This paper focuses on the schemes of mapping image recognition tasks onto a series of low-power highly parallel SIMD/MIMD mode switching processors called IMAPCAR2. In this paper, we discuss hardware design considerations, the schemes of mapping image tasks onto the architecture using the SIMD or MIMD execution modes, and the way to choose between execution modes. Benchmark results show that the measured performance of an IMAPCAR2-300 (108 MHz, 128 PE / 32 PU, 90-nm, ≪ 1 W) processor running the compiler-generated code of programs based on the proposed mapping schemes is up to 27 times faster using the SIMD mode, or up to 2.8 times faster using the MP mode than a 1.6GHz general purpose processor that consumes a similar amount of power.","PeriodicalId":324810,"journal":{"name":"2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSC.2009.5289350","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Smart camera applications based on image recognition techniques require significant levels of computation and must operate within limited power budgets. This paper focuses on the schemes of mapping image recognition tasks onto a series of low-power highly parallel SIMD/MIMD mode switching processors called IMAPCAR2. In this paper, we discuss hardware design considerations, the schemes of mapping image tasks onto the architecture using the SIMD or MIMD execution modes, and the way to choose between execution modes. Benchmark results show that the measured performance of an IMAPCAR2-300 (108 MHz, 128 PE / 32 PU, 90-nm, ≪ 1 W) processor running the compiler-generated code of programs based on the proposed mapping schemes is up to 27 times faster using the SIMD mode, or up to 2.8 times faster using the MP mode than a 1.6GHz general purpose processor that consumes a similar amount of power.