视觉特征识别的表征

2003 IEEE International Conference on Communications (Cat. No.03CH37441) Pub Date : 2003-12-03 DOI:10.1109/WWC.2003.1249052

B. Mathew, A. Davis, R. Evans

{"title":"视觉特征识别的表征","authors":"B. Mathew, A. Davis, R. Evans","doi":"10.1109/WWC.2003.1249052","DOIUrl":null,"url":null,"abstract":"Natural human interfaces are a key to realizing the dream of ubiquitous computing. This implies that embedded systems must be capable of sophisticated perception tasks. This paper analyzes the nature of a visual feature recognition workload. Visual feature recognition is a key component of a number of important applications, e.g. gesture based interfaces, lip tracking to augment speech recognition, smart cameras, automated surveillance systems, robotic vision, etc. Given the power sensitive nature of the embedded space and the natural conflict between low-power and high-performance implementations, a precise understanding of these algorithms is an important step in developing efficient visual feature recognition applications for the embedded space. In particular, this work analyzes the performance characteristics of flesh toning, face detection and face recognition codes based on well known algorithms. We show that the problem can be decomposed into a pipeline of filters which could lead to efficient implementations as stream processors. With better than 92% hit rate for a modest 16KB L1 data cache, the algorithms have memory system behavior commensurate with embedded processors. However, our results indicate that their execution requirements strain the performance available on current embedded systems.","PeriodicalId":432745,"journal":{"name":"2003 IEEE International Conference on Communications (Cat. No.03CH37441)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"A characterization of visual feature recognition\",\"authors\":\"B. Mathew, A. Davis, R. Evans\",\"doi\":\"10.1109/WWC.2003.1249052\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Natural human interfaces are a key to realizing the dream of ubiquitous computing. This implies that embedded systems must be capable of sophisticated perception tasks. This paper analyzes the nature of a visual feature recognition workload. Visual feature recognition is a key component of a number of important applications, e.g. gesture based interfaces, lip tracking to augment speech recognition, smart cameras, automated surveillance systems, robotic vision, etc. Given the power sensitive nature of the embedded space and the natural conflict between low-power and high-performance implementations, a precise understanding of these algorithms is an important step in developing efficient visual feature recognition applications for the embedded space. In particular, this work analyzes the performance characteristics of flesh toning, face detection and face recognition codes based on well known algorithms. We show that the problem can be decomposed into a pipeline of filters which could lead to efficient implementations as stream processors. With better than 92% hit rate for a modest 16KB L1 data cache, the algorithms have memory system behavior commensurate with embedded processors. However, our results indicate that their execution requirements strain the performance available on current embedded systems.\",\"PeriodicalId\":432745,\"journal\":{\"name\":\"2003 IEEE International Conference on Communications (Cat. No.03CH37441)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2003-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2003 IEEE International Conference on Communications (Cat. No.03CH37441)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WWC.2003.1249052\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2003 IEEE International Conference on Communications (Cat. No.03CH37441)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WWC.2003.1249052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

摘要

自然的人机界面是实现普适计算梦想的关键。这意味着嵌入式系统必须能够执行复杂的感知任务。本文分析了视觉特征识别工作量的性质。视觉特征识别是许多重要应用的关键组成部分，例如基于手势的界面、增强语音识别的唇形跟踪、智能摄像头、自动监控系统、机器人视觉等。鉴于嵌入式空间的功率敏感特性以及低功耗和高性能实现之间的自然冲突，精确理解这些算法是为嵌入式空间开发高效视觉特征识别应用的重要一步。特别地，本工作分析了基于已知算法的调肤、人脸检测和人脸识别代码的性能特征。我们展示了这个问题可以被分解成一个过滤器的管道，它可以导致高效的实现作为流处理器。对于中等大小的16KB L1数据缓存，该算法的命中率超过92%，具有与嵌入式处理器相称的内存系统行为。然而，我们的结果表明，它们的执行需求对当前嵌入式系统的可用性能造成了压力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A characterization of visual feature recognition

Natural human interfaces are a key to realizing the dream of ubiquitous computing. This implies that embedded systems must be capable of sophisticated perception tasks. This paper analyzes the nature of a visual feature recognition workload. Visual feature recognition is a key component of a number of important applications, e.g. gesture based interfaces, lip tracking to augment speech recognition, smart cameras, automated surveillance systems, robotic vision, etc. Given the power sensitive nature of the embedded space and the natural conflict between low-power and high-performance implementations, a precise understanding of these algorithms is an important step in developing efficient visual feature recognition applications for the embedded space. In particular, this work analyzes the performance characteristics of flesh toning, face detection and face recognition codes based on well known algorithms. We show that the problem can be decomposed into a pipeline of filters which could lead to efficient implementations as stream processors. With better than 92% hit rate for a modest 16KB L1 data cache, the algorithms have memory system behavior commensurate with embedded processors. However, our results indicate that their execution requirements strain the performance available on current embedded systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2003 IEEE International Conference on Communications (Cat. No.03CH37441)

自引率

0.00%

发文量