Shape Representation

Marios Papas
{"title":"Shape Representation","authors":"Marios Papas","doi":"10.1201/9780849379406.ch5","DOIUrl":null,"url":null,"abstract":"After detecting the shape of an object, contour, or connected component, we should represent it in a concise and informative way. The simplest shape representation is a list of pixel coordinates. It can easy be obtained from the detection algorithm and includes precise spatial information. However, it is not ideal for further processing by higher-level algorithms for purposes such as object recognition or scene understanding. It does not characterize the shape in a useful way or indicate any shape features such as curvatures, slopes, or angles. Furthermore, this type of representation only describes a shape at a specific position, orientation, and scale. In summary, a list of coordinates describes a shape in a way that is too low-level and local to be immediately useful for high-level processing. We know that the early stages of the human visual processing stream use simple, local features for representing visual input. As information is passed on to higher levels, representations become more and more high-level, abstract, and location invariant. Ideally, our computer vision systems should do the same. It is would be highly inefficient to stick with pixel-based representations throughout the processing hierarchy and then attempt to accomplish the highest-level task based on such input. The highest level has to solve the most intricate problems and needs the lower levels to prepare the visual information in such a way that its tasks become computationally feasible. In this chapter, we will explore a variety of methods for representing shape at higher levels. Each of these techniques operates at a different level and emphasizes particular shape features. When choosing a representation for a computer vision system, we should of course always keep in mind the purpose of the system. Before starting any implementation, we should have developed a plan for all processing stages, their functionality, and the interfaces between them.","PeriodicalId":243930,"journal":{"name":"Shape Classification and Analysis","volume":"156 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Shape Classification and Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1201/9780849379406.ch5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

After detecting the shape of an object, contour, or connected component, we should represent it in a concise and informative way. The simplest shape representation is a list of pixel coordinates. It can easy be obtained from the detection algorithm and includes precise spatial information. However, it is not ideal for further processing by higher-level algorithms for purposes such as object recognition or scene understanding. It does not characterize the shape in a useful way or indicate any shape features such as curvatures, slopes, or angles. Furthermore, this type of representation only describes a shape at a specific position, orientation, and scale. In summary, a list of coordinates describes a shape in a way that is too low-level and local to be immediately useful for high-level processing. We know that the early stages of the human visual processing stream use simple, local features for representing visual input. As information is passed on to higher levels, representations become more and more high-level, abstract, and location invariant. Ideally, our computer vision systems should do the same. It is would be highly inefficient to stick with pixel-based representations throughout the processing hierarchy and then attempt to accomplish the highest-level task based on such input. The highest level has to solve the most intricate problems and needs the lower levels to prepare the visual information in such a way that its tasks become computationally feasible. In this chapter, we will explore a variety of methods for representing shape at higher levels. Each of these techniques operates at a different level and emphasizes particular shape features. When choosing a representation for a computer vision system, we should of course always keep in mind the purpose of the system. Before starting any implementation, we should have developed a plan for all processing stages, their functionality, and the interfaces between them.
形状表示
在检测到物体的形状、轮廓或连接的组件后,我们应该以简洁和信息的方式表示它。最简单的形状表示是一个像素坐标列表。它可以很容易地从检测算法中获得,并且包含精确的空间信息。然而,它并不适合用于物体识别或场景理解等目的的高级算法的进一步处理。它不能以有用的方式描述形状,也不能表示任何形状特征,如曲率、斜率或角度。此外,这种类型的表示只描述特定位置、方向和比例的形状。总而言之,坐标列表描述形状的方式过于低级和局部,无法立即用于高级处理。我们知道,人类视觉处理流的早期阶段使用简单的局部特征来表示视觉输入。随着信息被传递到更高的层次,表示变得越来越高级、抽象和位置不变。理想情况下,我们的计算机视觉系统也应该这样做。在整个处理层次结构中坚持使用基于像素的表示,然后尝试基于这样的输入来完成最高级别的任务,这将是非常低效的。最高层次必须解决最复杂的问题,并需要较低层次准备视觉信息,使其任务在计算上可行。在本章中,我们将探讨在更高层次上表示形状的各种方法。这些技术中的每一种都在不同的层次上运作,并强调特定的形状特征。在为计算机视觉系统选择表示时,我们当然应该始终牢记系统的目的。在开始任何实现之前,我们应该为所有处理阶段、它们的功能以及它们之间的接口制定一个计划。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信