利用基于模型的视觉和距离图像识别复杂背景下的三维物体

Proceedings of the IEEE Southwest Symposium on Image Analysis and Interpretation Pub Date : 1994-04-21 DOI:10.1109/IAI.1994.336667

E. Natonek, C. Baur

{"title":"利用基于模型的视觉和距离图像识别复杂背景下的三维物体","authors":"E. Natonek, C. Baur","doi":"10.1109/IAI.1994.336667","DOIUrl":null,"url":null,"abstract":"One of the active research fields in computer vision is the recognition of complex 3D objects. The task of object recognition is tightly bound to background understanding or suppression. Current literature describes the top down approaches as promising but not complete and the bottom-up approaches as not robust. The paper describes a model based vision system in which a commercial 3D computer graphics system has been used for object modeling and visual clue generation. Given the computer generated model image, a conventional CCD camera image and the corresponding scanned 3D dense range map of the real scene, the object can be located in it. The paper deals with how this is done using newly developed segmentation algorithms extracting \"focus features\" from range images (depth map) of the scene. The system uses the image pyramid of resolution and prediction-verification process. First the authors generate a hypothesis in a low resolution description, giving rough clues for the object boundaries, position and orientation. These regions of interest are then used as the field of comparison with higher resolution models. Such an iterative process is repeated until a given threshold of similarity is reached. Next an intensity image of the model in the scene is created using the available a priori knowledge. Direct correlation is then performed between the model and the \"focus feature\" of the scene. Illustrative examples of object recognition in simple and complex scenes are presented.<<ETX>>","PeriodicalId":438137,"journal":{"name":"Proceedings of the IEEE Southwest Symposium on Image Analysis and Interpretation","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Recognition of 3-D objects on complex backgrounds using model based vision and range images\",\"authors\":\"E. Natonek, C. Baur\",\"doi\":\"10.1109/IAI.1994.336667\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the active research fields in computer vision is the recognition of complex 3D objects. The task of object recognition is tightly bound to background understanding or suppression. Current literature describes the top down approaches as promising but not complete and the bottom-up approaches as not robust. The paper describes a model based vision system in which a commercial 3D computer graphics system has been used for object modeling and visual clue generation. Given the computer generated model image, a conventional CCD camera image and the corresponding scanned 3D dense range map of the real scene, the object can be located in it. The paper deals with how this is done using newly developed segmentation algorithms extracting \\\"focus features\\\" from range images (depth map) of the scene. The system uses the image pyramid of resolution and prediction-verification process. First the authors generate a hypothesis in a low resolution description, giving rough clues for the object boundaries, position and orientation. These regions of interest are then used as the field of comparison with higher resolution models. Such an iterative process is repeated until a given threshold of similarity is reached. Next an intensity image of the model in the scene is created using the available a priori knowledge. Direct correlation is then performed between the model and the \\\"focus feature\\\" of the scene. Illustrative examples of object recognition in simple and complex scenes are presented.<<ETX>>\",\"PeriodicalId\":438137,\"journal\":{\"name\":\"Proceedings of the IEEE Southwest Symposium on Image Analysis and Interpretation\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1994-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the IEEE Southwest Symposium on Image Analysis and Interpretation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IAI.1994.336667\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the IEEE Southwest Symposium on Image Analysis and Interpretation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAI.1994.336667","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

复杂三维物体的识别是计算机视觉研究的热点之一。目标识别任务与背景理解或背景抑制紧密相关。目前的文献描述了自上而下的方法是有希望的，但不完整，自下而上的方法是不健全的。本文介绍了一种基于模型的视觉系统，该系统采用商用三维计算机图形系统进行对象建模和视觉线索生成。给定计算机生成的模型图像、常规CCD相机图像以及相应扫描的真实场景的三维密集距离图，就可以在其中定位物体。本文讨论了如何使用新开发的分割算法从场景的距离图像(深度图)中提取“焦点特征”。该系统采用图像金字塔的分辨率和预测验证过程。首先，作者在低分辨率描述中生成一个假设，给出物体边界、位置和方向的粗略线索。然后将这些感兴趣的区域用作与更高分辨率模型进行比较的领域。这样的迭代过程不断重复，直到达到给定的相似性阈值。接下来，使用可用的先验知识创建场景中模型的强度图像。然后在模型和场景的“焦点特征”之间进行直接关联。给出了简单和复杂场景中物体识别的示例

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Recognition of 3-D objects on complex backgrounds using model based vision and range images

One of the active research fields in computer vision is the recognition of complex 3D objects. The task of object recognition is tightly bound to background understanding or suppression. Current literature describes the top down approaches as promising but not complete and the bottom-up approaches as not robust. The paper describes a model based vision system in which a commercial 3D computer graphics system has been used for object modeling and visual clue generation. Given the computer generated model image, a conventional CCD camera image and the corresponding scanned 3D dense range map of the real scene, the object can be located in it. The paper deals with how this is done using newly developed segmentation algorithms extracting "focus features" from range images (depth map) of the scene. The system uses the image pyramid of resolution and prediction-verification process. First the authors generate a hypothesis in a low resolution description, giving rough clues for the object boundaries, position and orientation. These regions of interest are then used as the field of comparison with higher resolution models. Such an iterative process is repeated until a given threshold of similarity is reached. Next an intensity image of the model in the scene is created using the available a priori knowledge. Direct correlation is then performed between the model and the "focus feature" of the scene. Illustrative examples of object recognition in simple and complex scenes are presented.<>

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the IEEE Southwest Symposium on Image Analysis and Interpretation

自引率

0.00%

发文量