使用投影3D标签捕获具有稀疏信息像素的图像

2008 IEEE Virtual Reality Conference Pub Date : 2008-03-08 DOI:10.1109/VR.2008.4480744

Li Zhang, N. Subramaniam, Robert Lin, R. Raskar, S. Nayar

{"title":"使用投影3D标签捕获具有稀疏信息像素的图像","authors":"Li Zhang, N. Subramaniam, Robert Lin, R. Raskar, S. Nayar","doi":"10.1109/VR.2008.4480744","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a novel imaging system that enables the capture of photos and videos with sparse informational pixels. Our system is based on the projection and detection of 3D optical tags. We use an infrared (IR) projector to project temporally-coded (blinking) dots onto selected points in a scene. These tags are invisible to the human eye, but appear as clearly visible time-varying codes to an IR photosensor. As a proof of concept, we have built a prototype camera system (consisting of co-located visible and IR sensors) to simultaneously capture visible and IR images. When a user takes an image of a tagged scene using such a camera system, all the scene tags that are visible from the system's viewpoint are detected. In addition, tags that lie in the field of view but are occluded, and ones that lie just outside the field of view, are also automatically generated for the image. Associated with each tagged pixel is its 3D location and the identity of the object that the tag falls on. Our system can interface with conventional image recognition methods for efficient scene authoring, enabling objects in an image to be robustly identified using cheap cameras, minimal computations, and no domain knowledge. We demonstrate several applications of our system, including, photo-browsing, e-commerce, augmented reality, and objection localization.","PeriodicalId":173744,"journal":{"name":"2008 IEEE Virtual Reality Conference","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Capturing Images with Sparse Informational Pixels using Projected 3D Tags\",\"authors\":\"Li Zhang, N. Subramaniam, Robert Lin, R. Raskar, S. Nayar\",\"doi\":\"10.1109/VR.2008.4480744\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a novel imaging system that enables the capture of photos and videos with sparse informational pixels. Our system is based on the projection and detection of 3D optical tags. We use an infrared (IR) projector to project temporally-coded (blinking) dots onto selected points in a scene. These tags are invisible to the human eye, but appear as clearly visible time-varying codes to an IR photosensor. As a proof of concept, we have built a prototype camera system (consisting of co-located visible and IR sensors) to simultaneously capture visible and IR images. When a user takes an image of a tagged scene using such a camera system, all the scene tags that are visible from the system's viewpoint are detected. In addition, tags that lie in the field of view but are occluded, and ones that lie just outside the field of view, are also automatically generated for the image. Associated with each tagged pixel is its 3D location and the identity of the object that the tag falls on. Our system can interface with conventional image recognition methods for efficient scene authoring, enabling objects in an image to be robustly identified using cheap cameras, minimal computations, and no domain knowledge. We demonstrate several applications of our system, including, photo-browsing, e-commerce, augmented reality, and objection localization.\",\"PeriodicalId\":173744,\"journal\":{\"name\":\"2008 IEEE Virtual Reality Conference\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-03-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE Virtual Reality Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/VR.2008.4480744\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Virtual Reality Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VR.2008.4480744","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

摘要

在本文中，我们提出了一种新的成像系统，可以捕获具有稀疏信息像素的照片和视频。我们的系统是基于三维光学标签的投影和检测。我们使用红外(IR)投影仪将时间编码(闪烁)点投射到场景中的选定点上。这些标签对人眼来说是不可见的，但对红外光敏器来说却是清晰可见的时变代码。作为概念验证，我们已经建立了一个原型相机系统(由共存的可见光和红外传感器组成)，以同时捕获可见光和红外图像。当用户使用这样的相机系统拍摄标记场景的图像时，所有从系统视点可见的场景标签都会被检测到。此外，位于视场内但被遮挡的标签，以及位于视场外的标签，也会自动为图像生成。与每个标记的像素相关联的是其3D位置和标记所落对象的身份。我们的系统可以与传统的图像识别方法接口，以实现高效的场景创作，使图像中的物体能够使用廉价的相机，最小的计算量和无领域知识进行鲁棒识别。我们演示了我们的系统的几个应用，包括照片浏览、电子商务、增强现实和异议定位。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Capturing Images with Sparse Informational Pixels using Projected 3D Tags

In this paper, we propose a novel imaging system that enables the capture of photos and videos with sparse informational pixels. Our system is based on the projection and detection of 3D optical tags. We use an infrared (IR) projector to project temporally-coded (blinking) dots onto selected points in a scene. These tags are invisible to the human eye, but appear as clearly visible time-varying codes to an IR photosensor. As a proof of concept, we have built a prototype camera system (consisting of co-located visible and IR sensors) to simultaneously capture visible and IR images. When a user takes an image of a tagged scene using such a camera system, all the scene tags that are visible from the system's viewpoint are detected. In addition, tags that lie in the field of view but are occluded, and ones that lie just outside the field of view, are also automatically generated for the image. Associated with each tagged pixel is its 3D location and the identity of the object that the tag falls on. Our system can interface with conventional image recognition methods for efficient scene authoring, enabling objects in an image to be robustly identified using cheap cameras, minimal computations, and no domain knowledge. We demonstrate several applications of our system, including, photo-browsing, e-commerce, augmented reality, and objection localization.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2008 IEEE Virtual Reality Conference

自引率

0.00%

发文量