Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95): Latest Papers

Metric calibration of a stereo rig

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI: 10.1109/WVRS.1995.476857

Andrew Zisserman, P. Beardsley, I. Reid

Citations: 137

Relation between 3D invariants and 2D invariants

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI: 10.1109/WVRS.1995.476852

S. Maybank

Citations: 19

Shape tensors for efficient and learnable indexing

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI: 10.1109/WVRS.1995.476853

D. Weinshall, M. Werman, A. Shashua

Abstract: Multi-point geometry: The geometry of 1 point in N images under perspective projection has been thoroughly investigated, identifying bilinear, trilinear, and quadrilinear relations between the projections of 1 point in 2, 3 and 4 frames respectively. The dual problem, the geometry of N points in 1 image, has been studied mostly in the context of object recognition, often assuming weak perspective or affine projection. We provide here a complete description of this problem. We employ a formalism in which multi-frame and multi-point geometries appear in symmetry: points and projections are interchangeable. We then derive bilinear equations for 6 points (dual to 4-frame geometry), trilinear equations for 7 points (dual to 3-frame geometry), and quadrilinear equations for 8 points (dual to the epipolar geometry). We show that the quadrilinear equations are dependent on the bilinear and trilinear equations, and that adding more points will not generate any new equation. The new equations are used to design new algorithms for the reconstruction of shape from many frames, and for learning invariant relations for indexing into a database.

Citations: 19

Layered representations for vision and video

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI: 10.1109/WVRS.1995.476846

E. Adelson

Abstract: Human vision, machine vision, and image coding, each demand representations that are useful and efficient. The best-established techniques today are based on low-level processing. Future systems for image analysis and image coding will increasingly use image representations that involve such concepts as surfaces, lighting, transparency, etc. These representations fall in the domain of "mid-level" vision, and there is accumulating evidence of their importance in human vision. By representing images with these more sophisticated vocabularies we can increase the flexibility and efficiency of our vision and image coding systems. We are developing systems that decompose image sequences into overlapping layers, rather like the "cels" used by a traditional animator. These layers are ordered in depth, sliding over one another and being combined according to the rules of transparency and occlusion. Using the layered representation we can achieve greatly improved motion analysis and image segmentation. By applying layers to image coding we can achieve data compression far better than MPEG, and achieve frame-rate independence as a side benefit. Moreover, the image sequence is decomposed in a meaningful way, which allows flexible image editing and access.
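
The abstract above describes layers that are ordered in depth, slide over one another, and are combined by transparency and occlusion. A minimal sketch of that idea follows, assuming 1-D layers whose pixels are (intensity, alpha) pairs and a standard back-to-front alpha blend. The function names `composite_back_to_front` and `shift` are hypothetical and are not taken from the paper; the paper's actual decomposition and coding methods are not reproduced here.

```python
from typing import List, Tuple

Pixel = Tuple[float, float]  # (intensity, alpha), both in [0, 1]


def shift(layer: List[Pixel], dx: int) -> List[Pixel]:
    """Translate a layer by dx pixels (positive = right).

    Pixels uncovered by the shift are filled with fully transparent values.
    """
    width = len(layer)
    transparent = (0.0, 0.0)
    return [layer[x - dx] if 0 <= x - dx < width else transparent
            for x in range(width)]


def composite_back_to_front(layers: List[List[Pixel]]) -> List[float]:
    """Blend layers ordered from farthest to nearest.

    alpha = 1 models occlusion (the nearer layer hides what is behind it);
    0 < alpha < 1 models partial transparency.
    """
    width = len(layers[0])
    out = [0.0] * width  # assumed black backdrop behind all layers
    for layer in layers:  # farthest first, nearest last
        for x, (value, alpha) in enumerate(layer):
            out[x] = alpha * value + (1.0 - alpha) * out[x]
    return out


# Hypothetical example data: an opaque background and a moving foreground.
background = [(0.2, 1.0)] * 4
foreground = [(1.0, 1.0), (1.0, 0.5), (0.0, 0.0), (0.0, 0.0)]

frame0 = composite_back_to_front([background, foreground])
frame1 = composite_back_to_front([background, shift(foreground, 1)])
```

Here each frame is rebuilt from the same two layers plus a per-layer motion, which illustrates why a layered description can be compact: only the shift changes between frames.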

Citations: 25

Duality of reconstruction and positioning from projective views

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI: 10.1109/WVRS.1995.476856

Stefan Carlsson

Citations: 66

Virtualized reality: concepts and early results

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI: 10.1109/WVRS.1995.476854

T. Kanade, P J Narayanan, P. Rander

Abstract: The visual medium evolved from early paintings to the realistic paintings of the classical era to photographs. The medium of moving imagery started with motion pictures. Television and video recording advanced it to show action "live" or capture and playback later. In all of the above media, the view of the scene is determined at the transcription time, independent of the viewer. We have been developing a new visual medium called virtualized reality. It delays the selection of the viewing angle until view time, using techniques from computer vision and computer graphics. The visual event is captured using many cameras that cover the action from all sides. The 3D structure of the event, aligned with the pixels of the image, is computed for a few selected directions using a stereo technique. Triangulation and texture mapping enable the placement of a "soft-camera" to reconstruct the event from any new viewpoint. With a stereo-viewing system, virtualized reality allows a viewer to move freely in the scene, independent of the transcription angles used to record the scene. We describe the hardware and software setup in our "studio" to make virtualized reality movies. Examples are provided to demonstrate the effectiveness of the system.

Citations: 200

Physically-valid view synthesis by image interpolation

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI: 10.1109/WVRS.1995.476848

S. Seitz, C. Dyer

Abstract: Image warping is a popular tool for smoothly transforming one image to another. "Morphing" techniques based on geometric image interpolation create compelling visual effects, but the validity of such transformations has not been established. In particular, does 2D interpolation of two views of the same scene produce a sequence of physically valid in-between views of that scene? We describe a simple image rectification procedure which guarantees that interpolation does in fact produce valid views, under generic assumptions about visibility and the projection process. Towards this end, it is first shown that two basis views are sufficient to predict the appearance of the scene within a specific range of new viewpoints. Second, it is demonstrated that interpolation of the rectified basis images produces exactly this range of views. Finally, it is shown that generating this range of views is a theoretically well-posed problem, requiring neither knowledge of camera positions nor 3D scene reconstruction. A scanline algorithm for view interpolation is presented that requires only four user-provided feature correspondences to produce valid orthographic views. The quality of the resulting images is demonstrated with interpolations of real imagery.
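
The abstract above states that interpolating rectified basis images yields valid in-between views. As a rough illustration of the interpolation step only, the sketch below linearly blends the positions of corresponding points, assuming the two views have already been rectified so that each correspondence lies on the same scanline. The function name `interpolate_correspondences` and the sample coordinates are hypothetical; the rectification procedure and the scanline algorithm from the paper are not reproduced here.

```python
from typing import List, Tuple

Point = Tuple[float, float]  # (x, y) image coordinates


def interpolate_correspondences(pts0: List[Point],
                                pts1: List[Point],
                                s: float) -> List[Point]:
    """Linearly interpolate matched point positions between two views.

    s = 0 returns the positions in the first view, s = 1 those in the
    second view, and values in between give the in-between view.
    Assumes pts0[i] and pts1[i] are projections of the same scene point
    in two views that were rectified beforehand.
    """
    if len(pts0) != len(pts1):
        raise ValueError("point lists must have the same length")
    if not 0.0 <= s <= 1.0:
        raise ValueError("s must lie in [0, 1]")
    return [((1.0 - s) * x0 + s * x1, (1.0 - s) * y0 + s * y1)
            for (x0, y0), (x1, y1) in zip(pts0, pts1)]


# Hypothetical rectified correspondences: y is equal within each pair,
# so only the x coordinate (the disparity direction) changes.
view0 = [(0.0, 0.0), (4.0, 2.0)]
view1 = [(2.0, 0.0), (8.0, 2.0)]

halfway = interpolate_correspondences(view0, view1, 0.5)
quarter = interpolate_correspondences(view0, view1, 0.25)
```

Because the pairs share a scanline, interpolated points stay on that scanline, which is what makes a per-scanline implementation possible.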

Citations: 190

Multiframe structure from motion in perspective

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI: 10.1109/WVRS.1995.476855

J. Oliensis

Citations: 27

Representation of scenes from collections of images

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI: 10.1109/WVRS.1995.476847

R. Kumar, P. Anandan, M. Irani, J. Bergen, K. Hanna

Abstract: The goal of computer vision is to extract information about the world from collections of images. This information might be used to recognize or manipulate objects, to control movement through the environment, to measure or determine the condition of objects, and for many other purposes. The goal of this paper is to consider the representation of information derived from a collection of images and how it may support some of these tasks. By "collection of images" we mean any set of images relevant to a given scene. This includes video sequences, multiple images from a single still camera, or multiple images from different cameras. The central thesis of this paper is that the traditional approach to representation of information about scenes by relating each image to an abstract three dimensional coordinate system may not always be appropriate. An approach that more directly represents the relationships among the collection of images has a number of advantages. These relationships can also be computed using practical and efficient algorithms. We present a hierarchical framework for scene representation. We develop the algorithms used to build these representations and demonstrate results on real image sequences. Finally, the application of these representations to real world problems is discussed.

Citations: 131

Direct methods for visual scene reconstruction

Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95) Pub Date : 1995-06-21 DOI: 10.1109/WVRS.1995.476849

R. Szeliski, S. B. Kang

Citations: 136