Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95): Latest Articles

Metric calibration of a stereo rig
Andrew Zisserman, P. Beardsley, I. Reid
{"title":"Metric calibration of a stereo rig","authors":"Andrew Zisserman, P. Beardsley, I. Reid","doi":"10.1109/WVRS.1995.476857","DOIUrl":"https://doi.org/10.1109/WVRS.1995.476857","url":null,"abstract":"Describes a method to determine affine and metric calibration for a stereo rig. The method does not involve the use of calibration objects or special motions, but simply a single general motion of the rig with fixed parameters (i.e. camera parameters and relative orientation of the camera pair). The novel aspects of this work are: first, relating the distinguished objects of Euclidean geometry to fixed entities of a Euclidean transformation matrix; second, showing that these fixed entities are accessible from the conjugate Euclidean transformation arising from the projective transformation of the structure under a motion of the fixed stereo rig; and third, a robust and automatic implementation of the method. Results are included of affine and metric calibration and structure recovery using images of real scenes.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122488444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 137
Relation between 3D invariants and 2D invariants
S. Maybank
{"title":"Relation between 3D invariants and 2D invariants","authors":"S. Maybank","doi":"10.1109/WVRS.1995.476852","DOIUrl":"https://doi.org/10.1109/WVRS.1995.476852","url":null,"abstract":"Polynomial relations are established between the invariants of certain mixed sets of points and lines and the invariants of their projected images. The relations are obtained using the properties of a rational curve, in fact a twisted cubic, which is a covariant of the given set of points and lines.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114925894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 19
Shape tensors for efficient and learnable indexing
D. Weinshall, M. Werman, A. Shashua
{"title":"Shape tensors for efficient and learnable indexing","authors":"D. Weinshall, M. Werman, A. Shashua","doi":"10.1109/WVRS.1995.476853","DOIUrl":"https://doi.org/10.1109/WVRS.1995.476853","url":null,"abstract":"Multi-point geometry: The geometry of 1 point in N images under perspective projection has been thoroughly investigated, identifying bilinear, trilinear, and quadrilinear relations between the projections of 1 point in 2, 3 and 4 frames respectively. The dual problem-the geometry of N points in 1 image-has been studied mostly in the context of object recognition, often assuming weak perspective or affine projection. We provide here a complete description of this problem. We employ a formalism in which multiframe and multi-point geometries appear in symmetry. Points and projections are interchangeable. We then derive bilinear equations for 6 points (dual to 4-frame geometry), trilinear equations for 7 points (dual to 3-frame geometry), and quadrilinear equations for 8 points (dual to the epipolar geometry). We show that the quadrilinear equations are dependent on the bilinear and trilinear equations, and we show that adding more points will not generate any new equation. 
The new equations are used to design new algorithms for the reconstruction of shape from many frames, and for learning invariant relations for indexing into a database.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115979850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 19
Layered representations for vision and video
E. Adelson
{"title":"Layered representations for vision and video","authors":"E. Adelson","doi":"10.1109/WVRS.1995.476846","DOIUrl":"https://doi.org/10.1109/WVRS.1995.476846","url":null,"abstract":"Human vision, machine vision, and image coding, each demand representations that are useful and efficient. The best-established techniques today are based on low-level processing. Future systems for image analysis and image coding will increasingly use image representations that involve such concepts as surfaces, lighting, transparency, etc. These representations fall in the domain of \"mid-level\" vision, and there is accumulating evidence of their importance in human vision. By representing images with these more sophisticated vocabularies we can increase the flexibility and efficiency of our vision and image coding systems. We are developing systems that decompose image sequences into overlapping layers, rather like the \"cels\" used by a traditional animator. These layers are ordered in depth, sliding over one another and being combined according to the rules of transparency and occlusion. Using the layered representation we can achieve greatly improved motion analysis and image segmentation. By applying layers to image coding we can achieve data compression far better than MPEG, and achieve frame-rate independence as a side benefit. 
Moreover, the image sequence is decomposed in a meaningful way, which allows flexible image editing and access.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114786959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 25
Duality of reconstruction and positioning from projective views
Stefan Carlsson
{"title":"Duality of reconstruction and positioning from projective views","authors":"Stefan Carlsson","doi":"10.1109/WVRS.1995.476856","DOIUrl":"https://doi.org/10.1109/WVRS.1995.476856","url":null,"abstract":"Given multiple image data from a set of points in 3D, there are two fundamental questions that can be addressed: (1) What is the structure of the set of points in 3D? (2) What are the positions of the cameras relative to the points? In this paper, we show that for projective views and with structure and position defined modulo linear transformations, these problems are dual in the sense that their solution arises from constraint equations where space point and camera positions occur in a reciprocal way. The problem of computing camera positions from m points in n views can be solved with the same algorithm as the problem of directly reconstructing n+4 points in m-4 views. This unifies different approaches for projective reconstruction: methods based on external calibration and direct methods exploiting constraints that exist between space and image invariants.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126799790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 66
Virtualized reality: concepts and early results
T. Kanade, P J Narayanan, P. Rander
{"title":"Virtualized reality: concepts and early results","authors":"T. Kanade, P J Narayanan, P. Rander","doi":"10.1109/WVRS.1995.476854","DOIUrl":"https://doi.org/10.1109/WVRS.1995.476854","url":null,"abstract":"The visual medium evolved from early paintings to the realistic paintings of the classical era to photographs. The medium of moving imagery started with motion pictures. Television and video recording advanced it to show action \"live\" or capture and playback later. In all of the above media, the view of the scene is determined at the transcription time, independent of the viewer. We have been developing a new visual medium called virtualized reality. It delays the selection of the viewing angle until view time, using techniques from computer vision and computer graphics. The visual event is captured using many cameras that cover the action from all sides. The 3D structure of the event, aligned with the pixels of the image, is computed for a few selected directions using a stereo technique. Triangulation and texture mapping enable the placement of a \"soft-camera\" to reconstruct the event from any new viewpoint. With a stereo-viewing system, virtualized reality allows a viewer to move freely in the scene, independent of the transcription angles used to record the scene. We describe the hardware and software setup in our \"studio\" to make virtualized reality movies. 
Examples are provided to demonstrate the effectiveness of the system.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127267842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 200
Physically-valid view synthesis by image interpolation
S. Seitz, C. Dyer
{"title":"Physically-valid view synthesis by image interpolation","authors":"S. Seitz, C. Dyer","doi":"10.1109/WVRS.1995.476848","DOIUrl":"https://doi.org/10.1109/WVRS.1995.476848","url":null,"abstract":"Image warping is a popular tool for smoothly transforming one image to another. \"Morphing\" techniques based on geometric image interpolation create compelling visual effects, but the validity of such transformations has not been established. In particular, does 2D interpolation of two views of the same scene produce a sequence of physically valid in-between views of that scene? We describe a simple image rectification procedure which guarantees that interpolation does in fact produce valid views, under generic assumptions about visibility and the projection process. Towards this end, it is first shown that two basis views are sufficient to predict the appearance of the scene within a specific range of new viewpoints. Second, it is demonstrated that interpolation of the rectified basis images produces exactly this range of views. Finally, it is shown that generating this range of views is a theoretically well-posed problem, requiring neither knowledge of camera positions nor 3D scene reconstruction. A scanline algorithm for view interpolation is presented that requires only four user-provided feature correspondences to produce valid orthographic views. 
The quality of the resulting images is demonstrated with interpolations of real imagery.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131126519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 190
Multiframe structure from motion in perspective
J. Oliensis
{"title":"Multiframe structure from motion in perspective","authors":"J. Oliensis","doi":"10.1109/WVRS.1995.476855","DOIUrl":"https://doi.org/10.1109/WVRS.1995.476855","url":null,"abstract":"A new approach to multiframe structure from motion for point features is presented. Unlike previous approaches, it gives robust reconstruction in situations commonly encountered in outdoor robot navigation, for general motion and with large perspective effects. Under the appropriate conditions, the algorithm provably gives the correct reconstruction. The typical computation time is seconds. It is argued that the new approach, combined with previous algorithms valid in other domains (e.g., Tomasi's algorithm), gives a general method for structure from motion.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130402591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 27
Representation of scenes from collections of images
R. Kumar, P. Anandan, M. Irani, J. Bergen, K. Hanna
{"title":"Representation of scenes from collections of images","authors":"R. Kumar, P. Anandan, M. Irani, J. Bergen, K. Hanna","doi":"10.1109/WVRS.1995.476847","DOIUrl":"https://doi.org/10.1109/WVRS.1995.476847","url":null,"abstract":"The goal of computer vision is to extract information about the world from collections of images. This information might be used to recognize or manipulate objects, to control movement through the environment, to measure or determine the condition of objects, and for many other purposes. The goal of this paper is to consider the representation of information derived from a collection of images and how it may support some of these tasks. By \"collection of images\" we mean any set of images relevant to a given scene. This includes video sequences, multiple images from a single still camera, or multiple images from different cameras. The central thesis of this paper is that the traditional approach to representation of information about scenes by relating each image to an abstract three dimensional coordinate system may not always be appropriate. An approach that more directly represents the relationships among the collection of images has a number of advantages. These relationships can also be computed using practical and efficient algorithms. We present a hierarchical framework for scene representation. We develop the algorithms used to build these representations and demonstrate results on real image sequences. 
Finally, the application of these representations to real world problems is discussed.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114237279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 131
Direct methods for visual scene reconstruction
R. Szeliski, S. B. Kang
{"title":"Direct methods for visual scene reconstruction","authors":"R. Szeliski, S. B. Kang","doi":"10.1109/WVRS.1995.476849","DOIUrl":"https://doi.org/10.1109/WVRS.1995.476849","url":null,"abstract":"There has been a lot of activity recently surrounding the reconstruction of photorealistic 3D scenes and high-resolution images from video sequences. We present some of our recent work in this area, which is based on the registration of multiple images (views) in a projective framework. Unlike most other techniques, we do not rely on special features to form a projective basis. Instead, we directly solve a least-squares estimation problem in the unknown structure and motion parameters, which leads to statistically optimal estimates. We discuss algorithms for both constructing planar and panoramic mosaics, and for projective depth recovery. We also speculate about the ultimate usefulness of projective approaches to visual scene reconstruction.","PeriodicalId":447791,"journal":{"name":"Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130053579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 136