Jean-Yves Guillemaut, A. Hilton, J. Starck, J. Kilner, O. Grau
{"title":"A Bayesian Framework for Simultaneous Matting and 3D Reconstruction","authors":"Jean-Yves Guillemaut, A. Hilton, J. Starck, J. Kilner, O. Grau","doi":"10.1109/3DIM.2007.3","DOIUrl":"https://doi.org/10.1109/3DIM.2007.3","url":null,"abstract":"Conventional approaches to 3D scene reconstruction often treat matting and reconstruction as two separate problems, with matting a prerequisite to reconstruction. The problem with such an approach is that it requires taking irreversible decisions at the first stage, which may translate into reconstruction errors at the second stage. In this paper, we propose an approach which attempts to solve both problems jointly, thereby avoiding this limitation. A general Bayesian formulation for estimating opacity and depth with respect to a reference camera is developed. In addition, it is demonstrated that in the special case of binary opacity values (background/foreground) and discrete depth values, a global solution can be obtained via a single graph-cut computation. We demonstrate the application of the method to novel view synthesis in the case of a large-scale outdoor scene. An experimental comparison with a two-stage approach based on chroma-keying and shape-from-silhouette illustrates the advantages of the new method.","PeriodicalId":442311,"journal":{"name":"Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007)","volume":"189 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124183396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Pose Determination By PotentialWell Space Embedding","authors":"Limin Shang, M. Greenspan","doi":"10.1109/3DIM.2007.40","DOIUrl":"https://doi.org/10.1109/3DIM.2007.40","url":null,"abstract":"A novel algorithm is introduced to estimate the pose of objects from sparse range data. Pose determination is tackled by employing the ICP algorithm to find corresponding local minima between a preprocessed model and the runtime data. Unlike other existing algorithms that try to avoid local minima, here local minima are used as effective feature vectors for generating multiple hypotheses of the pose. These hypotheses are then examined and verified using the bounded Hough transform, which is more robust than using the registration error directly. Only a small number of iterations (e.g., 5) is needed for each ICP at both preprocessing and runtime, which makes the technique efficient. The algorithm has been implemented and tested on a variety of objects, including freeform models, using both simulated and real data from Lidar and stereovision sensors. The experimental results show the technique to be both effective and efficient, executing at multiple frames per second on standard hardware. In addition, it functions well with very sparse data, possibly comprising only hundreds of points per frame, and it is also robust to measurement error and outliers.","PeriodicalId":442311,"journal":{"name":"Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134300924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Three-dimensional reconstruction using the perpendicularity constraint","authors":"B. Boufama, A. Habed","doi":"10.1109/3DIM.2007.58","DOIUrl":"https://doi.org/10.1109/3DIM.2007.58","url":null,"abstract":"This paper proposes a new method for calculating the Euclidean 3D structure of a scene using only two uncalibrated images and the basic Euclidean constraint resulting from the 3D perpendicularity. Instead of using the nonlinear self-calibration methods to estimate the intrinsic parameters, a linear search for the best values of these parameters is carried out. The criteria measuring the goodness of the intrinsic parameters' values is based on the Euclidean quality of the calculated 3D reconstruction using the 3D perpendicularity constraint. Experimental results on both simulated and real data have validated our method and have shown that the proposed method could be a better alternative to calibration-based reconstruction.","PeriodicalId":442311,"journal":{"name":"Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007)","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133518728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dual-Mode Deformable Models for Free-Viewpoint Video of Sports Events","authors":"J. Kilner, J. Starck, A. Hilton, O. Grau","doi":"10.1109/3DIM.2007.22","DOIUrl":"https://doi.org/10.1109/3DIM.2007.22","url":null,"abstract":"Generating free-viewpoint video in outdoor sports environments is currently an unsolved problem due to difficulties in obtaining accurate background segmentation and camera calibration. This paper introduces a technique for the reconstruction of a scene in the presence of these errors. We tackle the issues of reconstruction completeness, and accuracy of surface shape and appearance. We introduce the concept of the conservative visual hull as a technique to improve reconstruction completeness. We then present a view-dependent surface optimisation technique using deformable models to improve surface shape and appearance. We contribute a novel dual-mode snake algorithm that is robust to noise and demonstrates reduced dependence on parameterisation by separating the search of the solution space from the data fitting. We conclude by presenting results of this technique along with a quantitative evaluation against other reconstruction techniques using a leave-one- out data set.","PeriodicalId":442311,"journal":{"name":"Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131122502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
K. Hara, Yuuki Kabashima, Y. Iwashita, R. Kurazume, T. Hasegawa
{"title":"Robust 2D-3D alignment based on geometrical consistency","authors":"K. Hara, Yuuki Kabashima, Y. Iwashita, R. Kurazume, T. Hasegawa","doi":"10.1109/3DIM.2007.44","DOIUrl":"https://doi.org/10.1109/3DIM.2007.44","url":null,"abstract":"This paper presents a new registration algorithm of a 2D image and a 3D geometrical model, which is robust for initial registration errors, for reconstructing a realistic 3D model of indoor scene settings. One of the typical techniques of pose estimation of a 3D model in a 2D image is the method based on the correspondences between 2D photometrical edges and 3D geometrical edges projected on the 2D image. However, for indoor settings, features extracted on the 2D image and jump edges of the geometrical model, which can be extracted robustly, are limited. Therefore, it is difficult to find corresponding edges between the 2D image and the 3D model correctly. For this reason, in most cases, the relative position has to be manually set close to correct position beforehand. To overcome this problem, in the proposed method, first the relative pose is roughly estimated by utilizing geometrical consistencies of back-projected 2D photometrical edges on a 3D model. Next, the edge-based method is applied for the precise pose estimation after the above estimation procedure is converged. The performance of the proposed method is successfully demonstrated with some experiments using simulated models of indoor scene settings and actual environments measured by range and image sensors.","PeriodicalId":442311,"journal":{"name":"Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132415013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic Registration for Model Building using Variable Dimensional Local Shape Descriptors","authors":"B. Taati, M. Bondy, P. Jasiobedzki, M. Greenspan","doi":"10.1109/3DIM.2007.14","DOIUrl":"https://doi.org/10.1109/3DIM.2007.14","url":null,"abstract":"A new set of variable dimensional local shape descriptors for 3D registration is proposed and applied to 3D model building from range images. The descriptors are based on a large set of properties represented as high dimensional histograms. The novelty of the method is two fold: first, it offers a generalized platform for a large class of local shape descriptors; second, unlike previously devised descriptors that are of low dimensionality and compact size, these descriptors are high dimensional and highly discriminating. The new approach suggests investing more into descriptor generation and comparison and in return gaining a higher percentage of inliers in the set of hypothesized point matches across the images being registered. This in turn drastically reduces the required number of RANSAC iterations for finding the alignment between two images, as is confirmed by experimentation in a 3D model building application. It is also shown that the correct choice of properties can increase the effectiveness of feature correspondences, thereby increasing the possible acquisition angle between overlapping images.","PeriodicalId":442311,"journal":{"name":"Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128737626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Marcel Germann, Michael D. Breitenstein, H. Pfister, I. Park
{"title":"Automatic Pose Estimation for Range Images on the GPU","authors":"Marcel Germann, Michael D. Breitenstein, H. Pfister, I. Park","doi":"10.1109/3DIM.2007.13","DOIUrl":"https://doi.org/10.1109/3DIM.2007.13","url":null,"abstract":"Object pose (location and orientation) estimation is a common task in many computer vision applications. Although many methods exist, most algorithms need manual initialization and lack robustness to illumination variation, appearance change, and partial occlusions. We propose a fast method for automatic pose estimation without manual initialization based on shape matching of a 3D model to a range image of the scene. We developed a new error function to compare the input range image to pre-computed range maps of the 3D model. We use the tremendous data- parallel processing performance of modern graphics hardware to evaluate and minimize the error function on many range images in parallel. Our algorithm is simple and accurately estimates the pose of partially occluded objects in cluttered scenes in about one second.","PeriodicalId":442311,"journal":{"name":"Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007)","volume":"168 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128349221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data-Driven Feature-Based 3D Face Synthesis","authors":"Yu Zhang, Shuhong Xu","doi":"10.1109/3DIM.2007.17","DOIUrl":"https://doi.org/10.1109/3DIM.2007.17","url":null,"abstract":"This paper presents a novel data-driven method for creating varied realistic face models by synthesizing a set of facial features according to intuitive high-level control parameters. Our method takes as examples 3D face scans in order to exploit the variations presented in the real faces of individuals. We use an automatic model fitting approach for the 3D registration problem. Once we have a common surface representation for each example, we form feature shape spaces by applying principal component analysis (PCA) to the data sets of facial feature shapes. Using PCA coefficients as a compact shape representation, we approach the shape synthesis problem by forming scattered data interpolation functions that are devoted to the generation of desired shape by taking the anthropometric parameters as input. The correspondence among all exemplar textures is obtained by parameterizing a 3D generic mesh over a 2D image domain. The new feature texture with desired attributes is synthesized by interpolating the example textures. Apart from an initial tuning of feature point positions and assignment of texture attribute values, our method is fully automated.","PeriodicalId":442311,"journal":{"name":"Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115091769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Tensor Algebraic Approach to Image Synthesis, Analysis and Recognition","authors":"M. Alex O. Vasilescu, Demetri Terzopoulos","doi":"10.1109/3DIM.2007.9","DOIUrl":"https://doi.org/10.1109/3DIM.2007.9","url":null,"abstract":"We review our multilinear (tensor) algebraic framework for image synthesis, analysis, and recognition. Natural images result from the multifactor interaction between the imaging process, the illumination, and the scene geometry. Numerical multilinear algebra provides a principled approach to disentangling and explicitly representing the essential factors or modes of image ensembles. Our multilinear image modeling technique employs a tensor extension of the conventional matrix singular value decomposition (SVD), known as the N-mode SVD. This leads us to a multilinear generalization of principal components analysis (PCA) and a novel multilinear generalization of independent components analysis (ICA). As example applications, we tackle currently significant problems in computer graphics, computer vision, and pattern recognition. In particular, we address image-based rendering, specifically the multilinear synthesis of images of textured surfaces for varying viewpoint and illumination, as well as the multilinear analysis and recognition of facial images under variable face shape, view, and illumination conditions. These new multilinear (tensor) algebraic methods outperform their conventional linear (matrix) algebraic counterparts.","PeriodicalId":442311,"journal":{"name":"Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128668893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Outlier Robust ICP for Minimizing Fractional RMSD","authors":"J. M. Phillips, Ran Liu, Carlo Tomasi","doi":"10.1109/3DIM.2007.39","DOIUrl":"https://doi.org/10.1109/3DIM.2007.39","url":null,"abstract":"We describe a variation of the iterative closest point (ICP) algorithm for aligning two point sets under a set of transformations. Our algorithm is superior to previous algorithms because (1) in determining the optimal alignment, it identifies and discards likely outliers in a statistically robust manner, and (2) it is guaranteed to converge to a locally optimal solution. To this end, we formalize a new distance measure, fractional root mean squared distance (FRMSD), which incorporates the fraction of inliers into the distance function. Our framework can easily incorporate most techniques and heuristics from modern registration algorithms. We experimentally validate our algorithm against previous techniques on 2 and 3 dimensional data exposed to a variety of outlier types.","PeriodicalId":442311,"journal":{"name":"Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122861333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}