Proceedings. IEEE International Conference on Computer Vision最新文献

筛选
英文 中文
Intrinsic 3D Dynamic Surface Tracking based on Dynamic Ricci Flow and Teichmüller Map. 基于动态Ricci流和teichm<s:1> ller图的内禀三维动态表面跟踪。
Proceedings. IEEE International Conference on Computer Vision Pub Date : 2017-10-01 Epub Date: 2017-12-25 DOI: 10.1109/ICCV.2017.576
Xiaokang Yu, Na Lei, Yalin Wang, Xianfeng Gu
{"title":"Intrinsic 3D Dynamic Surface Tracking based on Dynamic Ricci Flow and Teichmüller Map.","authors":"Xiaokang Yu,&nbsp;Na Lei,&nbsp;Yalin Wang,&nbsp;Xianfeng Gu","doi":"10.1109/ICCV.2017.576","DOIUrl":"https://doi.org/10.1109/ICCV.2017.576","url":null,"abstract":"<p><p>3D dynamic surface tracking is an important research problem and plays a vital role in many computer vision and medical imaging applications. However, it is still challenging to efficiently register surface sequences which has large deformations and strong noise. In this paper, we propose a novel automatic method for non-rigid 3D dynamic surface tracking with surface Ricci flow and Teichmüller map methods. According to quasi-conformal Teichmüller theory, the Techmüller map minimizes the maximal dilation so that our method is able to automatically register surfaces with large deformations. Besides, the adoption of Delaunay triangulation and quadrilateral meshes makes our method applicable to low quality meshes. In our work, the 3D dynamic surfaces are acquired by a high speed 3D scanner. We first identified sparse surface features using machine learning methods in the texture space. Then we assign landmark features with different curvature settings and the Riemannian metric of the surface is computed by the dynamic Ricci flow method, such that all the curvatures are concentrated on the feature points and the surface is flat everywhere else. The registration among frames is computed by the Teichmüller mappings, which aligns the feature points with least angle distortions. We apply our new method to multiple sequences of 3D facial surfaces with large expression deformations and compare them with two other state-of-the-art tracking methods. The effectiveness of our method is demonstrated by the clearly improved accuracy and efficiency.</p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":"2017 ","pages":"5400-5408"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/ICCV.2017.576","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35903959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A geometric framework for statistical analysis of trajectories with distinct temporal spans. 具有不同时间跨度的轨迹统计分析的几何框架。
Proceedings. IEEE International Conference on Computer Vision Pub Date : 2017-10-01 Epub Date: 2017-12-25 DOI: 10.1109/iccv.2017.28
Rudrasis Chakraborty, Vikas Singh, Nagesh Adluru, Baba C Vemuri
{"title":"A geometric framework for statistical analysis of trajectories with distinct temporal spans.","authors":"Rudrasis Chakraborty,&nbsp;Vikas Singh,&nbsp;Nagesh Adluru,&nbsp;Baba C Vemuri","doi":"10.1109/iccv.2017.28","DOIUrl":"https://doi.org/10.1109/iccv.2017.28","url":null,"abstract":"<p><p><i>Analyzing data representing multifarious trajectories is central to the many fields in Science and Engineering; for example, trajectories representing a tennis serve, a gymnast's parallel bar routine, progression/remission of disease and so on. We present a novel geometric algorithm for performing statistical analysis of trajectories with distinct number of samples representing longitudinal (or temporal) data. A key feature of our proposal is that unlike existing schemes, our model is deployable in regimes where each participant provides a</i> different <i>number of acquisitions (trajectories have different number of sample points or temporal span). To achieve this, we develop a novel method involving the parallel transport of the tangent vectors along each given trajectory to the starting point of the respective trajectories and then use the span of the matrix whose columns consist of these vectors, to construct a linear subspace in</i> <b>R</b> <sup><i>m</i></sup> . <i>We then map these linear subspaces (possibly of distinct dimensions) of</i> <b>R</b> <sup><i>m</i></sup> <i>on to a single high dimensional hypersphere. This enables computing group statistics over trajectories by instead performing statistics on the hypersphere (equipped with a simpler geometry). Given a point on the hypersphere representing a trajectory, we also provide a \"reverse mapping\" algorithm to uniquely (under certain assumptions) reconstruct the subspace that corresponds to this point. Finally, by using existing algorithms for recursive Fréchet mean and exact principal geodesic analysis on the hypersphere, we present several experiments on synthetic and real (vision and medical) data sets showing how group testing on such diversely sampled longitudinal data is possible by analyzing the reconstructed data in the subspace spanned by the first few principal components.</i></p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":"2017 ","pages":"172-181"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/iccv.2017.28","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38023768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
An Optimal Transportation based Univariate Neuroimaging Index. 基于单变量神经成像指数的最优运输。
Liang Mi, Wen Zhang, Junwei Zhang, Yonghui Fan, Dhruman Goradia, Kewei Chen, Eric M Reiman, Xianfeng Gu, Yalin Wang
{"title":"An Optimal Transportation based Univariate Neuroimaging Index.","authors":"Liang Mi,&nbsp;Wen Zhang,&nbsp;Junwei Zhang,&nbsp;Yonghui Fan,&nbsp;Dhruman Goradia,&nbsp;Kewei Chen,&nbsp;Eric M Reiman,&nbsp;Xianfeng Gu,&nbsp;Yalin Wang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The alterations of brain structures and functions have been considered closely correlated to the change of cognitive performance due to neurodegenerative diseases such as Alzheimer's disease. In this paper, we introduce a variational framework to compute the optimal transformation (OT) in 3D space and propose a univariate neuroimaging index based on OT to measure such alterations. We compute the OT from each image to a template and measure the Wasserstein distance between them. By comparing the distances from all the images to the common template, we obtain a concise and informative index for each image. Our framework makes use of the Newton's method, which reduces the computational cost and enables itself to be applicable to large-scale datasets. The proposed work is a generic approach and thus may be applicable to various volumetric brain images, including structural magnetic resonance (sMR) and fluorodeoxyglucose positron emission tomography (FDG-PET) images. In the classification between Alzheimer's disease patients and healthy controls, our method achieves an accuracy of 82.30% on the Alzheimers Disease Neuroimaging Initiative (ADNI) baseline sMRI dataset and outperforms several other indices. On FDG-PET dataset, we boost the accuracy to 88.37% by leveraging pairwise Wasserstein distances. In a longitudinal study, we obtain a 5% significance with p-value = 1.13×10<sup>5</sup> in a t-test on FDG-PET. The results demonstrate a great potential of the proposed index for neuroimage analysis and the precision medicine research.</p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":"2017 ","pages":"182-191"},"PeriodicalIF":0.0,"publicationDate":"2017-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5719504/pdf/nihms896614.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35238488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Projection free method for Generalized Eigenvalue Problem with a nonsmooth Regularizer. 用非光滑正则处理广义特征值问题的无投影方法
Proceedings. IEEE International Conference on Computer Vision Pub Date : 2015-12-01 DOI: 10.1109/ICCV.2015.214
Seong Jae Hwang, Maxwell D Collins, Sathya N Ravi, Vamsi K Ithapu, Nagesh Adluru, Sterling C Johnson, Vikas Singh
{"title":"A Projection free method for Generalized Eigenvalue Problem with a nonsmooth Regularizer.","authors":"Seong Jae Hwang, Maxwell D Collins, Sathya N Ravi, Vamsi K Ithapu, Nagesh Adluru, Sterling C Johnson, Vikas Singh","doi":"10.1109/ICCV.2015.214","DOIUrl":"10.1109/ICCV.2015.214","url":null,"abstract":"<p><p>Eigenvalue problems are ubiquitous in computer vision, covering a very broad spectrum of applications ranging from estimation problems in multi-view geometry to image segmentation. Few other linear algebra problems have a more mature set of numerical routines available and many computer vision libraries leverage such tools extensively. However, the ability to call the underlying solver only as a \"black box\" can often become restrictive. Many 'human in the loop' settings in vision frequently exploit supervision from an expert, to the extent that the user can be considered a subroutine in the overall system. In other cases, there is additional domain knowledge, side or even partial information that one may want to incorporate within the formulation. In general, regularizing a (generalized) eigenvalue problem with such side information remains difficult. Motivated by these needs, this paper presents an optimization scheme to solve generalized eigenvalue problems (GEP) involving a (nonsmooth) regularizer. We start from an alternative formulation of GEP where the feasibility set of the model involves the Stiefel manifold. The core of this paper presents an end to end stochastic optimization scheme for the resultant problem. We show how this general algorithm enables improved statistical analysis of brain imaging data where the regularizer is derived from other 'views' of the disease pathology, involving clinical measurements and other image-derived representations.</p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":"2015 ","pages":"1841-1849"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4828964/pdf/nihms764614.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"34405185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised Synchrony Discovery in Human Interaction. 人际互动中的无监督同步发现
Proceedings. IEEE International Conference on Computer Vision Pub Date : 2015-12-01 DOI: 10.1109/ICCV.2015.360
Wen-Sheng Chu, Jiabei Zeng, Fernando De la Torre, Jeffrey F Cohn, Daniel S Messinger
{"title":"Unsupervised Synchrony Discovery in Human Interaction.","authors":"Wen-Sheng Chu, Jiabei Zeng, Fernando De la Torre, Jeffrey F Cohn, Daniel S Messinger","doi":"10.1109/ICCV.2015.360","DOIUrl":"10.1109/ICCV.2015.360","url":null,"abstract":"<p><p>People are inherently social. Social interaction plays an important and natural role in human behavior. Most computational methods focus on individuals alone rather than in social context. They also require labelled training data. We present an unsupervised approach to discover interpersonal synchrony, referred as to two or more persons preforming common actions in overlapping video frames or segments. For computational efficiency, we develop a branch-and-bound (B&B) approach that affords exhaustive search while guaranteeing a globally optimal solution. The proposed method is entirely general. It takes from two or more videos any multi-dimensional signal that can be represented as a histogram. We derive three novel bounding functions and provide efficient extensions, including multi-synchrony detection and accelerated search, using a warm-start strategy and parallelism. We evaluate the effectiveness of our approach in multiple databases, including human actions using the CMU Mocap dataset [1], spontaneous facial behaviors using group-formation task dataset [37] and parent-infant interaction dataset [28].</p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":"2015 ","pages":"3146-3154"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4918688/pdf/nihms-751964.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"34612878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions. 伸出一只手:在复杂的自我中心互动中检测手和识别活动。
Proceedings. IEEE International Conference on Computer Vision Pub Date : 2015-12-01 Epub Date: 2016-02-18 DOI: 10.1109/ICCV.2015.226
Sven Bambach, Stefan Lee, David J Crandall, Chen Yu
{"title":"Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions.","authors":"Sven Bambach,&nbsp;Stefan Lee,&nbsp;David J Crandall,&nbsp;Chen Yu","doi":"10.1109/ICCV.2015.226","DOIUrl":"https://doi.org/10.1109/ICCV.2015.226","url":null,"abstract":"<p><p>Hands appear very often in egocentric video, and their appearance and pose give important cues about what people are doing and what they are paying attention to. But existing work in hand detection has made strong assumptions that work well in only simple scenarios, such as with limited interaction with other people or in lab settings. We develop methods to locate and distinguish between hands in egocentric video using strong appearance models with Convolutional Neural Networks, and introduce a simple candidate region generation approach that outperforms existing techniques at a fraction of the computational cost. We show how these high-quality bounding boxes can be used to create accurate pixelwise hand regions, and as an application, we investigate the extent to which hand segmentation alone can distinguish between different activities. We evaluate these techniques on a new dataset of 48 first-person videos of people interacting in realistic environments, with pixel-level ground truth for over 15,000 hand instances.</p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":"2015 ","pages":"1949-1957"},"PeriodicalIF":0.0,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/ICCV.2015.226","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35238485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 359
Volumetric Semantic Segmentation using Pyramid Context Features. 使用金字塔上下文特征的体积语义分割。
Proceedings. IEEE International Conference on Computer Vision Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.428
Jonathan T Barron, Pablo Arbeláez, Soile V E Keränen, Mark D Biggin, David W Knowles, Jitendra Malik
{"title":"Volumetric Semantic Segmentation using Pyramid Context Features.","authors":"Jonathan T Barron, Pablo Arbeláez, Soile V E Keränen, Mark D Biggin, David W Knowles, Jitendra Malik","doi":"10.1109/ICCV.2013.428","DOIUrl":"10.1109/ICCV.2013.428","url":null,"abstract":"<p><p>We present an algorithm for the per-voxel semantic segmentation of a three-dimensional volume. At the core of our algorithm is a novel \"pyramid context\" feature, a descriptive representation designed such that exact per-voxel linear classification can be made extremely efficient. This feature not only allows for efficient semantic segmentation but enables other aspects of our algorithm, such as novel learned features and a stacked architecture that can reason about self-consistency. We demonstrate our technique on 3D fluorescence microscopy data of Drosophila embryos for which we are able to produce extremely accurate semantic segmentations in a matter of minutes, and for which other algorithms fail due to the size and high-dimensionality of the data, or due to the difficulty of the task.</p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":"2013 ","pages":"3448-3455"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/ICCV.2013.428","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33223269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Learning a Dictionary of Shape Epitomes with Applications to Image Labeling. 学习形状缩影词典及其在图像标记中的应用。
Proceedings. IEEE International Conference on Computer Vision Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.49
Liang-Chieh Chen, George Papandreou, Alan L Yuille
{"title":"Learning a Dictionary of Shape Epitomes with Applications to Image Labeling.","authors":"Liang-Chieh Chen,&nbsp;George Papandreou,&nbsp;Alan L Yuille","doi":"10.1109/ICCV.2013.49","DOIUrl":"https://doi.org/10.1109/ICCV.2013.49","url":null,"abstract":"<p><p>The first main contribution of this paper is a novel method for representing images based on a dictionary of shape epitomes. These shape epitomes represent the local edge structure of the image and include hidden variables to encode shift and rotations. They are learnt in an unsupervised manner from groundtruth edges. This dictionary is compact but is also able to capture the typical shapes of edges in natural images. In this paper, we illustrate the shape epitomes by applying them to the image labeling task. In other work, described in the supplementary material, we apply them to edge detection and image modeling. We apply shape epitomes to image labeling by using Conditional Random Field (CRF) Models. They are alternatives to the superpixel or pixel representations used in most CRFs. In our approach, the shape of an image patch is encoded by a shape epitome from the dictionary. Unlike the superpixel representation, our method avoids making early decisions which cannot be reversed. Our resulting hierarchical CRFs efficiently capture both local and global class co-occurrence properties. We demonstrate its quantitative and qualitative properties of our approach with image labeling experiments on two standard datasets: MSRC-21 and Stanford Background.</p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":"2013 ","pages":"337-344"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/ICCV.2013.49","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33964061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Recursive Estimation of the Stein Center of SPD Matrices & its Applications. SPD矩阵Stein中心的递归估计及其应用。
Proceedings. IEEE International Conference on Computer Vision Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.225
Hesamoddin Salehian, Guang Cheng, Baba C Vemuri, Jeffrey Ho
{"title":"Recursive Estimation of the Stein Center of SPD Matrices & its Applications.","authors":"Hesamoddin Salehian,&nbsp;Guang Cheng,&nbsp;Baba C Vemuri,&nbsp;Jeffrey Ho","doi":"10.1109/ICCV.2013.225","DOIUrl":"https://doi.org/10.1109/ICCV.2013.225","url":null,"abstract":"<p><p>Symmetric positive-definite (SPD) matrices are ubiquitous in Computer Vision, Machine Learning and Medical Image Analysis. Finding the center/average of a population of such matrices is a common theme in many algorithms such as clustering, segmentation, principal geodesic analysis, etc. The center of a population of such matrices can be defined using a variety of distance/divergence measures as the minimizer of the sum of squared distances/divergences from the unknown center to the members of the population. It is well known that the computation of the Karcher mean for the space of SPD matrices which is a negatively-curved Riemannian manifold is computationally expensive. Recently, the LogDet divergence-based center was shown to be a computationally attractive alternative. However, the LogDet-based mean of more than two matrices can not be computed in closed form, which makes it computationally less attractive for large populations. In this paper we present a novel recursive estimator for center based on the Stein distance - which is the square root of the LogDet divergence - that is significantly faster than the batch mode computation of this center. The key theoretical contribution is a closed-form solution for the weighted Stein center of two SPD matrices, which is used in the recursive computation of the Stein center for a population of SPD matrices. Additionally, we show experimental evidence of the convergence of our recursive Stein center estimator to the batch mode Stein center. We present applications of our recursive estimator to K-means clustering and image indexing depicting significant time gains over corresponding algorithms that use the batch mode computations. For the latter application, we develop novel hashing functions using the Stein distance and apply it to publicly available data sets, and experimental results have shown favorable comparisons to other competing methods.</p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":" ","pages":"1793-1800"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/ICCV.2013.225","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9368617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Image Segmentation with Cascaded Hierarchical Models and Logistic Disjunctive Normal Networks. 基于级联层次模型和逻辑析取正规网络的图像分割。
Proceedings. IEEE International Conference on Computer Vision Pub Date : 2013-12-01 DOI: 10.1109/ICCV.2013.269
Mojtaba Seyedhosseini, Mehdi Sajjadi, Tolga Tasdizen
{"title":"Image Segmentation with Cascaded Hierarchical Models and Logistic Disjunctive Normal Networks.","authors":"Mojtaba Seyedhosseini,&nbsp;Mehdi Sajjadi,&nbsp;Tolga Tasdizen","doi":"10.1109/ICCV.2013.269","DOIUrl":"https://doi.org/10.1109/ICCV.2013.269","url":null,"abstract":"<p><p>Contextual information plays an important role in solving vision problems such as image segmentation. However, extracting contextual information and using it in an effective way remains a difficult problem. To address this challenge, we propose a multi-resolution contextual framework, called cascaded hierarchical model (CHM), which learns contextual information in a hierarchical framework for image segmentation. At each level of the hierarchy, a classifier is trained based on downsampled input images and outputs of previous levels. Our model then incorporates the resulting multi-resolution contextual information into a classifier to segment the input image at original resolution. We repeat this procedure by cascading the hierarchical framework to improve the segmentation accuracy. Multiple classifiers are learned in the CHM; therefore, a fast and accurate classifier is required to make the training tractable. The classifier also needs to be robust against overfitting due to the large number of parameters learned during training. We introduce a novel classification scheme, called logistic disjunctive normal networks (LDNN), which consists of one adaptive layer of feature detectors implemented by logistic sigmoid functions followed by two fixed layers of logical units that compute conjunctions and disjunctions, respectively. We demonstrate that LDNN outperforms state-of-theart classifiers and can be used in the CHM to improve object segmentation performance.</p>","PeriodicalId":74564,"journal":{"name":"Proceedings. IEEE International Conference on Computer Vision","volume":"2013 ","pages":"2168-2175"},"PeriodicalIF":0.0,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/ICCV.2013.269","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32835013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 80
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信