2009 13th International Machine Vision and Image Processing Conference: Latest Papers

Estimating 3D Scene Flow from Multiple 2D Optical Flows
2009 13th International Machine Vision and Image Processing Conference Pub Date : 2009-09-02 DOI: 10.1109/IMVIP.2009.8
J. Ruttle, M. Manzke, Rozenn Dahyot
Abstract: Scene flow is the motion of the surface points in the 3D world. For a camera, it is seen as a 2D optical flow in the image plane. Knowing the scene flow can be very useful as it gives an idea of the surface geometry of the objects in the scene and how those objects are moving. Four methods for calculating the scene flow given multiple optical flows have been explored and detailed in this paper along with the basic mathematics surrounding multi-view geometry. It was found that given multiple optical flows it is possible to estimate the scene flow to different levels of detail depending on the level of prior information present.
Citations: 8
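The relation between scene flow and per-camera optical flow described in the abstract above can be sketched as a small least-squares problem. This is not one of the four methods from the paper, whose details are not given here. It is a minimal illustration under strong assumptions: the cameras are calibrated (each has a known 3x4 projection matrix `P`), the 3D position `X` of the surface point is already known, and each optical flow vector is related to the scene flow `V` by the linearised projection `u_i = J_i V`. All function names are hypothetical.

```python
def projection_jacobian(P, X):
    """2x3 Jacobian of the pinhole projection x = P [X; 1] with respect to X."""
    Xh = list(X) + [1.0]
    a, b, w = (sum(P[r][c] * Xh[c] for c in range(4)) for r in range(3))
    return [
        [(P[0][j] * w - a * P[2][j]) / (w * w) for j in range(3)],
        [(P[1][j] * w - b * P[2][j]) / (w * w) for j in range(3)],
    ]


def solve3(M, rhs):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    A = [row[:] + [r] for row, r in zip(M, rhs)]
    n = 3
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(A[r][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("scene flow is not constrained by these views")
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(col + 1, n):
            f = A[r][col] / A[col][col]
            for c in range(col, n + 1):
                A[r][c] -= f * A[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (A[r][n] - sum(A[r][c] * x[c] for c in range(r + 1, n))) / A[r][r]
    return x


def scene_flow(cameras, X, flows):
    """Least-squares 3D flow V from 2D flows (du, dv) seen by several cameras.

    Assumption: each flow satisfies u_i = J_i V, where J_i is the projection
    Jacobian of camera i at the known 3D point X. At least two views with
    different viewpoints are needed for the system to have full rank.
    """
    rows, rhs = [], []
    for P, (du, dv) in zip(cameras, flows):
        rows += projection_jacobian(P, X)  # two equations per camera
        rhs += [du, dv]
    # Normal equations (A^T A) V = A^T b for the stacked 2N x 3 system.
    AtA = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    Atb = [sum(r[i] * t for r, t in zip(rows, rhs)) for i in range(3)]
    return solve3(AtA, Atb)
```

A single camera contributes only two equations for the three unknowns of `V`, which is why multiple optical flows are needed in this formulation.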
A Multistage Hierarchical Algorithm for Hand Shape Recognition
2009 13th International Machine Vision and Image Processing Conference Pub Date : 2009-09-02 DOI: 10.1109/IMVIP.2009.26
M. Farouk, Alistair Sutherland, Amin A. Shoukry
Abstract: This paper presents a multistage hierarchical algorithm for hand shape recognition using principal component analysis (PCA) as a dimensionality reduction and feature extraction method. The paper discusses the effect of image blurring on building data manifolds using PCA and the different ways to construct these manifolds. In order to classify the hand shape of an incoming sign object and to be invariant to linear transformations like translation and rotation, a multistage hierarchical classifier structure is used. Computer generated images for different Irish Sign Language shapes are used in testing. Experimental results are given to show the accuracy and performance of the proposed algorithm.
Citations: 6
Hidden Conditional Random Fields for Visual Speech Recognition
2009 13th International Machine Vision and Image Processing Conference Pub Date : 2009-09-02 DOI: 10.1109/IMVIP.2009.28
Adrian Pass, Jianguo Zhang, D. Stewart
Abstract: In this paper we present the application of Hidden Conditional Random Fields (HCRFs) to modeling speech for visual speech recognition. HCRFs may be easily adapted to model long range dependencies across an observation sequence. As a result visual word recognition performance can be improved as the model is able to take more of a contextual approach to generating state sequences. Results are presented from a speaker-dependent, isolated digit, visual speech recognition task using comparisons with a baseline HMM system. We firstly illustrate that word recognition rates on clean video using HCRFs can be improved by increasing the number of past and future observations being taken into account by each state. Secondly we compare model performances using various levels of video compression on the test set. As far as we are aware this is the first attempted use of HCRFs for visual speech recognition.
Citations: 2
GPU Implementation of the Affine Transform for 3D Image Registration
2009 13th International Machine Vision and Image Processing Conference Pub Date : 2009-09-02 DOI: 10.1109/IMVIP.2009.34
D. Crookes, K. Boyle, P. Miller, C. Gillan
Abstract: Recent developments in 3D low-light level CCD (L3CCD) image capture have resulted in vast volumes of data being produced in real time which require image registration. The amount of data involved means that acceleration of the processing is essential. One of the key steps in one iterative registration algorithm is the application of an affine transform to all the planes of a 3D image. This paper presents details and performance results for a number of parallelized implementations of the affine transform on the NVIDIA 8800 GPU series, and shows that the transform runs 128 times faster on the GPU than a C++ version on a PC, or 54 times faster when data transfer between the GPU and the host PC is included.
Citations: 6
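The step named in the abstract above, applying an affine transform to all the planes of a 3D image, can be sketched as a plain CPU reference. This is not the authors' GPU implementation. It assumes a 2D affine map `x' = A x + t` applied identically to every plane, inverse mapping with nearest-neighbour sampling, and a volume stored as nested lists; the function names and the `fill` parameter are hypothetical.

```python
def affine_plane(plane, A, t, fill=0):
    """Apply x' = A x + t to one 2D plane (coordinates are (column, row)).

    Uses inverse mapping: for every output pixel, look up the source pixel
    it came from, so the output has no holes. Pixels whose source falls
    outside the plane are set to `fill`.
    """
    h, w = len(plane), len(plane[0])
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    if det == 0:
        raise ValueError("affine matrix is not invertible")
    inv = [[A[1][1] / det, -A[0][1] / det],
           [-A[1][0] / det, A[0][0] / det]]
    out = [[fill] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dx, dy = x - t[0], y - t[1]
            sx = round(inv[0][0] * dx + inv[0][1] * dy)  # source column
            sy = round(inv[1][0] * dx + inv[1][1] * dy)  # source row
            if 0 <= sx < w and 0 <= sy < h:
                out[y][x] = plane[sy][sx]
    return out


def affine_volume(volume, A, t, fill=0):
    """Apply the same 2D affine transform to every plane of a 3D image."""
    return [affine_plane(plane, A, t, fill) for plane in volume]
```

Each output pixel depends only on one source lookup and on no other output pixel, which is the property that makes this step a natural candidate for the kind of parallelization the abstract describes.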
Denoising Magnetic Resonance Images Using Fourth Order Complex Diffusion
2009 13th International Machine Vision and Image Processing Conference Pub Date : 2009-09-02 DOI: 10.1109/IMVIP.2009.29
Jeny Rajan, B. Jeurissen, Jan Sijbers, K. Kannan
Abstract: Complex diffusion is a comparatively new Partial Differential Equation (PDE) based method introduced for removing noise from images. The efficiency of 2nd order complex diffusion for image denoising has already been demonstrated by many researchers. 2nd order nonlinear complex diffusion can behave like 3rd and 4th order real PDEs, enabling a variety of new options with standard 2nd order numerical schemes. Extending 2nd order nonlinear complex diffusion to 4th order can produce a much better result. In this paper we present a 4th order nonlinear complex diffusion. Our experimental results show that this 4th order complex PDE is a good choice for denoising Magnetic Resonance images. The efficacy of the algorithm is demonstrated on both simulated and real Magnetic Resonance images.
Citations: 9
Rotation Invariant Matching of Partial Shoeprints
2009 13th International Machine Vision and Image Processing Conference Pub Date : 2009-09-02 DOI: 10.1109/IMVIP.2009.24
O. Nibouche, A. Bouridane, D. Crookes, M. Gueham, M. Laadjel
Abstract: In this paper, we propose a solution for the problem of rotated partial shoeprint retrieval, based on the combined use of local points of interest and the SIFT descriptor. Once the generated features are encoded using the SIFT descriptor, matching is carried out using RANSAC to estimate a transformation model and establish the number of its inliers, which is then multiplied by the sum of point-to-point Euclidean distances below a hard threshold. We demonstrate that such a combination can overcome the issue of retrieval of partial prints in the presence of rotation and noise distortions. Conducted experiments have shown that the proposed solution achieves very good matching results and outperforms similar work in the literature both in terms of performance and complexity.
Citations: 32
Segmentation of Intertwining Stringlike Objects in Three Dimensional CT Image Based on Positional Information
2009 13th International Machine Vision and Image Processing Conference Pub Date : 2009-09-02 DOI: 10.1109/IMVIP.2009.13
T. Shinohara
Abstract: In this paper, a method for segmenting intertwining stringlike objects (strings) in a three-dimensional (3D) X-ray computed tomography (CT) image is proposed. In the proposed method, the positional information, which is a sequence of the center points of the string, is used. The positional information is obtained by tracing the string. The string is traced by sequentially estimating its center and direction. The center and direction are estimated by correlating the voxel values with a solid model of the string. The voxels in the 3D CT image are segmented into the strings by clustering on the basis of the distance between the voxel and the obtained positional information. The effectiveness of the proposed segmentation method is discussed by experimentally applying it to the 3D CT image of a plain knitted fabric.
Citations: 0
Robust Panning Analysis for Slideshow Detection in Video Databases
2009 13th International Machine Vision and Image Processing Conference Pub Date : 2009-09-02 DOI: 10.1109/IMVIP.2009.23
Zbigniew Zdziarski, Rozenn Dahyot
Abstract: We present an algorithm for slideshow detection in video databases such as YouTube or Blip.TV. Our solution is based around feature tracking to extract movement between sequentially captured frames. This movement is then analysed through the use of the Hough Transform and compared against behaviour commonly exhibited by slideshows: still and panning static images. We show experimentally the effectiveness of this novel idea and approach.
Citations: 0
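The idea in the abstract above, voting on tracked feature movement and comparing the result against "still" and "panning" behaviour, can be illustrated with a simple accumulator. This is a Hough-style voting sketch only, not the paper's algorithm, whose parameterisation is not given here. It assumes the feature tracker has already produced one `(dx, dy)` displacement per feature between two frames; the function name, `bin_size`, and `min_support` are hypothetical.

```python
from collections import Counter


def classify_motion(displacements, bin_size=1.0, min_support=0.8):
    """Label frame-to-frame motion as 'still', 'panning', or 'other'.

    Each tracked feature votes for a quantised displacement bin. If one bin
    collects at least `min_support` of the votes, the whole frame moved as a
    single translation: 'still' when that translation is zero, otherwise
    'panning'. Anything less coherent is labelled 'other'.
    """
    if not displacements:
        return "other"
    votes = Counter(
        (round(dx / bin_size), round(dy / bin_size)) for dx, dy in displacements
    )
    (bx, by), count = votes.most_common(1)[0]
    if count / len(displacements) < min_support:
        return "other"
    return "still" if (bx, by) == (0, 0) else "panning"
```

A static image being panned moves every feature by roughly the same vector, so its votes pile into one bin, whereas ordinary video with independently moving objects spreads them out.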