{"title":"Scale-Less Feature-Spatial Matching","authors":"Chao Zhang, Tingzhi Shen","doi":"10.1109/DICTA.2013.6691526","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691526","url":null,"abstract":"In this paper, we improve the discriminability of the Scale-Less SIFT (SLS) descriptor, which is constructed without requiring scale estimation of interest points. We thereby avoid to find stable scales which are difficult to obtain in many cases. Scale-Less SIFT descriptors of interest points are represented as sets of SIFT descriptors at multiple scales. We construct the linear subspace as the geometric representation for sets of SIFT descriptors. Then an embedding representation is learned that combines the descriptor similarity across scales and the spatial arrangement in a unified Euclidean embedding space. The learned subspace are highly capable of capturing the scale-varying values of SIFT descriptors. Experiment results demonstrate significant improvements by our constructed descriptors over existing methods on standard benchmark datasets.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114967699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Chan, R. Jeffree, M. Fay, S. Crozier, Zhengyi Yang, Y. Gal, P. Thomas
{"title":"Automated Classification of Bone and Air Volumes for Hybrid PET-MRI Brain Imaging","authors":"S. Chan, R. Jeffree, M. Fay, S. Crozier, Zhengyi Yang, Y. Gal, P. Thomas","doi":"10.1109/DICTA.2013.6691483","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691483","url":null,"abstract":"In clinically applicable structural magnetic resonance images (MRI), bone and air have similarly low signal intensity, making the differentiation between them a very challenging task. MRI-based bone/air segmentation, however, is a critical step in some emerging applications, such as skull atlas building, MRI-based attenuation correction for Positron Emission Tomography (PET), and MRI-based radiotherapy planning. In view of the availability of hybrid PET-MRI machines, we propose a voxel-wise classification method for bone/air segmentation. The method is based on random forest theory and features extracted from structural MRI and attenuation uncorrected PET. The Dice Similarity Score (DSC) score between the segmentation result and the 'ground truth' obtained by thresholding Computed Tomography images was calculated for validation. Images from 10 subjects were used for validation, achieving a DSC of 0.83±0.08 and 0.98±0.01 for air and bone, respectively. The results suggest that structural MRI and uncorrected PET can be used to reliably differentiate between air and bone.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"149 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127516236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mst. Nargis Aktar, M. Alam, A. Lambert, M. Pickering
{"title":"Robust 3D Multi-Modal Registration of MRI Volumes Using the Sum of Conditional Variance","authors":"Mst. Nargis Aktar, M. Alam, A. Lambert, M. Pickering","doi":"10.1109/DICTA.2013.6691520","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691520","url":null,"abstract":"Multi-modal registration is a fundamental step for many medical imaging procedures. In this paper, the sum of conditional variance (SCV) similarity measure is proposed for 3D multi-modal medical image registration. The SCV similarity measure is based on minimizing the sum of conditional variances that are calculated using the joint histogram of the two images to be registered. Standard Gauss-Newton optimization is used to automatically minimize this measure which allows fast computational time and high accuracy. Experimental results show that our proposed approach is robust, computationally efficient and also more accurate when compared with the standard mutual information (MI) based approach and also the recently proposed sum-of-squared-difference on entropy images (eSSD) approach.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"144 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124595538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ang Li, Changyang Li, Xiuying Wang, S. Eberl, D. Feng, M. Fulham
{"title":"Automated Segmentation of Prostate MR Images Using Prior Knowledge Enhanced Random Walker","authors":"Ang Li, Changyang Li, Xiuying Wang, S. Eberl, D. Feng, M. Fulham","doi":"10.1109/DICTA.2013.6691485","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691485","url":null,"abstract":"Prostate cancer is the second most common cause of cancer deaths in males. Accurate prostate segmentation from magnetic resonance (MR) images is critical to the diagnosis and treatment of prostate cancer. Automated prostate segmentation is challenging due to the variety in shapes and sizes of the prostate. Furthermore, the expected boundaries of ROIs are often indistinct, while heterogeneity concurrently exists within the ROIs. To address these challenges, we propose an automated approach that incorporates the local intensity features by random walker (RW) algorithm and global probability knowledge from an atlas to better describe unique characteristics of the prostate in MR images. We formulated a new RW weight function to take into account atlas probabilities and intensity differences. The prior knowledge from the atlas probability map not only reflects the statistical shape approximation of the prostate but also provides confinement and guidance for RW segmentation. Our approach was validated and compared with the conventional RW algorithm on segmenting 30 3-T prostate MR volumes. The experimental results indicated that our approach with an average DSC of 80.7±5.1%, outperformed that of the conventional RW (average DSC = 71.9±9.1%) and several other reported methods in terms of DSC accuracy and robustness.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133417637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Predicting Shot Locations in Tennis Using Spatiotemporal Data","authors":"Xinyu Wei, P. Lucey, S. Morgan, S. Sridharan","doi":"10.1109/DICTA.2013.6691516","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691516","url":null,"abstract":"Over the past decade, vision-based tracking systems have been successfully deployed in professional sports such as tennis and cricket for enhanced broadcast visualizations as well as aiding umpiring decisions. Despite the high-level of accuracy of the tracking systems and the sheer volume of spatiotemporal data they generate, the use of this high quality data for quantitative player performance and prediction has been lacking. In this paper, we present a method which predicts the location of a future shot based on the spatiotemporal parameters of the incoming shots (i.e. shot speed, location, angle and feet location) from such a vision system. Having the ability to accurately predict future short-term events has enormous implications in the area of automatic sports broadcasting in addition to coaching and commentary domains. Using Hawk-Eye data from the 2012 Australian Open Men's draw, we utilize a Dynamic Bayesian Network to model player behaviors and use an online model adaptation method to match the player's behavior to enhance shot predictability. To show the utility of our approach, we analyze the shot predictability of the top 3 players seeds in the tournament (Djokovic, Federer and Nadal) as they played the most amounts of games.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114706752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning Discriminative Local Patterns with Unrestricted Structure for Face Recognition","authors":"Douglas Brown, Yongsheng Gao, J. Zhou","doi":"10.1109/DICTA.2013.6691504","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691504","url":null,"abstract":"Local binary patterns are a popular local texture feature for describing textures and objects. The standard method and many derivatives use a hand- crafted structure of point comparisons to encode the local texture to build the descriptors. In this paper we propose automatically learning a discriminative pattern structure from an extended pool of candidate pattern elements, without restricting the possible configurations. The learnt pattern structure may contain elements describing many different scales and gradient orientations that are not available in LBP (and related patterns), thus allowing the flexibility to construct structures capable of better representing the objects under test. We show through experimentation on two face recognition databases that this approach consistently outperforms other methods, in terms of training speed and recognition accuracy in every tested case.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122177639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xueyan Dong, M. Towsey, Jinglan Zhang, Jasmine Banks, P. Roe
{"title":"A Novel Representation of Bioacoustic Events for Content-Based Search in Field Audio Data","authors":"Xueyan Dong, M. Towsey, Jinglan Zhang, Jasmine Banks, P. Roe","doi":"10.1109/DICTA.2013.6691473","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691473","url":null,"abstract":"Bioacoustic data can provide an important base for environmental monitoring. To explore a large amount of field recordings collected, an automated similarity search algorithm is presented in this paper. A region of an audio defined by frequency and time bounds is provided by a user; the content of the region is used to construct a query. In the retrieving process, our algorithm will automatically scan through recordings to search for similar regions. In detail, we present a feature extraction approach based on the visual content of vocalisations - in this case ridges, and develop a generic regional representation of vocalisations for indexing. Our feature extraction method works best for bird vocalisations showing ridge characteristics. The regional representation method allows the content of an arbitrary region of a continuous recording to be described in a compressed format.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129463414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Ad Hoc Radiometric Calibration of a Thermal-Infrared Camera","authors":"Stephen Vidas, Peyman Moghadam","doi":"10.1109/DICTA.2013.6691478","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691478","url":null,"abstract":"Many applications can benefit from the accurate surface temperature estimates that can be made using a passive thermal-infrared camera. However, the process of radiometric calibration which enables this can be both expensive and time consuming. An ad hoc approach for performing radiometric calibration is proposed which does not require specialized equipment and can be completed in a fraction of the time of the conventional method. The proposed approach utilizes the mechanical properties of the camera to estimate scene temperatures automatically, and uses these target temperatures to model the effect of sensor temperature on the digital output. A comparison with a conventional approach using a blackbody radiation source shows that the accuracy of the method is sufficient for many tasks requiring temperature estimation. Furthermore, a novel visualization method is proposed for displaying the radiometrically calibrated images to human operators. The representation employs an intuitive coloring scheme and allows the viewer to perceive a large variety of temperatures accurately.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128606867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Pose Priors for Aerial Image Registration","authors":"A. Marburg, M. Hayes, A. Bainbridge-Smith","doi":"10.1109/DICTA.2013.6691515","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691515","url":null,"abstract":"RANSAC has become an essential tool for the robust estimation of inter-image geometries using point correspondences. Due to its randomized nature, RANSAC becomes inefficient for large numbers of incorrect matches. This paper presents four methods for utilizing prior geometric information when estimating the homography which register a pair of near-nadir aerial images. In this case the initial geometric estimate is derived using strict assumptions about the camera and scene geometry and photometric information present in each image, however, the techniques described could be applied to other sources of pose prior information. The algorithms described allow for the successful estimation of the inter-image homography in a severely constrained number of iterations, even in the presence of low numbers of inliers.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128652058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automated Selection of Optimal Frames in NIR Iris Videos","authors":"Nitin K. Mahadeo, A. Paplinski, S. Ray","doi":"10.1109/DICTA.2013.6691486","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691486","url":null,"abstract":"A relatively new trend in the iris biometric area is the use of videos as a capturing device. Frame by frame approach is richer in information and gives more flexibility as opposed to the use of traditional still images. However, the quality, shape and size of the iris may vary from one frame to another. In this paper, we propose a new technique for selecting the best frames in an iris video. Taking advantage of the temporal correspondence in iris frames, we classify iris videos into 3 categories, namely Adequate, Motion Constrained and Time Constrained. Frames with blinks and off-angle gaze are eliminated using frame averaging and correlation. Quality factors, namely motion blur, out of focus, translational motion and lighting present in iris videos are detected and their effect on recognition performance is investigated. Experimental results are carried out on both the MBGC NIR Iris Video and the MBGC NIR Iris Still datasets from the National Institute for Standards and Technology (NIST). Firstly, this work demonstrates that the proposed optimal frame selection technique in NIR Iris Videos leads to significant improvement in recognition performance. Secondly, the performance of NIR Iris Still images vs. NIR Iris Videos is compared. Thirdly, we show that interoperability between iris frames and iris images in an iris recognition system affects performance. Finally, the computational time and the elimination of noisy frames at each stage using the proposed method are examined.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130023529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}