{"title":"Sparse Variation Pattern for Texture Classification","authors":"M. Tavakolian, F. Hajati, A. Mian, S. Gheisari","doi":"10.1109/DICTA.2013.6691530","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691530","url":null,"abstract":"We present Sparse Variation Pattern (SVP) to extract image features for texture classification. Using the directional derivatives in a local circular neighborhood, SVP captures texture transition patterns in the spatial domain. Unlike conventional feature extraction methods, SVP characterizes the image points taking the co-occurrence of two derivatives in the same direction into account without encoding to binary patterns. Using the directional derivatives, SVP defines a dictionary to solve the classification problem with sparse representation. The proposed texture descriptor was evaluated on the FERET and the LFW face databases, and the PolyU palmprint database. Comparisons with the existing state-of-the-art methods demonstrate that the SVP achieves the overall best performance on all three databases.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129284662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Alina Bialkowski, P. Lucey, Xinyu Wei, S. Sridharan
{"title":"Person Re-Identification Using Group Information","authors":"Alina Bialkowski, P. Lucey, Xinyu Wei, S. Sridharan","doi":"10.1109/DICTA.2013.6691512","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691512","url":null,"abstract":"After first observing a person, the task of person re-identification involves recognising an individual at different locations across a network of cameras at a later time. Traditionally, this task has been performed by first extracting appearance features of an individual and then matching these features to the previous observation. However, identifying an individual based solely on appearance can be ambiguous, particularly when people wear similar clothing (i.e. people dressed in uniforms in sporting and school settings). This task is made more difficult when the resolution of the input image is small as is typically the case in multi-camera networks. To circumvent these issues, we need to use other contextual cues. In this paper, we use \"group\" information as our contextual feature to aid in the re-identification of a person, which is heavily motivated by the fact that people generally move together as a collective group. To encode group context, we learn a linear mapping function to assign each person to a \"role\" or position within the group structure. We then combine the appearance and group context cues using a weighted summation. We demonstrate how this improves performance of person re-identification in a sports environment over appearance based-features.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132348144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using Super-Resolution Methods to Solve a Novel Multi-Sinusoidal Signal Model","authors":"R. Marchant, P. Jackway","doi":"10.1109/DICTA.2013.6691537","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691537","url":null,"abstract":"Sinusoidal signal models are a useful representation of local image structure, as sinusoid phase describes symmetry separately from strength and orientation. Existing models consist of one or two oriented sinusoids, calculated using the 0th to 3rd order Riesz transforms. We propose an expanded signal model consisting of a larger number of oriented sinusoids. The model parameters are estimated using higher-order Riesz transforms and a novel application of super-resolution theory. Image features consisting of multiple lines or edges can be analysed using the method, which compares favourably to existing approaches.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130844051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Utilizing Least Significant Bit-Planes of RONI Pixels for Medical Image Watermarking","authors":"H. Nyeem, W. Boles, C. Boyd","doi":"10.1109/DICTA.2013.6691538","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691538","url":null,"abstract":"We propose a computationally efficient image border pixel based watermark embedding scheme for medical images. We considered the border pixels of a medical image as RONI (region of non-interest), since those pixels have no or little interest to doctors and medical professionals irrespective of the image modalities. Although RONI is used for embedding, our proposed scheme still keeps distortion at a minimum level in the embedding region using the optimum number of least significant bit-planes for the border pixels. All these not only ensure that a watermarked image is safe for diagnosis, but also help minimize the legal and ethical concerns of altering all pixels of medical images in any manner (e.g, reversible or irreversible). The proposed scheme avoids the need for RONI segmentation, which incurs capacity and computational overheads. The performance of the proposed scheme has been compared with a relevant scheme in terms of embedding capacity, image perceptual quality (measured by SSIM and PSNR), and computational efficiency. Our experimental results show that the proposed scheme is computationally efficient, offers an image-content-independent embedding capacity, and maintains a good image quality of RONI while keeping all other pixels in the image untouched.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123582493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ICFSIFT: Improving Collection-Specific CBIR with ICF-Based Local Features","authors":"Nabeel Mohammed, D. Squire","doi":"10.1109/DICTA.2013.6691498","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691498","url":null,"abstract":"We present a new adaptive local feature, ICFSIFT, which utilises SIFT keypoints and Independent Component Analysis. The ICFSIFT feature combines the keypoint detection, and scale and orientation invariance, of SIFT with the collection-specific adaptive properties of Independent Component Filter (ICF) features. We evaluate the performance of this feature for image retrieval on two standard texture collections, comparing with SIFT features and previously published global ICF features. On both collections the ICFSIFT features perform best. We also show that combining these ICFSIFT features with the ICF-based global features further improves CBIR performance.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125081697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
O. V. R. Murthy, Ibrahim Radwan, Abhinav Dhall, Roland Göcke
{"title":"On the Effect of Human Body Parts in Large Scale Human Behaviour Recognition","authors":"O. V. R. Murthy, Ibrahim Radwan, Abhinav Dhall, Roland Göcke","doi":"10.1109/DICTA.2013.6691507","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691507","url":null,"abstract":"Automatic analysis of human behaviour in large collections of videos is gaining interest, even more so with the advent of file sharing sites such as YouTube. Human behaviour analysis methods can be categorised into three classes based on the type of features. The three representations are local, region of interest and densely sampled based representations. Local feature representation, such as Spatio-Temporal Interest Points (STIP), are quite popular for modelling temporal aspects in human action recognition. Region of Interest (ROI) based feature representations try to capture and represent human body part regions. Densely sampled representations capture information at uniformly spaced intervals spread in space and temporal directions of the given video. In this paper, we investigate the effect of human body part (ROI) information in large scale action recognition. Further, we also investigate the effect of its fusion with Harris 3D points (local representation) information and densely sampled representations. All experiments use a Bag-of-Words framework. We present our results on large class benchmark databases such as the UCF50 and HMDB51 datasets.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116352025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yuyao Zhang, P. Ogunbona, W. Li, B. Munro, G. Wallace
{"title":"Pathological Gait Detection of Parkinson's Disease Using Sparse Representation","authors":"Yuyao Zhang, P. Ogunbona, W. Li, B. Munro, G. Wallace","doi":"10.1109/DICTA.2013.6691510","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691510","url":null,"abstract":"Parkinson's disease is a progressively degenerative neurological disorder which impacts the control of body movements. While there is no known permanent cure for the disorder, it is possible to monitor the progression and establish management regime that could help the medical team, patients and their family cope with the condition. Gait analysis becomes an attractive quantitative and non-invasive mechanism that can aid early detection and monitoring of the response of patients to the management schedules. In this paper, we model cycles of human gait as a sparsely represented signal using over-complete dictionary. This representation forms the basis of a classification that allows the recognition of symptomatic subjects. Experiments have been conducted using signals of vertical ground reaction force (GRF) from subjects with Parkinson's disease from the publicly available gait database (physionet.org). Our method achieved a classification accuracy of 83% in recognising pathological cases and represents a significant improvement on previously published results that use a selection of the Fourier transform coefficients as features.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"161 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128636966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sadeep Jayasumana, R. Hartley, M. Salzmann, Hongdong Li, M. Harandi
{"title":"Combining Multiple Manifold-Valued Descriptors for Improved Object Recognition","authors":"Sadeep Jayasumana, R. Hartley, M. Salzmann, Hongdong Li, M. Harandi","doi":"10.1109/DICTA.2013.6691493","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691493","url":null,"abstract":"We present a learning method for classification using multiple manifold-valued features. Manifold techniques are becoming increasingly popular in computer vision since Riemannian geometry often comes up as a natural model for many descriptors encountered in different branches of computer vision. We propose a feature combination and selection method that optimally combines descriptors lying on different manifolds while respecting the Riemannian geometry of each underlying manifold. We use our method to improve object recognition by combining HOG~cite{Dalal05Hog} and Region Covariance~cite{Tuzel06} descriptors that reside on two different manifolds. To this end, we propose a kernel on the $n$-dimensional unit sphere and prove its positive definiteness. Our experimental evaluation shows that combining these two powerful descriptors using our method results in significant improvements in recognition accuracy.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128856211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Parallel Implementation of Geodesic Distance Transform with Application in Superpixel Segmentation","authors":"Tuan Q. Pham","doi":"10.1109/DICTA.2013.6691508","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691508","url":null,"abstract":"This paper presents a parallel implementation of geodesic distance transform using OpenMP. We show how a sequential-based chamfer distance algorithm can be executed on parallel processing units with shared memory such as multiple cores on a modern CPU. Experimental results show a speedup of 2.6 times on a quad-core machine can be achieved without loss in accuracy. This work forms part of a C implementation for geodesic superpixel segmentation of natural images.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129758230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lei Tong, J. Zhou, Chengyuan Xu, Y. Qian, Yongsheng Gao
{"title":"Soil Biochar Quantification via Hyperspectral Unmixing","authors":"Lei Tong, J. Zhou, Chengyuan Xu, Y. Qian, Yongsheng Gao","doi":"10.1109/DICTA.2013.6691529","DOIUrl":"https://doi.org/10.1109/DICTA.2013.6691529","url":null,"abstract":"Biochar has unique function to improve soil chemo-physical and biological properties for crop growth. Because changes of biochar in soil may affect its long-term effectiveness as an amendment, it is important to quantify and monitor biochar after application. In this paper, we propose a solution for this problem via hyperspectral image analysis. We treat the soil image as a mixture of soil and biochar signals, and then apply hyperspectral unmixing methods to predict the biochar abundance at each pixel. The final percentage of biochar can be calculated by taking the mean of the abundance of hyperspectral pixels. We have compared several hyperspectral unmixing methods based on least squares estimation and nonnegative matrix factorization with various sparsity constraints. Experimental results are evaluated by polynomial regression and root mean square errors against the ground truth data collected in the environmental labs. The results show that hyperspectral unmixing is a promising method to measure the percentage of biochar in the soil.","PeriodicalId":231632,"journal":{"name":"2013 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126377129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}