Adrian Rechy Romero, Srimal Jayawardena, Mark Cox, P. Borges
{"title":"Partitioning the Input Domain for Classification","authors":"Adrian Rechy Romero, Srimal Jayawardena, Mark Cox, P. Borges","doi":"10.1109/DICTA.2015.7371293","DOIUrl":"https://doi.org/10.1109/DICTA.2015.7371293","url":null,"abstract":"We explore an approach to use simple classification models to solve complex problems by partitioning the input domain into smaller regions that are more amenable to the classifier. For this purpose weinvestigate two variants of partitioning based on energy, as measured by the variance. We argue that restricting the energy of the input domain limits the complexity of the problem. Therefore, our method directly controls the energy in each partition. The partitioning methods and several classifiers are evaluated on a road detection application. Our results indicate that partitioning improves the performance of a linear Support Vector Machine and a classifier which considers the average label in each partition, to match the performance of a more sophisticated Neural Network classifier.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122889373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image Labeling by Integrating Local, Middle and Global Information","authors":"T. Ishida, K. Hotta","doi":"10.1109/DICTA.2015.7371268","DOIUrl":"https://doi.org/10.1109/DICTA.2015.7371268","url":null,"abstract":"We carry out image labeling based on probabilistic integration of local, middle and global information. Local information is effective for capturing color and texture pattern. Middle information is obtained from patches which are larger than local regions and is able to incorporate context information. Global information obtained from an entire image helps to decide the presence of categories in the scene. In the experiments using the MSRC21 dataset, labeling accuracies are much improved by integrating local, middle and global information. Our method gave the state-of-the-art performance.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121442997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Survey on Human Action Recognition Using Depth Sensors","authors":"Bin Liang, Lihong Zheng","doi":"10.1109/DICTA.2015.7371223","DOIUrl":"https://doi.org/10.1109/DICTA.2015.7371223","url":null,"abstract":"The recent advent of depth sensors opens up new opportunities to advance human action recognition by providing depth information. Many different approaches have been proposed for human action recognition using depth sensors. The main purpose of this paper is to provide a comprehensive study and an updated review on human action recognition using depth sensors. We give an overview of recent works in this field from the viewpoints of data modalities, feature extraction and classification. In terms of data modalities from depth sensors, recent approaches can be roughly categorized into depth map based and skeleton based approaches. Since depth maps encode 3D shape and appearance information, approaches based on depth maps are suitable for short simple actions and can achieve high performance. In contrast, due to the discriminative power and more concise form of skeletal joints, skeleton based approaches can model more complex actions, even in real time. This paper further provides a summary of the results obtained in the last couple of years on the public datasets. Moreover, we discuss limitations of the state of the art and outline promising directions of research in this area. The review assists in guiding both researchers and practitioners in the selection and development of approaches for human action recognition using depth sensors.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115858954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust Angle Invariant GAS Meter Reading","authors":"I. Gallo, Alessandro Zamberletti, L. Noce","doi":"10.1109/DICTA.2015.7371300","DOIUrl":"https://doi.org/10.1109/DICTA.2015.7371300","url":null,"abstract":"In this work we propose a novel method for automatic gas meter reading from real world images. In a wide range of countries all over the world, the existing automatic technology is not adopted, usually the reading is manually done on site, and a picture is taken through a mobile device as a proof of reading. In order to confirm the reading, a tedious work of checking the proof images is commonly done offline by an operator. With this contribution we aim to supply an effective system, able to provide a real support to the validation process reducing the human effort and the time consumed. We exploit both region-based and Maximally Stable Extremal Regions techniques, during the phase involving the localization of the meter area and to detect the meter counter digits in the detection step respectively. The evaluation has been carried out on every step of our approach, as well as on the overall assessment; although the problem is complex, the proposed method leads to good results even when applied to degraded images, it represents an effective solution to the gas meter reading problem and it can be utilized in real applications.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115777063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Selective Multi-Source Total Variation Image Restoration","authors":"Stephen Tierney, Yi Guo, Junbin Gao","doi":"10.1109/DICTA.2015.7371306","DOIUrl":"https://doi.org/10.1109/DICTA.2015.7371306","url":null,"abstract":"This paper is concerned with automatically fusing multiple noisy and partially corrupted source images into a single denoised image. To create the fused image we minimise a convex objective function, which ensures spatial smoothness through total variation regularisation, and similarity to the source images via pixel-wise selective regularisation against each of the source images. We call this approach Selective Multi-Source Total Variation Image Restoration (SMTV). Applications of SMTV include noise removal in low-light conditions, enhancement of images from low quality or damaged imaging sensors and haze or cloud removal from satellite imagery. Experimental evaluation demonstrates that the fusion of multiple images results in a more accurate recovery than single image restoration.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"137 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115712276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Monocular Position-Pose Measurement Based on Circular and Linear Features","authors":"Cai Meng, Jiao Xue, Zhan Hu","doi":"10.1109/DICTA.2015.7371284","DOIUrl":"https://doi.org/10.1109/DICTA.2015.7371284","url":null,"abstract":"This paper presents a new method for monocular position-pose measurement, which is based on the circular and linear features. The method improves the single circle based monocular position-pose measurement by recognizing the detectable circular structure and linear edge of target. First, according to the projection of the circle on the image, two sets of position-pose solutions are calculated, which do not contain the roll angle. Then, the roll angle is solved by utilizing the linear feature. The upper endpoint of the linear edge is selected as a reference point to eliminate the ambiguity. Finally, the simulation model and physical experimental platform are established to verify the validity and accuracy of the proposed method. Experimental results indicate that the proposed method can obtain the roll angle accurately and eliminate the ambiguity effectively. In case of noise and detection error, the results still remain accurate, which indicates that the proposed method is robust and has great application value in monocular position-pose measurement.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121586662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Quantitative and Qualitative Evaluation of Performance and Robustness of Image Stitching Algorithms","authors":"Vipula Dissanayake, Sachini Herath, Sanka Rasnayaka, Sachith Seneviratne, Rajith Vidanaarachchi, C. Gamage","doi":"10.1109/DICTA.2015.7371297","DOIUrl":"https://doi.org/10.1109/DICTA.2015.7371297","url":null,"abstract":"Many different image stitching algorithms, and mechanisms to assess their quality have been proposed by different research groups in the past decade. However, a comparison across different stitching algorithms and evaluation mechanisms has not been performed before. Our objective is to recognize the best algorithm for panoramic image stitching. We measure the robustness of different algorithms by means of assessing image quality of a set of panoramas. For the evaluation itself a varied set of assessment criteria are used, and the evaluation is performed over a large range of images captured using differing cameras. In an ideal stitching algorithm, the resulting stitched image should be without visible seams and other noticeable anomalies. An objective evaluation for image quality should give results corresponding to a similar evaluation by the Human Visual System. Our conclusion is that the choice of stitching algorithm is scenario dependent, with run-time and accuracy being the primary considerations.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121998975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Kernel Subspace Integral Image Based Probabilistic Visual Object Tracking","authors":"Iftikhar Majeed, Omar Arif","doi":"10.1109/DICTA.2015.7371275","DOIUrl":"https://doi.org/10.1109/DICTA.2015.7371275","url":null,"abstract":"This paper presents a novel object tracking algorithm. Object appearance and spatial information is learned from a single template using a non-linear subspace projection. A probabilistic search strategy, based on particle filter, is employed to find object region in each frame of the video sequence that best models the target object in the subspace representation. Particle filter estimates the posterior distribution using weighted samples. Increasing the number of samples increases the estimation accuracy at the cost of increased computations. We, therefore propose a novel kernel subspace integral image framework, which allows the tracker to densely sample the state space without loosing computational efficiency. The algorithm is tested on real world tracking examples to demonstrate the performance.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121550501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Afaf Tareef, Yang Song, Min-Zhao Lee, D. Feng, Mei Chen, Weidong (Tom) Cai
{"title":"Morphological Filtering and Hierarchical Deformation for Partially Overlapping Cell Segmentation","authors":"Afaf Tareef, Yang Song, Min-Zhao Lee, D. Feng, Mei Chen, Weidong (Tom) Cai","doi":"10.1109/DICTA.2015.7371285","DOIUrl":"https://doi.org/10.1109/DICTA.2015.7371285","url":null,"abstract":"Accurate cell segmentation is an important and long-standing challenge in biomedical image analysis due to large variations in shape and boundary ambiguity. In this paper, we present a segmentation framework for partially overlapping cervical cells. The proposed method starts by cellular clump estimation with morphological reconstruction. Subsequently, the nuclei inside the cellular clumps are located by H-maxima transformation and thresholding. The cytoplasm of each detected nucleus is finally delineated with hierarchical deformation based on landmarks and shape dictionaries. The proposed approach is tested on a cervical smear image dataset containing single and partially overlapping cells. The results demonstrate that our approach can achieve more accurate and stable cytoplasmic segmentation, better nuclear segmentation, and lower time complexity, compared to a state-of-the-art approach.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133851899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automated Mass Detection in Mammograms Using Cascaded Deep Learning and Random Forests","authors":"Neeraj Dhungel, G. Carneiro, A. Bradley","doi":"10.1109/DICTA.2015.7371234","DOIUrl":"https://doi.org/10.1109/DICTA.2015.7371234","url":null,"abstract":"Mass detection from mammograms plays a crucial role as a pre- processing stage for mass segmentation and classification. The detection of masses from mammograms is considered to be a challenging problem due to their large variation in shape, size, boundary and texture and also because of their low signal to noise ratio compared to the surrounding breast tissue. In this paper, we present a novel approach for detecting masses in mammograms using a cascade of deep learning and random forest classifiers. The first stage classifier consists of a multi-scale deep belief network that selects suspicious regions to be further processed by a two-level cascade of deep convolutional neural networks. The regions that survive this deep learning analysis are then processed by a two-level cascade of random forest classifiers that use morphological and texture features extracted from regions selected along the cascade. Finally, regions that survive the cascade of random forest classifiers are combined using connected component analysis to produce state-of-the-art results. We also show that the proposed cascade of deep learning and random forest classifiers are effective in the reduction of false positive regions, while maintaining a high true positive detection rate. We tested our mass detection system on two publicly available datasets: DDSM-BCRP and INbreast. The final mass detection produced by our approach achieves the best results on these publicly available datasets with a true positive rate of 0.96 ± 0.03 at 1.2 false positive per image on INbreast and true positive rate of 0.75 at 4.8 false positive per image on DDSM-BCRP.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132925999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}