{"title":"CBRA: Color-based ranking aggregation for person re-identification","authors":"Raphael C. Prates, W. R. Schwartz","doi":"10.1109/ICIP.2015.7351146","DOIUrl":"https://doi.org/10.1109/ICIP.2015.7351146","url":null,"abstract":"The problem of automatically tracking a pedestrian within camera networks with non-overlapping field-of-view, known as person re-identification, is a challenging task with still suboptimal results. Different features have been proposed in the literature, specially colors which achieved the best results when fused in a unique feature representation. Despite being better than considering individually, the fusion still does not explores all the feature discriminative power. Therefore, we propose the use of rank aggregation to improve the results. In this paper, we address the person re-identification problem using a Color-based Ranking Aggregation (CBRA) method, which explores different feature representations to obtain complementary ranking lists and combine them using the Stuart ranking aggregation method. The obtained experimental results demonstrate a great improvement in state-of-the-art, reaching top-1 rank recognition rates of 50.0% and 56.9% in the ViPER and PRTD450S data sets, respectively.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"259 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123099828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fusing well-crafted feature descriptors for efficient fine-grained classification","authors":"Andrea Britto Mottos, R. Feris","doi":"10.1109/ICIP.2014.7026052","DOIUrl":"https://doi.org/10.1109/ICIP.2014.7026052","url":null,"abstract":"As citizen science projects become more popular and engage an increasing number of volunteers, smartphones are turning into commonly used sensors in the biodiversity environment. In this paper, we propose a novel approach for classification of subordinate categories such as plant and insect species that is fast and suitable for use in mobile devices. In particular, we show that a combination of carefully designed features, including a robust shape descriptor to capture fine morphological structures of objects, as well as traditional color and texture features, is essential for obtaining good performance. A novel weighting technique assigns different costs to each feature, taking into account the inter-class and intra-class variation between species. We tested our proposed method in the popular Oxford Flower Dataset and in the Leeds Butterfly Dataset. We are able to achieve state-of-the-art accuracy while proposing an efficient approach that is suitable for mobile applications and can be applied to different species.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127870041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Error resilient video coding using hybrid hierarchical B pictures","authors":"Wen-Jiin Tsai, Wan-Han Liu","doi":"10.1109/ICIP.2012.6467191","DOIUrl":"https://doi.org/10.1109/ICIP.2012.6467191","url":null,"abstract":"In this paper, a hybrid model based on hierarchical B picture structure is proposed to improve error concealment effects when there is a whole-frame loss. The model combines two hierarchical B-picture coding structures such that key-frames, reference B frames, or even non-reference B frames have buddy frames to serve as their data recovery frames when they are lost. With buddy frames, the distance between a lost frame and its recovering frame can be substantially reduced and thus error concealment performance can be improved with little bit-rate redundancy.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128422794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Regularized adaptive classification based on image retrieval for clustered microcalcifications","authors":"Hao Jing, Yongyi Yang","doi":"10.1109/ICIP.2012.6467073","DOIUrl":"https://doi.org/10.1109/ICIP.2012.6467073","url":null,"abstract":"We propose a regularization based approach for efficient, case-adaptive classification in computer-aided diagnosis (CAD) of breast cancer. The goal is to boost the classification accuracy on a query case by making use of a set of similar cases retrieved from an existing library of known cases. In the proposed approach, a regularization scheme in the form a prior derived from an existing baseline classifier is used for the adaptive classifier, which can reduce the extra computational burden associated with adaption of the classifier for a query case. We consider two different forms for the regularization prior. In the experiments the proposed approach is demonstrated on a data set of 1,006 clinical cases. The results show that it could achieve improvements in both numerical efficiency and classification performance.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128492620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Wide baseline stereo object matching using minimal cost flow algorithm","authors":"Shuqing Zeng","doi":"10.1109/ICIP.2012.6467321","DOIUrl":"https://doi.org/10.1109/ICIP.2012.6467321","url":null,"abstract":"Monocular vision-based vehicle detection is a low-cost solution for active safety and driver assistance systems (ASDA). However, the depth estimation deviates its true value when the flat ground assumption does not hold. In this paper, we propose a stereo approach with a large baseline to address the issue without extracting three-dimensional features from disparity map. The proposed system first searches vehicle template among possible discrete rectangle boxes in the image pair. The system detects the presence, and estimates the distance of a vehicle simultaneously. This joint problem of detection and matching can be formulated as a minimal cost flow problem, which can be solved efficiently. The experimental results show that not only we have a redundant monocular vision system, but also the performance of both detection and range estimation is significantly enhanced.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114326594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"View-invariant measure of line correspondence and its application in people localization","authors":"K. Lo, Jen-Hui Chuang","doi":"10.1109/ICIP.2012.6467277","DOIUrl":"https://doi.org/10.1109/ICIP.2012.6467277","url":null,"abstract":"A correspondence measure of 2D line segments in two different views is proposed in this paper. Such a quantitative measure is view-invariant and can handle line segment of arbitrary configuration in the 3D scene. A line-based people localization scheme is proposed by applying such a measure to improve the efficiency of [1]. By verifying whether 2D line samples from different views belong to the same person, computations associated with incorrectly reconstructed 3D line samples of people can be avoided. Experimental results show that people localization results, in terms of correctness and accuracy, comparable to [1] can be obtained with the proposed scheme, but with more than three times in computation speed.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128769056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A sparseland model for deblurring images in the presence of impulse noise","authors":"Haili Zhang, Yunmei Chen","doi":"10.1109/ICIP.2012.6467550","DOIUrl":"https://doi.org/10.1109/ICIP.2012.6467550","url":null,"abstract":"Joint image deblurring and denoising has long been an interesting problem. Traditional deconvolution methods (like the ROF model) only work for Gaussian noise. Median-based approaches are generally concerned with the removal of impulse noise, which are more likely to hamper the deblurring process. In this paper, we propose a spareland model for deblurring images corrupted by impulse noise. The key point is to approximate the probability density function by two different randomly mixed Gaussian distributions. Experimental results are provided at the end of this paper to demonstrate the effectiveness of the proposed method.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124907338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive ternary-derivative pattern for disparity enhancement","authors":"V. D. Nguyen, T. Nguyen, D. Nguyen, J. Jeon","doi":"10.1109/ICIP.2012.6467523","DOIUrl":"https://doi.org/10.1109/ICIP.2012.6467523","url":null,"abstract":"High dynamic range conditions are major obstacles to the implementation of practical stereovision systems in real scenes. We address this problem by introducing an adaptive local ternary-derivative pattern (ALTDP) which is a fusion of the local ternary pattern (LTP) and local derivative pattern (LDP). We make three main contributions in this study: (i) ALTDP encodes more detail information than LDP by extending to eight directions; (ii) ALDTP is better at discriminating and less sensitive to noise in uniform regions with three-value encoding (-1,0,1) without using a pre-defined threshold; and (iii) ALTDP significantly improves the performance of hierarchical belief propagation (BP) by substituting ALTDP data cost for the different intensity data cost. Moreover, our proposed method performs slightly better than LBP and LDP with three datasets: synthetic sequences (set 2) in the EISATS dataset, bright differences sequences (set 5) in the EISATS dataset, and the bumblebee xb3 dataset.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129708652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A novel JSCC scheme for scalable video transmission over MIMO systems","authors":"Xinggong Zhang, Chao Zhou, Zongming Guo","doi":"10.1109/ICIP.2012.6467350","DOIUrl":"https://doi.org/10.1109/ICIP.2012.6467350","url":null,"abstract":"MIMO recently emerges as one of promising techniques for wireless video streaming. It is still a challenge to provide un-equal error protections by joint source-channel coding (JSCC) over multiple diverse MIMO sub-channels. In this paper, a joint source-channel coding and antenna mapping scheme for scalable video transmission over MIMO systems is proposed. Bandwidth are elaborately allocated between video source and channel protections by layer extracting and FEC coding. For the extracted layers, we determine i) which antenna will they be transmitted over and ii) how much redundancy bits will be added for error protections. We formulate this scheme into a non-linear integer optimization problem, whose complexity is very high. Instead, a low-complexity branch-and-bound algorithm is presented. Source layers are partitioned into subsets of layers, and the selected layer are mapped to antennas using Min-max scheduling algorithm. By branching and pruning, the computation complexity are reduced significantly. We carry out extensive numerical experiments under various network conditions. The results demonstrate our algorithm's efficiency and the overall transmission quality is improved significantly.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"276 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121293196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
B. Zou, Meicun Wang, Junping Zhang, Lamei Zhang, Ye Zhang
{"title":"Improving spatial resolution for CHANG'E-1 imagery using ARSIS concept and Pulse Coupled Neural Networks","authors":"B. Zou, Meicun Wang, Junping Zhang, Lamei Zhang, Ye Zhang","doi":"10.1109/ICIP.2012.6467312","DOIUrl":"https://doi.org/10.1109/ICIP.2012.6467312","url":null,"abstract":"To broaden the future application of CHANG'E-1 imagery, including hyperspectral imagery (low spatial resolution of 200m) and CCD imagery (relatively high spatial resolution of 120m), an ARSIS-based method for spatial-spectral fusion is proposed in this paper, which aims at combine high spatial and high spectral resolution. Firstly, ARSIS concept is employed, in which Atrous wavelet is used to describe images at different resolutions for multiresolution analysis. Secondly, Pulse Coupled Neural Network (PCNN) is employed to search and model a relationship between the high frequencies of the images to be fused for missing information. The ARSIS method preserves the spectral content of the original image for its very definition, and Atrous wavelet and PCNN prove to be effective means to implement it on CHANG'E-1 Imagery. The experimental results demonstrate that the visual improvement and spectral fidelity of the proposed method outperform many conventional methods of image fusion.","PeriodicalId":147245,"journal":{"name":"International Conference on Information Photonics","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121065888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}