{"title":"A closed-form expression for thin lens image irradiance","authors":"Robert D. Friedlander, A. Yezzi","doi":"10.1109/IPTA.2017.8310133","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310133","url":null,"abstract":"Computer vision tasks often have the goal of inferring geometric and radiometric information about a 3D environment given limited sensing resources. It is helpful to develop relationships between these real-world properties and the actual measurements that are taken. To this end we propose a new relationship between object radiance and image irradiance based on power conservation and a thin lens imaging model. The relationship has a closed-form solution for in-focus points and can be solved via numerical integration for points that are not focused. It can be thought of as a generalization of Horn's irradiance equation. Through both numerical simulations and comparison with the intensity values of actual images, our equation is shown to provide better accuracy than Horn's equation. Improvement is most notable for near-focused images where the pinhole imaging model implicit in Horn's derivation breaks down. Outside of this regime, our model validates the use of Horn's approximation.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127291796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Brain tissue classification of alzheimer disease using partial volume possibilistic modeling: Application to ADNI phantom images","authors":"L. Lazli, M. Boukadoum, O. Mohamed","doi":"10.1109/IPTA.2017.8310095","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310095","url":null,"abstract":"This paper describes an automatic segmentation approach for PET and T1-weighted MR images using a possibilistic clustering algorithm for deriving fuzzy tissue maps of white matter, gray matter and cerebrospinal fluid volumes, and using the fuzzy C-means algorithm for the centers initialization process; this hybrid technique allows to compute the degree of membership of each voxel to different brain tissues. The fuzzy process is illustrated for Alzheimer's disease using phantom images from the Alzheimer's Disease Neuroimaging Initiative (ADNI). Our method, inspired from the conventional possibilistic algorithm, is less sensitive to noise while taking into consideration the effect of partial volume.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114779008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhe Xin, Xiaoguang Cui, Jixiang Zhang, Yiping Yang, Yanqing Wang
{"title":"Visual place recognition with CNNs: From global to partial","authors":"Zhe Xin, Xiaoguang Cui, Jixiang Zhang, Yiping Yang, Yanqing Wang","doi":"10.1109/IPTA.2017.8310121","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310121","url":null,"abstract":"Visual place recognition is one of the most challenging problems in computer vision, due to the large diversities that real-world places can represent. Recently, visual place recognition has become a key part of loop closure detection and topological localization in long-term mobile robot autonomy. In this work, we build up a novel visual place recognition pipeline composed of a first filtering stage followed by a partial reranking process. In the filtering stage, image-wise features are utilized to find a small set of potential places. Afterwards, stable region-wise landmarks are extracted for more accurate matching in the partial reranking process. All global and partial image representations are derived from pre-trained Convolutional Neural Networks (CNNs), and the landmarks are extracted by object proposal techniques. Moreover, a new similarity measurement is provided by considering both spatial and scale distribution of landmarks. Compared with current methods only considering scale distribution, the presented similarity measurement can benefit recognition precision and robustness effectively. Experiments with varied viewpoints and environmental conditions demonstrate that the proposed method achieves superior performance against state-of-the-art methods.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117269388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
João M. Santos, P. Assunção, L. Cruz, Luis M. N. Tavora, R. Fonseca-Pinto, S. Faria
{"title":"Lossless light-field compression using reversible colour transformations","authors":"João M. Santos, P. Assunção, L. Cruz, Luis M. N. Tavora, R. Fonseca-Pinto, S. Faria","doi":"10.1109/IPTA.2017.8310154","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310154","url":null,"abstract":"Recent advances in Light Field acquisition and rendering are pushing research efforts towards increasingly efficient methods to encode this particular type of data. Light Field image compression is of the utmost importance, not only due to the large amount of data required for its representation but also due to quality requirements of many applications and computational photography methods. This paper presents a research study about the impact of reversible colour transformations and alternative data arrangements in Light Field lossless coding. The experimental results indicate that the RCT reversible transform consistently achieves the highest compression performance across all data arrangements and lossless encoders. In particular, the best results are obtained with MRP when encoding the stack of sub-aperture images using a spiral scan order, achieving 6.41 bpp, on average.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"263 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133727668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"No-reference image quality assessment using Gabor-based smoothness and latent noise estimation","authors":"Vineet Kumar, R. Chouhan","doi":"10.1109/IPTA.2017.8310104","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310104","url":null,"abstract":"No-reference image quality assessment is a challenging task due to the absence of a reference image in practical situations to quantify image quality. This paper proposes a new no-reference image quality metric for natural images using latent noise estimation, Gabor response, and contrast deviation. The algorithm employs an extension of gradient-based SSIM into the no-reference application using SVD-based AWGN estimation, and defines attributes such as Gabor-based smoothness and contrast deviation. The proposed metric arrives at an overall quality score by computing a linear weighted summation of the three image attributes. The proposed algorithm has been tested on several public databases (i.e. LIVE, TID 2013 and CSIQ), and the overall results display a noteworthy correlation of nearly 80% with the human visual system.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115966967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A multiresolution DCT-based blind blur quality measure","authors":"F. Kerouh, D. Ziou, A. Serir","doi":"10.1109/IPTA.2017.8310086","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310086","url":null,"abstract":"The paper deals with assessing blur amount in images. Blur is a common artefact that attenuates the high frequency components of an image. The main idea turns on analysing the frequency response at transitions through resolutions. To achieve that, the histogram of the multiresolution DCT coefficients is modelled by using an exponential probability density function (pdf). The steepness of the pdf is used as a cue to characterize the blur effect. Faithful scores are obtained while testing the proposed approach on five image collections. The proposed measure is validated on the JPEG2000 lossy compression algorithm and the Lucy-Richardson iterative deblurring approach.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130929132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multispectral single-sensor RGB-NIR imaging: New challenges and opportunities","authors":"Xavier Soria Poma, A. Sappa, A. Akbarinia","doi":"10.1109/IPTA.2017.8310105","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310105","url":null,"abstract":"Multispectral images captured with a single sensor camera have become an attractive alternative for numerous computer vision applications. However, in order to fully exploit their potentials, the color restoration problem (RGB representation) should be addressed. This problem is more evident in outdoor scenarios containing vegetation, living beings, or specular materials. The problem of color distortion emerges from the sensitivity of sensors due to the overlap of visible and near infrared spectral bands. This paper empirically evaluates the variability of the near infrared (NIR) information with respect to the changes of light throughout the day. A tiny neural network is proposed to restore the RGB color representation from the given RGBN (Red, Green, Blue, NIR) images. In order to evaluate the proposed algorithm, different experiments on a RGBN outdoor dataset are conducted, which include various challenging cases. The obtained result shows the challenge and the importance of addressing color restoration in single sensor multispectral images.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116053380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"High performance and fast object detection in road environments","authors":"M. Kang, Y. Lim","doi":"10.1109/IPTA.2017.8310148","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310148","url":null,"abstract":"In this paper, we present a high performance and fast object detection method based on a fully convolutional network (FCN) for advanced driver assistance systems (ADAS). Object detection methods based on deep learning have high performance but they require high computational complexity. Even if a method works on the high-performance graphics processing unit (GPU) hardware platform, it is hard to guarantee real-time processing. General object detectors based on deep learning try to localize too many classes of objects in various dynamic environments. The proposed detection method based on FCN improves detection performance and maintains real-time processing in road environments through various schemes related to the limitation of object class type, data augmentation, network architecture, and multi-ratio default boxes. Our experimental results show that the proposed method outperforms a previous method both in terms of performance and speed.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"176 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121273437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mohamed Cheniti, Z. Akhtar, N. Boukezzoula, T. Falk
{"title":"Combining left and right wrist vein images for personal verification","authors":"Mohamed Cheniti, Z. Akhtar, N. Boukezzoula, T. Falk","doi":"10.1109/IPTA.2017.8310109","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310109","url":null,"abstract":"Multibiometric systems that fuse information from different sources are able to alleviate limitations of the unimodal biometric systems. In this paper, we propose a multibiometric framework to identify people using their left and right wrist vein patterns. The framework uses a fast and robust preprocessing and feature extraction method. A generic score level fusion approach is proposed to integrate the scores from left and right wrist vein patterns using Dubois and Parad triangular-norm (t-norm). Experiments on the publicly available PUT wrist vein dataset show that the proposed multibiometric framework outperforms the unimodal systems, their fusion using other t-norms techniques, and existing wrist vein recognition methods.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115174589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A new change detector in heterogeneous remote sensing imagery","authors":"Redha Touati, M. Mignotte, M. Dahmane","doi":"10.1109/IPTA.2017.8310138","DOIUrl":"https://doi.org/10.1109/IPTA.2017.8310138","url":null,"abstract":"Multimodal change detection in satellite images is a challenging and complex problem mainly because the local statistics of the images to be compared can be very different. In this paper, we present a novel, reliable and simple change detection operator which is first based on a imaging modality-invariant operator that detects the common specific high-frequency pattern of each structural region existing in the two heterogeneous satellite images. The resultant similarity map is then filtered out by a superpixel-based spatially adaptive filter which increases its reliability against noise. Second, in order to achieve more robustness, changes are then identified, from this similarity map, by combining the results of different automatic thresholding algorithms with a weighted spatially regularized multi-criteria decision analysis. Experimental results involving a mixture of different types of imaging modalities confirm the robustness of the proposed approach.","PeriodicalId":316356,"journal":{"name":"2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127431995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}