{"title":"Reduced complexity architecture for integral image generation","authors":"M. A. Khorsandi, N. Karimi","doi":"10.1109/IRANIANMVIP.2015.7397509","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2015.7397509","url":null,"abstract":"Integral image plays an important role in AdaBoost algorithm which uses Haar-like features. The calculation of integral image needs many accesses to memory and most of the required addresses are not sequential. In addition, its calculation is compute-intensive. In this paper we propose an approach for generating integral image to cope with none-sequential addresses by affine transforming input image and using a pipeline architecture to compute results in an improved way. This approach needs the lowest clock pulses for integral image generation and in addition, its architecture is improved and less complex.","PeriodicalId":326511,"journal":{"name":"2015 9th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125302139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"3D reconstruction of non-rigid surfaces from realistic monocular video","authors":"M. Sepehrinour, S. Kasaei","doi":"10.1109/IRANIANMVIP.2015.7397536","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2015.7397536","url":null,"abstract":"A novel algorithm for recovering the 3D shape of deformable objects purely from realistic monocular video is presented in this paper. Unlike traditional non-rigid structure from motion (NRSfM) methods, which have been studied only on synthetic datasets and controlled lab environments that needs some prior constraints (such as manually segmented objects, limited rotations and occlusions, or full-length trajectories), the proposed method has been described and tested on realistic video sequences, which have been downloaded from some social networks (such as Facebook and Twitter). In order to apply NRSfM to the realistic video sequences, because of no-prior information about the scene and camera parameters, one should employ different methods that can handle a huge amount of unknown parameters (such as 3D shape and camera parameters) and deal with some other ambiguities such as incomplete segmentation and Tracking. In this paper, this goal is concerned by first proposing a novel method for completing the missing trajectories (as the most important challenge in realistic videos due to occlusions and lighting changes) and then applying a method that formulates the NRSfM as an energy minimization problem. The proposed method is evaluated on popular video segmentation datasets and its performance is compared to other available methods.","PeriodicalId":326511,"journal":{"name":"2015 9th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115541465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reduced-Reference image quality assessment based on 2-D discrete FFT and Edge Similarity","authors":"Majid Khorrami, Zhila Azimzadeh, S. Nabipour","doi":"10.1109/IRANIANMVIP.2015.7397496","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2015.7397496","url":null,"abstract":"Reduced-Reference (RR) image quality measures aim to predict the perceptual quality of distorted image using only partial information about the original image. In this paper, an effective Reduced-Reference image quality assessment algorithm based on FFT transform and Edge Similarity is introduced. The main design principle of the proposed method is choice of the best blocks of Image. After dividing the source images into blocks of 16×16 pixels, calculating the FFT Transform for each block, the FFT Transform gives best blocks of image. Next, the important features blocks of the image were recognized by Edge and the same actions were done on the image of distortions and finally, the similarity of both images was calculated. The experimental results on LIVE and CSIQ databases show that our RR proposed metric correlates well with the subjective quality scores, also in comparison with commonly used full-reference metric and with a state-of-the-art reduced reference.","PeriodicalId":326511,"journal":{"name":"2015 9th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122887602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An efficient run-based method for connected component labeling","authors":"M. M. Gharasuie, Aboozar Gaffari","doi":"10.1109/IRANIANMVIP.2015.7397514","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2015.7397514","url":null,"abstract":"This paper presents a new run-based algorithm for labeling connected components in a binary image. The algorithm removes assumption on all border pixels of image are background. Also it does not use merging operation for resolving label equivalences among provisional labels (sets), but it uses post processing stage. The post processing stage reduces complexity for resolving label equivalency. During the first scan, provisional labels are assigned to the connected components. After the first scan, the post processing is done to resolve label equivalency. The smallest provisional label among all provisional labels that are assigned to a connected component is considered as a representative label. During the second scan, the algorithm accesses to each foreground pixel directly and sets its representative label. Experimental results on various types of images demonstrate that the proposed algorithm is superior to conventional labeling algorithms.","PeriodicalId":326511,"journal":{"name":"2015 9th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126406201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Beyond bag-of-words: An improved Sparse Topical Coding for learning motion patterns in traffic scenes","authors":"P. Ahmadi, M. Tabandeh, I. Gholampour","doi":"10.1109/IRANIANMVIP.2015.7397491","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2015.7397491","url":null,"abstract":"Analyzing motion patterns in traffic videos can directly generate some high-level descriptions of the video content which can be further employed in rule mining and abnormal event detection. The most recent and successful unsupervised approaches for complex traffic scene analysis are based on topic models. However, most existing topic models share some key characteristics which could limit their utility. In this paper, based on extracted optical flow features from video clips, we employ Sparse Topical Coding (STC) framework to automatically discover typical motion patterns in traffic scenes. For this purpose, we improve the STC to overcome one of the drawbacks of topic models with the aim of learning the semantic traffic motion patterns. We go beyond the usual word-document paradigm in topic models by taking into account the order of optical flow words during learning. Experimental results show that our proposed method can learn better motion patterns to analyse the traffic video scenes.","PeriodicalId":326511,"journal":{"name":"2015 9th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128899638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Performance analysis of processing load distribution in camera networks for multi-target tracking","authors":"Mehdi Jafarizadeh, A. Zakerolhosseini","doi":"10.1109/IRANIANMVIP.2015.7397544","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2015.7397544","url":null,"abstract":"Camera networks have been widely used in surveillance and environmental monitoring in recent years. Among different topologies, there is a tendency to distributed topologies due to their efficiency in resource usage. But the comparatively low processing capability of individual smart cameras is an obstacle to run time-consuming algorithms like multi-target tracking on them. One suggestion is to share the extra processing load of busy nodes through the others, by taking into account the networking delays and different application requirements. In this work, we define and formulate the optimality, and analyze the optimum sharing amount of overloads for different conditions using queuing theory. Experimental results confirm that the analysis are valid and satisfy pre-defined requirements.","PeriodicalId":326511,"journal":{"name":"2015 9th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125863890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Super resolution on the centroid reference grid","authors":"H. Rezayi, S. Seyedin","doi":"10.1109/IRANIANMVIP.2015.7397541","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2015.7397541","url":null,"abstract":"In the multiframe image super-resolution (SR) methods a sequence of low-resolution (LR) images is used to create a high-resolution (HR) image. In all aforegoing methods, all of the LR images were registered with respect to one of existing images (called reference image) and their samples were mapped on an up-scaled grid of this reference image (called reference grid (RG)). However, the use of this RG may cause the samples appears with a highly non-uniform distribution. Additionally, final precision and quality may be dependent on the selection of reference image. In this paper a new RG which reduces the non-uniformity of sample distribution is proposed. This RG is produced by incorporating all of the images; therefore the final result is not dependent on the selection of reference image. We also propose a method to find this RG (will be referred as centroid RG). Moreover, this RG is unique.","PeriodicalId":326511,"journal":{"name":"2015 9th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126128262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Correlation estimation between nitrogen and bean plant colors","authors":"Tayebeh Valiollahi, A. Shahbahrami, M. Zavareh","doi":"10.1109/IRANIANMVIP.2015.7397522","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2015.7397522","url":null,"abstract":"Processing digital images has a lot of applications in different sciences such as medicine, industry and agriculture. One of the uses of digital images is in agriculture industry, for instance digital images could be used in providing the nitrogen of the plant. This research aims to estimate the correlation between the amount of nitrogen in bean plants and its color parameters. For this goal an algorithm is proposed in this paper. First beans images are provided and some preprocessing operations such as resizing, noise removing are performed. Second, RGB color space is converted to HSV color space. Finally correlation between plant color and nitrogen is estimated using regression equation. Implementation results show that there is high, strong, and positive correlation between the color features and the amount of nitrogen in the tissue of the bean. Among RGB, green color has the highest correlation with nitrogen compared to other colors, it is about 0.62. In addition, we consider the combination of three colors for our estimations. Our study show that (G/G+R) has the highest correlations with nitrogen in comparison to other equations and results. It is about 0.81.","PeriodicalId":326511,"journal":{"name":"2015 9th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126138584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A decision fusion framework for high-resolution remote-sensing image classification","authors":"Ali Jafari, Mostafa Heidarpour","doi":"10.1109/IRANIANMVIP.2015.7397540","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2015.7397540","url":null,"abstract":"Classification of high-resolution remote-sensing images is a challenging research area. In this paper we proposed a novel decision fusion framework to combine bag of features (BOF) based classifiers. The proposed framework, can also be used in multi category image classification applications. A single voting algorithm is used for decision fusion and an ambiguity detection module is used to determine the ambiguous situations. An ambiguous situation will occur during multi-category voting, where more than one class got the maximum votes, and also when the number of the same votes doesn't exceeds a desired threshold. To resolve this situation we proposed to use the earth mover's distance (EMD) which is a metric for histogram matching. Indeed, we used the EMD to compare the BOF based histogram of images with the centroid classes. Finally, to evaluate the proposed framework, we used a multi-category remote-sensing image dataset and compared the proposed approach with several other similar approaches with BOF based classifiers. The experimental results demonstrate the effectiveness of the proposed framework.","PeriodicalId":326511,"journal":{"name":"2015 9th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125030079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Iterative semi-supervised learning approach for color image segmentation","authors":"M. Jafari, S. Samavi","doi":"10.1109/IRANIANMVIP.2015.7397508","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2015.7397508","url":null,"abstract":"Image segmentation is an important step in many image processing techniques. In this paper, a new semi-supervised approach for color image segmentation is proposed. This method takes advantage of a limited human assistant. After an unsupervised segmentation stage, classes of some regions are questioned from the user. These user hints are used as an initial sample data and will be iteratively expanded based on the existing relevancy between adjacent pixels. This relevancy is measured by probabilities calculated by a classifier which has learned the existing samples prior to that iteration. The learner is a multinomial logistic regression (MLR) classifier. The extended seed is used for training of a support vector machine (SVM) classifier in order to perform the final segmentation. The result of this segmentation fulfills the intention of the user and extracts the targeted classes. Experimental results show that our proposed method makes a noticeable improvement in the accuracy with respect to comparable algorithms.","PeriodicalId":326511,"journal":{"name":"2015 9th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131343733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}