Electronic Letters on Computer Vision and Image Analysis最新文献_第7页

Understanding eye movements: psychophysics and a model of primary visual cortex 理解眼球运动:心理物理学和初级视觉皮层模型

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2019-07-26 DOI: 10.5565/rev/elcvia.1193

David Berga

{"title":"Understanding eye movements: psychophysics and a model of primary visual cortex","authors":"David Berga","doi":"10.5565/rev/elcvia.1193","DOIUrl":"https://doi.org/10.5565/rev/elcvia.1193","url":null,"abstract":"Humans move their eyes in order to learn visual representations of the world. These eye movements depend on distinct factors, either by the scene that we perceive or by our own decisions. To select what is relevant to attend is part of our survival mechanisms and the way we build reality, as we constantly react both consciously and unconsciously to all the stimuli that is projected into our eyes. In this thesis we try to explain (1) how we move our eyes, (2) how to build machines that understand visual information and deploy eye movements, and (3) how to make these machines understand tasks in order to decide for eye movements. (1) We provided the analysis of eye movement behavior elicited by low-level feature distinctiveness with a dataset of 230 synthetically-generated image patterns. A total of 15 types of stimuli has been generated (e.g. orientation, brightness, color, size, etc.), with 7 feature contrasts for each feature category. Eye-tracking data was collected from 34 participants during the viewing of the dataset, using Free-Viewing and Visual Search task instructions. Results showed that saliency is predominantly and distinctively influenced by: 1. feature type, 2. feature contrast, 3. temporality of fixations, 4. task difficulty and 5. center bias. From such dataset (SID4VAM), we have computed a benchmark of saliency models by testing performance using psychophysical patterns. Model performance has been evaluated considering model inspiration and consistency with human psychophysics. Our study reveals that state-of-the-art Deep Learning saliency models do not perform well with synthetic pattern images, instead, models with Spectral/Fourier inspiration outperform others in saliency metrics and are more consistent with human psychophysical experimentation. (2) Computations in the primary visual cortex (area V1 or striate cortex) have long been hypothesized to be responsible, among several visual processing mechanisms, of bottom-up visual attention (also named saliency). In order to validate this hypothesis, images from eye tracking datasets have been processed with a biologically plausible model of V1 (named Neurodynamic Saliency Wavelet Model or NSWAM). Following Li's neurodynamic model, we define V1's lateral connections with a network of firing rate neurons, sensitive to visual features such as brightness, color, orientation and scale. Early subcortical processes (i.e. retinal and thalamic) are functionally simulated. The resulting saliency maps are generated from the model output, representing the neuronal activity of V1 projections towards brain areas involved in eye movement control. We want to pinpoint that our unified computational architecture is able to reproduce several visual processes (i.e. brightness, chromatic induction and visual discomfort) without applying any type of training or optimization and keeping the same parametrization. The model has been extended (NSWAM-CM) with an implementation of the cortical magnificatio","PeriodicalId":38711,"journal":{"name":"Electronic Letters on Computer Vision and Image Analysis","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42022517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Image decomposition using a second-order variational model and wavelet shrinkage 基于二阶变分模型和小波收缩的图像分解

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2019-07-18 DOI: 10.5565/REV/ELCVIA.1162

Minh-Phuong Tran

{"title":"Image decomposition using a second-order variational model and wavelet shrinkage","authors":"Minh-Phuong Tran","doi":"10.5565/REV/ELCVIA.1162","DOIUrl":"https://doi.org/10.5565/REV/ELCVIA.1162","url":null,"abstract":"The paper is devoted to the new model for image decomposition, that splits an image $f$ into three components $u+v+omega$ , with $u$ a piecewise -smooth or the ``cartoon'' component, $v$ a texture component and $omega$ the noise part in variational approach. This decomposition model is in fact incorporates the advantages of two preceding models: the second-order total variation minimization of Rudin - Osher - Fatemi ( ROF2 ), and wavelet shrinkage for oscillatory functions. This decomposition model is presented as an extension of the three components decomposition algorithm of Aujol et al. in cite {JAC}. It also continues the idea introduced previously by authors in cite {TPB}, for two components decomposition model. The ROF2 model was first proposed by Bergounioux et al. in cite {BP}, it is an improved regularization method to overcome the undesirable staircasing effect. The wavelet shrinkage is well combined to separate the oscillating part due to texture from that due to noise. Experimental results validate the proposed algorithm and demonstrate that the image decomposition model presents effective and comparable performance to other state-of-the-art models.","PeriodicalId":38711,"journal":{"name":"Electronic Letters on Computer Vision and Image Analysis","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49638950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Probability-Possibility Theories Based Iris Biometric Recognition System 基于概率-可能性理论的虹膜生物识别系统

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2019-07-02 DOI: 10.5565/REV/ELCVIA.1132

Bellaaj Majd

{"title":"Probability-Possibility Theories Based Iris Biometric Recognition System","authors":"Bellaaj Majd","doi":"10.5565/REV/ELCVIA.1132","DOIUrl":"https://doi.org/10.5565/REV/ELCVIA.1132","url":null,"abstract":"The performance and robustness of the iris-based recognition systems still suffer from imperfection in the biometric information. This paper makes an attempt to address these imperfections and deals with important problem for real system. We proposed a new method for iris recognition system based on uncertainty theories to treat imperfection iris feature. Several factors cause different types of degradation in iris data such as the poor quality of the acquired pictures, the partial occlusion of the iris region due to light spots, or lenses, eyeglasses, hair or eyelids, and adverse illumination and/or contrast. All of these factors are open problems in the field of iris recognition and affect the performance of iris segmentation, its feature extraction or decision making process, and appear as imperfections in the extracted iris feature. The aim of our experiments is to model the variability and ambiguity in the iris data with the uncertainty theories. This paper illustrates the importance of the use of this theory for modeling or/and treating encountered imperfections. Several comparative experiments are conducted on two subsets of the CASIA-V4 iris image database namely Interval and Synthetic. Compared to a typical iris recognition system relying on the uncertainty theories , experimental results show that our proposed model improves the iris recognition system in terms of Equal Error Rates (EER), Area Under the receiver operating characteristics Curve (AUC) and Accuracy Recognition Rate (ARR) statistics .","PeriodicalId":38711,"journal":{"name":"Electronic Letters on Computer Vision and Image Analysis","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41627232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Automatic Date Fruit Recognition Using Outlier Detection Techniques and Gaussian Mixture Models 基于异常值检测技术和高斯混合模型的椰枣果实自动识别

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2019-06-27 DOI: 10.5565/REV/ELCVIA.1041

Oussama Aiadi, M. L. Kherfi, Belal Khaldi

{"title":"Automatic Date Fruit Recognition Using Outlier Detection Techniques and Gaussian Mixture Models","authors":"Oussama Aiadi, M. L. Kherfi, Belal Khaldi","doi":"10.5565/REV/ELCVIA.1041","DOIUrl":"https://doi.org/10.5565/REV/ELCVIA.1041","url":null,"abstract":"In this paper, we propose a method for automatically recognizing different date varieties. The presence of outlier samples could significantly degrade the recognition outcomes. Therefore, we separately prune samples of each variety from outliers using the Pruning Local Distance-based Outlier Factor (PLDOF) method. Samples of the same variety could have several visual appearances because of the noticeable variation in terms of their visual characteristics. Thus, in order to take this intra-variation into account, we model each variety with a Gaussian Mixture Model (GMM), where each component within the GMM corresponds to one visual appearance. Expectation-Maximization (EM) algorithm was used for parameters estimation and Davies-Bouldin index was used to automatically and precisely estimate the number of components (i.e., appearances). Compared to the related studies, the proposed method 1) is capable to recognize samples though the noticeable variation, in terms of maturity stage and hardness degree, within some varieties; 2) achieves a high recognition rate in spite of the presence of outlier samples; 3) is capable to distinguish between the highly confusing varieties; 4) is fully automatic, as it does not require neither physical measurements nor human assistance. For testing purposes, we introduce a new benchmark which includes the highest number of varieties (11) compared to the previous studies. Experiments show that our method has significantly outperformed several methods, where a high recognition rate of 97.8% has been reached.","PeriodicalId":38711,"journal":{"name":"Electronic Letters on Computer Vision and Image Analysis","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44247217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

An Adaptive Non-linear Statistical Salt-and-Pepper Noise Removal Algorithm using Interquartile Range 一种基于四分位间距的自适应非线性统计椒盐噪声去除算法

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2019-06-27 DOI: 10.5565/REV/ELCVIA.1145

A. Halder

引用次数: 3

A New feature extraction method to Improve Emotion Detection Using EEG Signals 一种改进脑电信号情感检测的特征提取新方法

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2018-11-07 DOI: 10.5565/REV/ELCVIA.1045

H. Zamanian, H. Farsi

引用次数: 28

Efficient Iris and Eyelids Detection from Facial Sketch Images 基于人脸素描图像的虹膜和眼睑高效检测

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2018-11-07 DOI: 10.5565/REV/ELCVIA.1044

Tan Boonchuan, S. Setumin, Abduljalil Radman, S. A. Suandi

引用次数: 4

Depth Data Error Modeling of the ZED 3D Vision Sensor from Stereolabs 立体实验室ZED三维视觉传感器深度数据误差建模

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2018-06-19 DOI: 10.5565/REV/ELCVIA.1084

Luis E. Ortiz, Elizabeth V. Cabrera, L. Gonçalves

{"title":"Depth Data Error Modeling of the ZED 3D Vision Sensor from Stereolabs","authors":"Luis E. Ortiz, Elizabeth V. Cabrera, L. Gonçalves","doi":"10.5565/REV/ELCVIA.1084","DOIUrl":"https://doi.org/10.5565/REV/ELCVIA.1084","url":null,"abstract":"The ZED camera is binocular vision system that can be used to provide a 3D perception of the world. It can be applied in autonomous robot navigation, virtual reality, tracking, motion analysis and so on. This paper proposes a mathematical error model for depth data estimated by the ZED camera with its several resolutions of operation. For doing that, the ZED is attached to a Nvidia Jetson TK1 board providing an embedded system that is used for processing raw data acquired by ZED from a 3D checkerboard. Corners are extracted from the checkerboard using RGB data, and a 3D reconstruction is done for these points using disparity data calculated from the ZED camera, coming up with a partially ordered, and regularly distributed (in 3D space) point cloud of corners with given coordinates, which are computed by the device software. These corners also have their ideal world (3D) positions known with respect to the coordinate frame origin that is empirically set in the pattern. Both given (computed) coordinates from the camera’s data and known (ideal) coordinates of a corner can, thus, be compared for estimating the error between the given and ideal point locations of the detected corner cloud. Subsequently, using a curve fitting technique, we obtain the equations that model the RMS (Root Mean Square) error. This procedure is repeated for several resolutions of the ZED sensor, and at several distances. Results showed its best effectiveness with a maximum distance of approximately sixteen meters, in real time, which allows its use in robotic or other online applications.","PeriodicalId":38711,"journal":{"name":"Electronic Letters on Computer Vision and Image Analysis","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47475185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 65

Contributions to the Problem of Fight Detection in Video 对视频中战斗检测问题的贡献

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2018-03-12 DOI: 10.5565/REV/ELCVIA.1135

Ismael Serrano-Gracia

引用次数: 0

Recognition and retrieval of objects in diverse applications 识别和检索对象在不同的应用

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2018-03-05 DOI: 10.5565/REV/ELCVIA.1130

Laura Fernández-Robles

引用次数: 0