{"title":"Low Effectiveness of Non-Geometric-Operation Data Augmentations for Lesion Segmentation with Fully Convolution Networks","authors":"Yuming Qiu, Xiaolin Qin, Ju Zhang","doi":"10.1109/ICIVC.2018.8492891","DOIUrl":"https://doi.org/10.1109/ICIVC.2018.8492891","url":null,"abstract":"Data augmentation is a prevalent strategy to enlarge the training data in order to enhance the generalization of the model of deep convolutional neural networks (DCNNs). However, not all of augmentation schemes are always effective for all types of DCNNs models, especially for fully convolutional networks (FCNs) which greatly improved semantic segmentation by employing a skip architecture that fuses the feature hierarchy to combine deep, coarse, semantic information and shallow, fine, appearance information. In order to make the effectiveness of data augmentation clear, in this work, we propose to divide the augmentation schemes into two groups, geometric operations and non-geometric operations. Through analyzing the performance of them for lesion segmentation with FCNs, it is found that non-geometric-operation data augmentations are less effective in two dermoscopy datasets. Moreover, we further theoretically revealed that the skip architecture in FCNs is the main reason behind this finding. This work is of value on guiding the practice of data augmentation while using FCNs, and enlightening significance for analyzing other skip architecture deep neural networks.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124040908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Synthesizing Plausible Videos by Introducing Favorable Object Pose Using Trajectory Matching","authors":"M. Seifelnasr, A. Ismail, Hongxing Guo","doi":"10.1109/ICIVC.2018.8492774","DOIUrl":"https://doi.org/10.1109/ICIVC.2018.8492774","url":null,"abstract":"Video is a mean of keeping our memories in a chronological way. Due to the proliferation of handheld devices embedded with high-resolution camera, people became fond of filming themselves everywhere. Yet, not always, people are in their best appearance and mood. Sometimes, they come across scenery sites and they want to record this moment. However, due to the lack of filming experience and the sudden of the event, the output video is below the expectation. In this paper, we propose a workflow to produce a plausible video of the people. The workflow introduced the appearance and behavior depicted in another clip of the user into the recorded outing scenery clip. The workflow considered the variance in both the size and the trajectory of the user during a fixed-camera video. The validity of the workflow is demonstrated by conducting two experiments where the input videos are recorded with different cameras in different circumstances to produce a plausible video. The output video is composed of the background from the scenery video and the foreground object and behavior from the raw data video.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126247170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Methods for Long-Distance Crack Location and Detection of Concrete Bridge Structures","authors":"Youfa Cai, Xing Fu, Y. Shang, Jingxin Shi","doi":"10.1109/ICIVC.2018.8492764","DOIUrl":"https://doi.org/10.1109/ICIVC.2018.8492764","url":null,"abstract":"In order to improve the efficiency of crack detection of concrete bridge structures, a new method based on computer vision technology and coordinate mapping is proposed. In this research, this crack measurement system is integrated mainly with a high magnification image acquisition system, a two-dimensional electric cradle head device and a laser ranging system. It has a set of observing coordinate system. Firstly, the marking points' image coordinates are mapped to the observation coordinates. Secondly, according to the marking points' observation coordinates, the measured crack's coordinates are mapped to a same world coordinates so as to realize the spatial location of the measured cracks regardless of different test cycles or instrument's setup positions, which is a great convenience for the review detection of surface cracks of concrete bridge structures. The experiments show that this method is efficient and convenient. It can automatically locate the measured cracks within 16 s, and the deviation is not more than ± 0.07 °. At a distance of 100 m, the measurement accuracy of crack width is better than ± 0.12 mm.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126210557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jiye Qian, Jin Fu, Bin Fang, Xin Zhou, Hengjun Zhao, Jide Qian, Bangfei Deng, Haibing Zhang, Weiwen Zhang
{"title":"Touching Objects Count with Progressive Erosion","authors":"Jiye Qian, Jin Fu, Bin Fang, Xin Zhou, Hengjun Zhao, Jide Qian, Bangfei Deng, Haibing Zhang, Weiwen Zhang","doi":"10.1109/ICIVC.2018.8492742","DOIUrl":"https://doi.org/10.1109/ICIVC.2018.8492742","url":null,"abstract":"Counting touching objects is of great importance to accurately count all objects in images. Morphology erosion is a simple, effective and efficient operation, commonly used in binary image analysis. The progressive erosion with changing image can accurately count touching objects by separating touching objects into isolated ones. This work also supports that the distance transform for counting touching objects can be replaced by a certain progressive erosion. With the help of counting strategies, the proposed method, which counts touching objects based on concave points, can be applied to object counting in many situations. Experimental results indicate that the proposed algorithm is both effective and efficient.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132357387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive Blind Correction of TIADC Mismatch Based on Cyclic Autocorrelation","authors":"Maowei Yin, Z. Ye","doi":"10.1109/ICIVC.2018.8492905","DOIUrl":"https://doi.org/10.1109/ICIVC.2018.8492905","url":null,"abstract":"A blind and adaptive method is proposed for time-interleaved analog-to-digital converter (TIADC) online mismatch correction. Firstly, a TIADC model is derived by transforming mismatches to FIR filters with different amplitudes and lags. Subsequently, residual error measure of corrected output for wide sense stationary input is built following cyclic autocorrelation function. Mismatches estimator could be obtained by minimizing the measure, in consequence, gain and timing mismatches correction could be achieved accurately via adjust parameterized filter adaptively. It is experimentally verified that the proposed method can enhance spurious-free dynamic range of TIADC output by 27~32dB. The proposed method can be used for online correction of gain and timing mismatch of TIADC with zero-mean wide stationary input.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"36 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134226207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Simulations of Ground Clutter Filter Design and Velocity Dealiasing of Dual-PRF Signals for X-Band Weather Radar","authors":"Xiao Liang, Tao Wang","doi":"10.1109/ICIVC.2018.8492775","DOIUrl":"https://doi.org/10.1109/ICIVC.2018.8492775","url":null,"abstract":"The use of nonuniform sampled sequence effectively extends the unambiguous range and velocity of the Doppler weather radar but makes the ground clutter mitigation problem more challenging. This paper presents a clutter filter method applied to dual pulse repetition frequency (i.e., dual-PRF) scheme with a finite impulse response (FIR) filter bank designed under the minimum mean-square error criterion. This adaptive filtering method comprehensively considers the magnitude and phase characteristics in the frequency domain and yields desired autocorrelation estimation. Simulation of a clutter-contaminated weather signal model for X-band weather radar proves that, although the filter bank is not generated in real time, it can provide effective clutter suppression in almost all cases we expect and has a good performance of meteorological variables estimation.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130823368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Generalizing Integer Projected Graph Matching Algorithm for Outlier Problem","authors":"Lei He, Xu Yang, Zhiyong Liu","doi":"10.1109/ICIVC.2018.8492896","DOIUrl":"https://doi.org/10.1109/ICIVC.2018.8492896","url":null,"abstract":"Graph matching plays an important role in computer vision and pattern recognition. Recent graph matching algorithms usually formulate graph matching by a discrete optimization problem, and have designed various types of optimization techniques to find a local optimum in reasonable time. Among them some algorithms utilizing the graduated projection to the discrete domain exhibit superior performance, but these algorithms are limited to specific applications. From the outlier perspective, they are applicable to subgraph matching in which outliers exist in at most one graph. However, in real tasks there are usually outliers in both graphs. Previously we have proposed a method directly targeting at finding the most similar subgraphs in two weighted graphs. In this paper we show that the idea can be generalized to other algorithms, and the IPFP is chosen as a representative algorithm for generalization. Experiments witness the effectiveness of the generalization.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132566646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shaukat Hayat, She Kun, Zuo Tengtao, Yue Yu, Tianyi Tu, Yan-ping Du
{"title":"A Deep Learning Framework Using Convolutional Neural Network for Multi-Class Object Recognition","authors":"Shaukat Hayat, She Kun, Zuo Tengtao, Yue Yu, Tianyi Tu, Yan-ping Du","doi":"10.1109/ICIVC.2018.8492777","DOIUrl":"https://doi.org/10.1109/ICIVC.2018.8492777","url":null,"abstract":"Object recognition is classic technique used to effectively recognize an object in the image. Technologies specifically in field of computer vision are expected to detect and recognize more complex tasks with help of local features detection methods. Over the last decade, there has been sustained increase in the number of researchers from various kind of disciplines i.e. academia, industry, security agencies and even from general public has caught an attention to explore the covered aspects of object detection and recognition concerned problems. It is further significantly amended by adopting deep learning model. In this paper, we applied deep learning to multi-class object recognition and explore convolutional neural network (CNN). The convolutional neural network is created with normalized standard initialization and trained with training set of sample images from 9 different object categories plus sample test images using widely varied dataset. All results are implemented in python tensorflow framework. We examine and compared CNN results with final feature vectors extracted from variant approaches of BOW based on linear L2-SVM classifier. Based on it, sufficient experiments verify our CNN model effectiveness and robustness with rate of 90.12% accuracy.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115345030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Efficient Way to Estimate the Focus of Expansion","authors":"Rui Huang, S. Ericson","doi":"10.1109/ICIVC.2018.8492881","DOIUrl":"https://doi.org/10.1109/ICIVC.2018.8492881","url":null,"abstract":"Detecting independent motion from a single camera is a difficult task in computer vision. It is because the captured image sequences are the combinations of the objects' movements and the camera's ego-motion. One major branch is to find the focus of expansion (FOE) instead as the goal. This is ideal for the situation commonly seen in UAV's camera system. In this case, the translation is dominant in camera's motion while the rotation is relatively small. To separate the ego motion and scene structure, many researchers used the directional flow as the theoretic basis and extracted its properties related to FOE. In this paper, we formulate finding FOE as an optimizing problem. The position of FOE has the minimal standard deviation for the directional flow in all directions, which is also subjected to the introduced constraint. The experiments show the proposed methods out-perform the previous method.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114348537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Robust Tracking Algorithm Based on Feature Fusion and Occlusion Judgment","authors":"Cheng-Gang Gu, Zhan-Li Sun, Xia Chen","doi":"10.1109/ICIVC.2018.8492748","DOIUrl":"https://doi.org/10.1109/ICIVC.2018.8492748","url":null,"abstract":"How to achieve a robust performance remains an intractable problem in the various object tracking algorithms due to some unfavorable factors, e.g. occlusions, appearance change, etc. In this paper, a robust object tracking approach is proposed based on feature fusion and occlusion detection. Under the relevant filtering model, two complementary features, HOG and color name features, are fused via a weighting strategy. Moreover, an occlusion detection method is presented according to the response function of the fused features. Experimental results on several challenging sequences demonstrate the effectiveness and feasibility of the proposed method.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114947525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}