{"title":"A Moving Target Recognition Algorithm Based on Improved Mixture Gaussian Background Model","authors":"Zhang Yongmei, Ma Li, Liu Mengmeng, S. Haiyan","doi":"10.1145/3177404.3177418","DOIUrl":"https://doi.org/10.1145/3177404.3177418","url":null,"abstract":"Many mutant factors in the video such as noise and illumination can easily lead to the moving target recognition error. The paper proposes a moving target recognition algorithm based on improved mixture Gaussian background model for ships and vehicles. The algorithm on account of mixture Gaussian background model and three-frame difference can obtain the potential target regions with less background under the condition of motion disturbance and light mutation in the background, extract the straight line, shape factor and Zernike moment features from the potential regions, and construct the least square support vector machine to identify the ships and vehicles. The experiment results show the algorithm can accurately identify the ships and vehicles.","PeriodicalId":133378,"journal":{"name":"Proceedings of the International Conference on Video and Image Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124892178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xuemei Xie, Jiang Du, Guangming Shi, H. Hu, Wang Li
{"title":"An Improved Approach for Visualizing Dynamic Vision Sensor and its Video Denoising","authors":"Xuemei Xie, Jiang Du, Guangming Shi, H. Hu, Wang Li","doi":"10.1145/3177404.3177411","DOIUrl":"https://doi.org/10.1145/3177404.3177411","url":null,"abstract":"Dynamic vision sensor (DVS) is an event-based camera capturing the changes of vision with high speed and low storage consumption. To better understand what DVS captures, we need to visualize the events. Existing methods have realized visualization. To optimize the vision experience, this paper proposes a framework to visualize events with rich information, high speed and less noise. Firstly, we propose an improved visualization approach using overlapped events based on human vision system. Secondly, we propose a video denoising method using shared dictionaries. In our experiments, the proposed method realizes the expected purpose on the whole video.","PeriodicalId":133378,"journal":{"name":"Proceedings of the International Conference on Video and Image Processing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130130959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Structural Approach for Event Resolution in Cricket Videos","authors":"S. Premaratne, K. Jayaratne","doi":"10.1145/3177404.3177414","DOIUrl":"https://doi.org/10.1145/3177404.3177414","url":null,"abstract":"Classification of multimedia big data today has become a serious issue for organizations. Therefore, new concepts to mine of multimedia big have emerged and people are doing numerous researches on how to effectively handle different types of multimedia data also known as multi model data. In our research, we focus on how to effectively extract and classify data from a multimedia data related to sports videos and draw conclusions considering all media present in the content. We specifically considered the game of cricket in this research to build a multi-model mining approach to identify specific events. We consider low level details such as color variations in videos and pitch in audio to analyze and distinguish different attributes of a given event. The input can be any cricket video and our approach is to identify events such as a four, a six, a dot ball etc. by extracting low level details of its color variations, edges, camera changes and audio frequency variations. This research for video/audio data extraction provides room for addition of further audio data extraction and textual data extraction for classification of a multimodal data set.","PeriodicalId":133378,"journal":{"name":"Proceedings of the International Conference on Video and Image Processing","volume":"33 23","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120818530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yuanyuan Wu, Rong Ding, Jinjian Yang, Mengxiang Lin
{"title":"3D Thermal Mapping Using a 2D Laser Rangefinder and a Thermal Camera","authors":"Yuanyuan Wu, Rong Ding, Jinjian Yang, Mengxiang Lin","doi":"10.1145/3177404.3177422","DOIUrl":"https://doi.org/10.1145/3177404.3177422","url":null,"abstract":"This paper presents a low-cost 3D thermal modeling tool consisting of a laser rangefinder and a thermal camera. We utilize a two dimension laser rangefinder with the aid of a steering gear to get a 3D point cloud. Furthermore, the 3D point cloud is fused with infrared images captured by the thermal camera. As a result, a 3D thermal model containing spatial and temperature information is constructed. For better recognizing the entities, the sparse 3D model is enhanced by greedy projection triangulation algorithm before visualizing. The tool is built and evaluated in real scenes. The experimental results show that our tool can reconstruct the three-dimension thermal model of an environment successfully.","PeriodicalId":133378,"journal":{"name":"Proceedings of the International Conference on Video and Image Processing","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122991760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The method of the pipeline magnetic flux leakage detection image formation based on the artificial intelligence","authors":"Lijian Yang, Meng Shi, Song-wei Gao","doi":"10.1145/3177404.3177434","DOIUrl":"https://doi.org/10.1145/3177404.3177434","url":null,"abstract":"For long distance pipelines of oil and gas pipeline, the magnetic flux leakage detection in data storage and discriminant quantity is larger and the problem of recognition is slow. By using the convolution neural network, the detection data of the leakage magnetic field is processed to realize the detection of magnetic leakage and the intelligent processing of the data discriminant. The method of the pipeline magnetic flux leakage detection image formation based on the artificial intelligence achieving leakage magnetic detection imaging, and it is the earlier processing of the intelligence identification. The original image is analyzed and pretreated by the imaging processing method of image corrosion and grayscale. The method is highlights the image features, and it makes the pipeline features display clearly, which provides a clearly image feature for the intelligent identification of pipeline features and improves the efficiency of intelligent identification of the magnetic flux leakage data.","PeriodicalId":133378,"journal":{"name":"Proceedings of the International Conference on Video and Image Processing","volume":"10 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126368350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image Stitching and Quality Evaluation Algorithm for Large Size Parts","authors":"Xin Li, Liming Wu, Guitang Wang, Yujun Chen, Kuai Rong, Jianhai Lin","doi":"10.1145/3177404.3177451","DOIUrl":"https://doi.org/10.1145/3177404.3177451","url":null,"abstract":"Aiming at the problem of small field of view and low precision of online vision measurement for large size parts, an algorithm for image stitching and quality evaluation of large size parts is proposed.Through the establishment of imaging normalized model, calculating the transformation matrix between images, and then the camera is calibrated several times to obtain the best calibration matrix, which can realize image stitching.Finally, according to the indexes that are the SSIM value and the DoEM value of the mosaic image to evaluate the quality of the mosaic image of large size parts.Experimental results show that the proposed method can not only reduce stitching time, improve accuracy, but also meet the subjective visual perception of human, which meets the requirements of on-line inspection of large size parts.","PeriodicalId":133378,"journal":{"name":"Proceedings of the International Conference on Video and Image Processing","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126379866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Effectiveness of Contrast Limited Adaptive Histogram Equalization Technique on Multispectral Satellite Imagery","authors":"Ganesh R. Vidhya, H. Ramesh","doi":"10.1145/3177404.3177409","DOIUrl":"https://doi.org/10.1145/3177404.3177409","url":null,"abstract":"Contrast Limited Adaptive Histogram Equalization technique (CLAHE) is a widely used form of contrast enhancement, used predominantly in enhancing medical imagery like X-rays and to enhance features in ordinary photographs. This paper aimed to understand the effectiveness of using this technique in multispectral satellite imagery and to study its effectiveness in different regions of the electromagnetic spectrum. This study also aimed at analyzing variations of spatial and spectral resolutions of a sensor affect the performance of the CLAHE technique by means of comparing quantitative parameters of the enhanced images between the sensors. A general idea of the feature that can be enhanced in each spectral region was also studied. The results showed that a comparative study between the CLAHE technique and the conventional global histogram equalization technique resulted in the former technique emerging superior of the two and thereby reconstructed images of better quality.","PeriodicalId":133378,"journal":{"name":"Proceedings of the International Conference on Video and Image Processing","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121673961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Study on the Digital Art Presentation Based on Tri-colored Camel Carrying Musicians on the Back","authors":"Chen Peng, Wang Qian","doi":"10.1145/3177404.3177447","DOIUrl":"https://doi.org/10.1145/3177404.3177447","url":null,"abstract":"Tang Sancai (also called Tri-colored glazed pottery) has its unique artistic value and is known as the \"shining treasure of the Chinese art\". However, many people are not aware of its artistic features and cannot fully appreciate the beauty of Tang Sancai due to its limited form of display. Therefore, it is important to extract the artistic features of the pottery, explore new ways of displaying it and hence inherit and promote the spiritual culture embodied in the pottery. This essay takes Tri-colored Camel Carrying Musicians on the Back as an example. By analyzing and comparing different types of artistic representation of the digital art, the author aims to showcase the artistic characteristics of the pottery for better understanding and appreciation.","PeriodicalId":133378,"journal":{"name":"Proceedings of the International Conference on Video and Image Processing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116697211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhixiang He, Xiaoli Sun, Chenhui Li, G. Baciu, Yushi Li
{"title":"Superpixel Matching Based Image Retrieval","authors":"Zhixiang He, Xiaoli Sun, Chenhui Li, G. Baciu, Yushi Li","doi":"10.1145/3177404.3177428","DOIUrl":"https://doi.org/10.1145/3177404.3177428","url":null,"abstract":"Local features of images have been widely used in image retrieval, however, the cost is so heavy. To address this issue, a superpixel-based approach for image retrieval is proposed. We first extract the image structure that preserves the main information and removes the redundant information from the image by smoothing and oversegment a smoothed image into a certain number of superpixels. We then extract the positive candidate superpixels by combining superpixels with local descriptors. Finally, we compute the similarity of two images by analyzing two sets of positive candidate superpixels. Experiments on dataset PQ7 demonstrate the performance of the proposed approach.","PeriodicalId":133378,"journal":{"name":"Proceedings of the International Conference on Video and Image Processing","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127615727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A New 3D Multi-modality Medical Bone Image Registration Algorithm","authors":"Huanjie Tao, Xiaobo Lu","doi":"10.1145/3177404.3177427","DOIUrl":"https://doi.org/10.1145/3177404.3177427","url":null,"abstract":"Three-dimensional (3D) multi-modality medical bone image registration is an important technology in surgical application, especially in large computer-aided orthopedic surgery. To improve registration accuracy, we propose a new 3D multi-modality medical bone image registration algorithm based on local features through analyzing the bone structure. In this method, the image Hessian matrix is introduced for local features extraction, and the local behavior of the 3D bone image is described by the eigenvalues of Hessian matrix. This method can automatically extract and select the most representative feature points (blob-like structure) in different scales. Then we adopt the idea of triangle matching to get stereo matching point pairs. Improve random sample consensus (RANSAC) algorithm is adopted to remove wrong matching point pairs. We use the right matching point pairs to establish rigid transformation model and solve this non-linear model by Levenberg-Marquardt algorithm to get geometric transformation parameters. Simulated experiments and real experiments demonstrate that the proposed method can achieve a high image registration accuracy.","PeriodicalId":133378,"journal":{"name":"Proceedings of the International Conference on Video and Image Processing","volume":"234 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123118320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}