{"title":"Video-based adaptive railway recognition in complex scene","authors":"Xiang Qiang, Zhaoyang Zhang, Qiwei Chen, Cheng Wu, Yiming Wang","doi":"10.1109/ICALIP.2016.7846527","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846527","url":null,"abstract":"Adaptively tracking a tram railway in complex video scenes is difficult because of road curvature and environmental changes. In this paper, we introduce an adaptive railway recognition method that analyzes the gray-level distribution features of the railway region. The method first segments track regions using multiple thresholds, which are dynamically optimized based on how the local accumulation histogram changes with the scene. Then, on the basis of the binary image, and combining connectivity analysis with skeleton extraction, track feature points are automatically extracted starting from the position of the track starting point. A suitable curve model is chosen to construct the railway equation. The proposed method is able to achieve accurate recognition of the railway in different scenes.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132310655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Visual tracking via local patches and contextual information","authors":"Hua Bao, Zonghai Chen","doi":"10.1109/ICALIP.2016.7846526","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846526","url":null,"abstract":"In this paper, a new visual tracking approach based on local patches and the contextual information of the target is presented. In the tracking procedure, the target object is decomposed into a set of patches of equal size, and each patch is represented using intensity and gradient histograms. Then, the likelihood of the local patches is defined as the weighted sum of reliability and stability indices, which are used to evaluate the patches' robustness. Furthermore, the target is represented using double bounding boxes corresponding to the foreground and background, respectively, which are encoded by HSV color histograms. In this way, drift can be effectively suppressed by using the contextual information. In the tracking process, the object position is estimated by maximizing the likelihood of the target under the Bayesian framework. The experimental results demonstrate that the proposed approach performs much better than existing state-of-the-art methods in terms of efficiency, accuracy and robustness.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130050867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Human action recognition based on Kinect and PSO-SVM by representing 3D skeletons as points in lie group","authors":"Dan Xu, Xiao Xiao, Xuzhi Wang, Jingjing Wang","doi":"10.1109/ICALIP.2016.7846646","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846646","url":null,"abstract":"In this paper, we propose an effective method to recognize human actions. Combined with the relationship between 3D skeleton model of joint position and particle group optimization algorithm is used to optimize the support vector machine (PSO-SVM) and depth through the Kinect sensor to obtain human 3D skeleton model, each skeletal model with 20 joints and 19 joints. Because the relative geometry between various body parts provides a more meaningful description than their absolute locations, we explicitly model the relative 3D geometry between different body parts in our skeletal representation. Mathematically, rigid body rotations and translations in 3D space are members of the special Euclidean group SE(3), which is a matrix Lie group. Hence, we represent the relative geometry between a pair of body parts as a point in SE(3). We then perform classification using a combination of dynamic time warping and particle swarm optimization on support vector machine (PSO-SVM). Experimental results on three action datasets (MSR-Action 3D, UT-Kinect, Florence 3D-Action) show that the proposed representation performs better than many existing skeletal representations. The proposed approach also outperforms various state-of-the-art skeleton-based human action recognition approaches.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125535204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"L1-norm minimization for octonion signals","authors":"Rui Wang, Guijun Xiang, F. Zhang","doi":"10.1109/ICALIP.2016.7846602","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846602","url":null,"abstract":"An algorithm for recovering the octonion signals in both noiseless and noise contaminated scenarios by solving an L1-norm minimization problem is presented. The L1-norm minimization problem over the octonion field is solved by converting it to an equivalent second-order cone programming problem over the real number field, which can be readily solved by convex optimization solvers like CVX. An application example of the proposed algorithm is also given for practical guidelines of perfect recovery of octonion signals. The proposed algorithm may find its potential application when CS theory meets the octonion signal processing.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116983851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An improved speech enhancement algorithm based on generalized sidelobe canceller","authors":"Bin Li, Linghua Zhang","doi":"10.1109/ICALIP.2016.7846528","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846528","url":null,"abstract":"In speech enhancement algorithms based on the generalized sidelobe canceller (GSC), when there is an error in direction estimation, the target speech will not be completely blocked by the blocking matrix (BM) module. Then, in the later multiple-input canceller (MC) module, the target speech will be eliminated, which will cause leakage of the target speech. In this paper, a new optimization algorithm is proposed to address the speech leakage caused by errors in the signal direction of arrival (DOA). The blocking matrix is adjusted adaptively according to the characteristics of the correlation between the final output of the GSC and the output of the BM module. This way, the estimated direction in the blocking matrix can be brought closer to the real target speech direction, which reduces the leakage of the target speech in the multiple-input canceller. The simulation results show that the proposed algorithm has better speech enhancement performance in both objective and subjective evaluations.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128566981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Application of multispectral remote sensing technology in surface water body extraction","authors":"Z. Ding, Ni Qi, Fang Dong, Liang Jinhui, Yao Wei, Yuan Shenggui","doi":"10.1109/ICALIP.2016.7846565","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846565","url":null,"abstract":"This paper presents an improved water-body extraction method using four-band multispectral remote sensing images. Firstly, traditional multispectral water body extraction technologies are reviewed, and their features are compared. The comparison shows that finding the largest difference between the reflection ratios of the water body and the background is the key to improving the identification rate. Furthermore, combining the data features of the GF-01 and GF-02 satellites, an improved normalization index method that introduces the weighted average of the blue and green light bands is proposed. Finally, comparative results illustrating the improvement in water body extraction are provided and analyzed.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122053867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An RFID indoor positioning system by using Particle Swarm Optimization-based Artificial Neural Network","authors":"Changzhi Wang, Zhicai Shi, Fei Wu, Juan Zhang","doi":"10.1109/ICALIP.2016.7846624","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846624","url":null,"abstract":"Indoor location information service (ILS) has been a hot research topic in recent years. However, localization cost and positioning accuracy are still challenges for indoor positioning systems (IPS). RFID positioning technology offers low cost and high positioning accuracy and is commonly used for IPS. In this study, an RFID indoor positioning algorithm is proposed, which is based on the Particle Swarm Optimization Artificial Neural Network (PSO-ANN). The algorithm uses PSO to optimize the weights and thresholds of the ANN, and establishes an accurate classification model that can learn the relationship between the Received Signal Strength Indication (RSSI) and the tag position. In addition, in order to effectively reduce the impact of environmental factors on the position estimation, a Gaussian filter is adopted to process the RSSI information. The experimental results demonstrate that the proposed algorithm has better performance than other artificial neural networks.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114537950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparison of sparse-view CT image reconstruction algorithms","authors":"Shu Zhang, Youshen Xia, Changzhong Zou","doi":"10.1109/ICALIP.2016.7846575","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846575","url":null,"abstract":"In recent years, the restoration of computerized tomography (CT) images from low-dose projections has been a key issue in CT image processing. Sparse-view-based methods have been proposed to achieve reasonable image quality. This paper studies three conventional sparse-view CT image reconstruction algorithms: the total variational minimization projection onto convex set (TVM-POCS) algorithm, the two-step iterative Shrinkage-Thresholding (TwIST) algorithm, and the iterative filtered back projection (FBP) algorithm. The three algorithms are compared and analyzed in terms of computational complexity, universal quality index (UQI), and structure similarity index (SSIM). Two comparative experiments are performed, for the sparse-view case and the low-dose projection case, respectively. The computed results reveal that under Poisson noise environments, the TVM-POCS algorithm has superior performance over the other algorithms in restoration quality and computing time.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115025526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Javaweb login authentication based on improved MD5 algorithm","authors":"Linxia Zhong, W. Wan, Deke Kong","doi":"10.1109/ICALIP.2016.7846653","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846653","url":null,"abstract":"MD5 (message digest algorithm) plays an important role in digital signatures, identity authentication and data encryption on account of its strong one-way encryption and irreversibility. Consequently, MD5 is widely used in login authentication modules, in which the password is encrypted by the MD5 algorithm. The login process checks whether the encrypted value of the entered password matches the value stored in the database, which is no longer the original password. Because the traditional MD5 algorithm does not hold up well against collision attacks, differential attacks and dictionary attacks, this paper presents an improved MD5 algorithm to address these problems. First, the password is split into its numeric and character parts; the numbers are incremented by 1, 2, 3, ... respectively, while each character is incremented by 1 in ASCII; the final password is then formed by encrypting the combination of the numbers and characters. The experiment has proved that this algorithm can effectively resist the brute force attack and differential attack.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124552669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research and application of dynamic and interactive data visualization based on D3","authors":"Lianjun Chen, Hong-bo Zhou","doi":"10.1109/ICALIP.2016.7846608","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846608","url":null,"abstract":"Data visualization is an effective method for people to carry out deeper observation and analysis in the context of big data. To solve the issue of deficient interaction with static data, the paper conducted research based on D3 data visualization technology. It proposed a D3 data-driven visualization model, and applied the model in the school equipment maintenance management system with comparison charts of maintenance group workload. The paper discussed the entire procedures of dynamic and interactive data visualization from several angles: data processing, application, and data visualization. The results supported the users' need of statistical charts, and confirmed the advantage of the D3 technique in dynamic and interactive data visualization.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116291495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}