Guoquan Li, Ruiyang Xia, Zhengwen Huang, Lingyun Wen, Huiqian Wang
{"title":"Clustering SIFT: An Efficient Way to Achieve Flipping Invariance","authors":"Guoquan Li, Ruiyang Xia, Zhengwen Huang, Lingyun Wen, Huiqian Wang","doi":"10.1109/ICSP48669.2020.9320908","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320908","url":null,"abstract":"The design and improvement of feature descriptor is always a prevalent research point. A strong design of feature descriptor can be applied to many industrial fields. However, in many successful designs, there are still have some shortcomings that need to be improved like the flipping operation in image. Although there are many techniques to deal with this problem recently, they still exist some uncertainty factors because they do not compare with different images which have the same but flipped object in their experiments. In this paper, we design an unsupervised learning structure algorithm based on SIFT to achieve flipping invariant and change object to get as many correct matches as possible. Then we will use this algorithm to match in UKBench dataset and indicate that this approach not only solves the problem of flipping operation but also ensures high matching accuracy. Compared with previous other algorithms, our algorithm is extremely simple to implement and does not destroy the structure of SIFT.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114059880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Arbitrary Clutter PHD Filter and Its Implementation","authors":"Xinglin Shen, Luping Zhang, Moufa Hu","doi":"10.1109/ICSP48669.2020.9320953","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320953","url":null,"abstract":"This paper is focus on multiple target tracking with arbitrary clutter process by using the probability hypothesis density (PHD) filter. Based on the Random Finite Set (RFS) framework, the clutter process in the classical PHD filter is modeled as Poisson RFS, which is only reasonable for some scenarios in reality. The PHD filter suitable for arbitrary clutter process has not yet been implemented. In this paper, an arbitrary clutter PHD filter, as well as its particle filter implementation, are derived by using the probability generating functional (PGFL) based on the RFS framework. Simulation results show that the proposed arbitrary clutter PHD filter can be used for multiple target tracking with arbitrary clutter process.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122829174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Language Identification with Unsupervised Phoneme-like Sequence and TDNN-LSTM-RNN","authors":"Linjia Sun","doi":"10.1109/ICSP48669.2020.9320919","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320919","url":null,"abstract":"A novel language identification (LID) method is proposed that accepts the architecture of time delay neural network (TDNN) followed by long short term memory (LSTM) recurrent neural network (RNN) to learn long-term phonetic patterns and model the phonetic dynamics for different languages. Instead of the linguistic phonemes, the phoneme-like speech units are used to train the TDNN-LSTM-RNN, which can be found without prior linguistic knowledge and manual transcriptions. Compared with PPRLM, the experiment results show that the phoneme-like speech units by unsupervised discovering and the linguistic phonemes by manual annotation have the same effect in the LID task. Furtherly, the proposed LID method is built and reported the test results on the NIST LRE07 and the task of dialect identification. We compare the proposed LID method with other state-of-the-art methods, including the acoustic feature based LID methods and the phonetic feature based LID methods. The experimental results show that our method provides competitive performance with the existing methods in the LID task. In particular, our method helps to capture robust discriminative information for short duration language identification and high accuracy for dialect identification.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128810386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Weichuan Yu, Peiyu He, Zili Xu, Ao Cui, Xuewei Shi, Jing Lei
{"title":"Optimization Method of Sparse Array Frequency and Steering Invariant Beamforming","authors":"Weichuan Yu, Peiyu He, Zili Xu, Ao Cui, Xuewei Shi, Jing Lei","doi":"10.1109/ICSP48669.2020.9321015","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9321015","url":null,"abstract":"The conventional beamforming optimization method cannot meet the requirements of constant beamwidth of spare array in different frequency and different beam pointing angles. In this paper, a frequency and steering invariant beamforming method based on compressed sensing (CSFSIB) is proposed. First, the array elements are extended virtually, the sparse signal obtained by virtual array is reconstructed by compressed sensing (CS) recovery algorithm. Then the Second-order cone programming (SOCP) method is used to calculate the weighted vector and optimize the steering invariant beamforming (SIB). Finally, frequency invariant beamforming (FIB) is designed based on inverse Fourier transform (IFT). Simulation results indicate that the proposed method can reduce the sidelobe level and save the number of array elements while keeping the mainlobe width constant.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116808610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Simulation of Dynamic Ship Radiated Noise Signal","authors":"Yuan Zheng, Bin Jiang, Gang Yang","doi":"10.1109/ICSP48669.2020.9320983","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320983","url":null,"abstract":"The total intensity and the spectral distribution of ship’s radiated noise signal changes dynamically as the ship navigates. Conventional static simulation methods are focused on exhibiting non-changing or fixed patterns of signal’s characteristics. For applications that require high-fidelity models, methods that can correctly simulate the above mentioned dynamic characteristics are preferred. In this paper, such a method is proposed and investigated. It is an enhancement of the static simulation method. Firstly, the resulting statically simulated intermediate signals are taken and divided into segments in the time domain. Secondly, for each segment and each frequency point of signal’s linear and continuous spectrum, the intensities are adjusted according to the acoustic transmission loss formula and then merged. Finally, the modulation process is conducted. The simulation experiment results show that the simulated noise signals can exhibit the dynamic characteristics of the navigating ship through the variation on total signal intensity and spectral distribution. Moreover, the simulated signals are also smooth, natural, and non-sectional in auditory sense.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122001650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xiaohui Yu, Xinbo Li, Yiran Shi, Xiaodong Sun, Shiqian Wang
{"title":"Maximum Likelihood Optimization of Linear Frequency Modulated Signal Based on Particle Swarm","authors":"Xiaohui Yu, Xinbo Li, Yiran Shi, Xiaodong Sun, Shiqian Wang","doi":"10.1109/ICSP48669.2020.9321091","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9321091","url":null,"abstract":"The maximum likelihood estimation (MLE) is an optimal parameter estimation for the linear frequency modulated (LFM) signal. It can reach the Cramer-Rao lower bound. However, the great calculation load caused by multidimensional search limits its practical application. In this paper, the particle swarm optimization (PSO) algorithms based on the MLE of LFM parameters are proposed. Three different PSO algorithms, namely global mode standard PSO, local mode standard PSO, and hybrid PSO combined with global mode and local mode, are applied to optimize the MLE of LFM parameters. Through the updating of velocities and positions of the particles in multidimensional space, the estimation speed of chirp rate and initial frequency of LFM signal is accelerated effectively. The convergence performance, statistical performance and calculation time of the three PSO algorithms are compared by MATLAB experiments, through which the performance of the three optimization algorithms for LFM parameter estimation is analyzed.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128934451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yiren Liu, Yanjiang Wang, Limiao Deng, Mingyue Gao, Weifeng Liu
{"title":"Visual Images Memory Recall Based on ClassRBM and Free Energy Minimization","authors":"Yiren Liu, Yanjiang Wang, Limiao Deng, Mingyue Gao, Weifeng Liu","doi":"10.1109/ICSP48669.2020.9320956","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320956","url":null,"abstract":"Mounting evidence suggests that the brain works in a Bayesian way. Probabilistic inference models based on Bayesian theory have been widely applied in human memory modeling, such as search of associative memory (SAM) and retrieving effectively from memory (REM). However, these lines of memory models can only determine the class of the test images during memory recall. They fail to recall the test visual image, which is referred to as mental imagery in cognitive psychology. In order to address this issue, in this paper, we first propose a memory model based on classification RBM (ClassRBM), in which a three-layer neural network is constructed with 500-800-500 units. The input layer and output layer represent the visual sensory images and corresponding labels, respectively, while the hidden layer stores the encoded visual information. Then we apply free energy minimization principle to train the model, in which we assume that the storage and recall processes of memory in human brain involve free energy minimization. Finally, the learned visual images can be reconstructed by probabilistically sampling the visual units over the hidden units. Comprehensive experiments show that the proposed memory model is capable of recalling images studied and can be applied to the modeling of mental images in the mind.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130885103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Three-Dimensional Cooperative TDOA Location Method with Multi-UAV Based on Quantum Wind Driven Optimization","authors":"Hongyuan Gao, Shihao Wang, Zhiwei Zhang","doi":"10.1109/ICSP48669.2020.9321087","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9321087","url":null,"abstract":"Three-dimensional (3D) cooperative time- difference-of-arrival (TDOA) location model aims to solve the 3D location information of unmanned air vehicles (UAV) in complex environment without GPS. This paper uses a singlechain coding quantum wind driven optimization combined with Chan algorithm (Chan-QWDO) to solve the non-linear optimization problem in 3D cooperative TDOA location model. QWDO algorithm uses quantum rotation angle combined with chaotic equation and quantum rotation gate strategy on population evolution. Compared with Chan algorithm (Chan), particle swarm optimization (PSO) and genetic algorithms (GA), the Chan-QWDO algorithm shows better performance in 3D cooperative TDOA location problem. The simulation results show that if the parameters are assumed reasonably, Chan- QWDO algorithm has stable performance, fast convergence speed and strong adaptability. And the mean square error (MSE) is lower than other algorithms which means Chan-QWDO algorithm has higher positioning accuracy.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130339558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Background Initialization Based on Adaptive Online Low-rank Subspace Learning","authors":"Guang Han, Guanghao Zhang, Xi Cai","doi":"10.1109/ICSP48669.2020.9320960","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320960","url":null,"abstract":"Background initialization is to estimate an appropriate representation for background of a scene, and has a decisive role in determining the performance of background subtraction. Background initialization based on low-rank subspace learning can obtain the background by learning the low-rank subspace. However, most of these methods are batch- based methods requiring heavy memory cost and unable to adapt to dynamic scenes. Accordingly, in this paper, we propose a background initialization method based on adaptive online low-rank subspace learning. The low-rank background subspace is estimated by online robust principal component analysis (PCA) in an online manner. An adaptive weighting parameter is utilized in the online robust PCA to enhance its ability to dynamically model the background. Experimental results demonstrate that, the proposed method can effectively gain the backgrounds of dynamic scenes.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130593714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Detection of Railway Tunnel lining Based on Adaptive Background Learning","authors":"Yuxin Liu, Enze Yang, Shuoyan Liu","doi":"10.1109/ICSP48669.2020.9320937","DOIUrl":"https://doi.org/10.1109/ICSP48669.2020.9320937","url":null,"abstract":"Tunnel is the absolutely necessary part of railway line. Its state of service is often influenced by linings. Once the lining has fallen on the concrete bed, it will directly threaten the safety of trains in the tunnel. So, one timely lining detection system is a key assistant to keep the tunnel a good operating condition. In this paper, the approach of lining fall-blocks detection by intelligent video analysis based on adaptive background modeling is proposed. Without a large number of labeled samples, the unsupervised method based on gaussian mixture model (GMM) is applied to monitor the fallen block in real time. The basic idea is to set the learning rate automatically, which is mainly according to the critical attributes including image intensity and feature point of current frame. Because of the strategy the method has the advantage of modeling the background in variable scenes such as illumination-changing, camera-shaking, train-passing, etc. Finally, in our experiment, the results demonstrated the effectiveness of the proposed approach.","PeriodicalId":237073,"journal":{"name":"2020 15th IEEE International Conference on Signal Processing (ICSP)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133622142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}