{"title":"Low Complex Simple Measurement Matrix for Sparse Recovery of Speech Signal","authors":"M. A. Sankar, S. P. Savithri","doi":"10.1145/3271553.3271597","DOIUrl":"https://doi.org/10.1145/3271553.3271597","url":null,"abstract":"Compressed Sensing (CS), the methodology of signal capturing, allows sampling at flexible rates below Nyquist, with the constraint that the sparsifying basis and the level of sparsity are known in advance for the signal of interest. Many speech codecs based on CS frame work are developed using Linear Predictive Coding (LPC), Discrete Cosine Transform (DCT) and Code Excited Linear Prediction (CELP). In most of them, Gaussian random matrix is used for deriving the observation vector which is computationally complex and has large memory requirements. In this paper, a modified binary sensing matrix, specifically for speech signal is proposed, which has low coherence with the sparsifying bases used for reconstruction. The Signal-to-Noise Ratio (SNR) improvement goes beyond 3-4 dB and it is more significant at very high compression ratios. The application of the proposed sensing matrix to CS based codecs using CELP and dynamic DCT&LPC bases shows significant improvement in the perceptual quality of the reconstructed speech. This enables the functioning of these codecs at lower bit rates without compromising the quality.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122366180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Study of Gearbox Fault Diagnosis Based on Wavelet Decompostion Adaptive ELMD","authors":"Hongxia Pan, Mingzhi Pan, Jun Huang","doi":"10.1145/3271553.3271613","DOIUrl":"https://doi.org/10.1145/3271553.3271613","url":null,"abstract":"Based on local mean decomposition (LMD) method in the process of decomposition of gearbox vibration signal of modal aliasing phenomenon, put forward a kind of based on adaptive wavelet decomposition ELMD gearbox fault diagnosis methods. First to Gaussian white noise signal mixed in the vibration signal, and then to LMD signal decomposed product function (PF) component, and has carried on the spectrum of PF component analysis, analog signal decomposition and processing results demonstrate the effectiveness of the method. The standard deviation ratio of white noise is determined by using the adaptive criterion based on wavelet decomposition. Finally using adaptive wavelet decomposition based ensemble local mean decomposition (ELMD) method for gear tooth surface wear and fracture failure of the vibration signal of cage, the PF decomposition, and the decomposition of PF component zoom spectrum analysis, the gear box fault diagnosis, the diagnosis of achieved good results.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129009916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research and Implementation on Multi-beacon Aided AUV Integrated Navigation Algorithm Based on UKF","authors":"Dajun Sun, C. Zheng, Miao Yu, Yixian Shuai","doi":"10.1145/3271553.3271606","DOIUrl":"https://doi.org/10.1145/3271553.3271606","url":null,"abstract":"The primary objective of this work is to solve the problem that the long baseline (LBL) positioning system has poor real-time positioning results during the navigation for autonomous underwater vehicles (AUVs). In this paper, it is proposed to use the unscented Kalman filter (UKF) to enhance the navigation and positioning accuracy of the AUV, using the speed information measured by the Doppler Velocity Log (DVL) and the ranging information obtained by the acoustic beacons. Utilizing the unscented transformation to perform integrated navigation, it can avoid linearization truncation errors and Jacobi matrix computation compared with extended Kalman filtering (EKF). The experimental results verify that in the multi-beacon aided AUV integrated navigation system, compared with the conventional LBL and EKF, UKF provides more stable and accurate position information in real time.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115185551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Monitoring Network Traffic of Different OSes in Different IP Protocols","authors":"Chenhuan Liu, Chen Su, Xing Li","doi":"10.1145/3271553.3271599","DOIUrl":"https://doi.org/10.1145/3271553.3271599","url":null,"abstract":"Recently, the booming big data era has brought increasing attention on the network traffic classification problem. To cope with the problem, methods based on port, payload, behavior and machine learning have been proposed since 2000s. However, these methods rely on people's prior knowledge to classify and their accuracy is hardly to be convincing. To solve the problem above, we propose a method through connecting a switch on the host network to mirror the host's network traffic. In this way, network traffic of hosts under different operating systems and different IP protocol configurations can be monitored. We conducted experiments based on three weeks of data measured on a public network. Results show that the traffic of different IP protocols are independent. Comparison with Moore-set shows that our method can classify specific network traffic from different OSes under IPv4, IPv6 and dual stack protocols.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127757291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Active Learning for Text Classification","authors":"Bang An, Wenjun Wu, Huimin Han","doi":"10.1145/3271553.3271578","DOIUrl":"https://doi.org/10.1145/3271553.3271578","url":null,"abstract":"In recent years, Active Learning (AL) has been applied in the domain of text classification successfully. However, traditional methods need researchers to pay attention to feature extraction of datasets and different features will influence the final accuracy seriously. In this paper, we propose a new method that uses Recurrent Neutral Network (RNN) as the acquisition function in Active Learning called Deep Active Learning (DAL). For DAL, there is no need to consider how to extract features because RNN can use its internal state to process sequences of inputs. We have proved that DAL can achieve the accuracy that cannot be reached by traditional Active Learning methods when dealing with text classification. What's more, DAL can decrease the need of the great number of labeled instances for Deep Learning (DL). At the same time, we design a strategy to distribute label work to different workers. We have proved by using a proper batch size of instance, we can save much time but not decrease the model's accuracy. Based on this, we provide batch of instances for different workers and the size of batch is determined by worker's ability and scale of dataset, meanwhile, it can be updated with the performance of the workers.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126814078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Text Mining of English Picture Books","authors":"Hiromi Ban, T. Oyabu","doi":"10.1145/3271553.3271616","DOIUrl":"https://doi.org/10.1145/3271553.3271616","url":null,"abstract":"Picture books play an important role as a material that develops children's linguistic competence. Thus, English picture books can be considered to be indispensable in children's English study. In this paper, metrical characteristics of some English picture books were investigated, compared with English textbooks for Japanese junior high schools students. In short, frequency characteristics of character- and word-appearance were investigated. These characteristics were approximated by an exponential function. Furthermore, the percentage of Japanese junior high school required vocabulary and American basic vocabulary was calculated to obtain the difficulty-level. As a result, it was clearly shown that the English picture books have a similar tendency to literary writings in the characteristics of character-appearance, and some books are more difficult than English textbooks.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126986733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Technology of Radar Terminal on Researching Software Display","authors":"Zhenkui Miao, Wei Xu, Hong Chen","doi":"10.1145/3271553.3271615","DOIUrl":"https://doi.org/10.1145/3271553.3271615","url":null,"abstract":"In this paper, The software display of radar terminal is the trend of radar equipment development, and the utilization of computer resources has become an important hot topic. This paper based on Windows multimedia instructions, the development of DirectX3D multimedia applications and the method of OpenGL language development kit, uses three different ways to achieve the displaying of radar and attenuation. The effect is ideal.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"7 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121234161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Facial Expression Recognition Using Convolutional Neural Network","authors":"Yijun Gan","doi":"10.1145/3271553.3271584","DOIUrl":"https://doi.org/10.1145/3271553.3271584","url":null,"abstract":"Facial expressions are part of human language and are often used to convey emotions. With the development of human-computer interaction technology, people pay more and more attention to facial expression recognition (FER) technology. Besides, in the domain of FER, human beings have made some progress. In this paper, we reviewed the development of FER: VGGNet, ResNet, GoogleNet, and AlexNet. Besides, we studied some ideas of CNN (Convolutional Neural Network), and we used FER2013, which is one of the most significant databases of human faces, as the dataset to be considered. Furthermore, we made some improvements based on the original methods of FER. By training the FER2013 dataset with different revised ways, the best result of accuracy we got is 0.6424. At last, we generated and summarized the progress and deficiencies in this study.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115960438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cryptographic Key Generation Using Burning Ship Fractal","authors":"Shafali Agarwal","doi":"10.1145/3271553.3271577","DOIUrl":"https://doi.org/10.1145/3271553.3271577","url":null,"abstract":"The study introduces a key generation scheme using a burning ship fractal function, Hilbert transformation and an external key. The burning ship function is a modified version of a well-known Mandelbrot set function in which absolute value of a complex variable is considered. The process starts with the scrambling of the fractal image pixels by applying a Hilbert curve scanning. To enhance the randomness and complexity, an external key is obtained using a pseudo random number generator (PRNG), whose length depends on the size of the used fractal image. Further, a covering module is applied in which eight different types of operations are performed recursively to cover the scrambled fractal image pixels using eight different keys. At each iteration, a modified external key was used to perform the respective operation to the remaining image pixels. Moreover, to ensure the robustness of the proposed scheme, each block of tempkey (created in previous step) permuted using the sorting indexes of the modified tempkey blocks. The performance analysis of the given method is carried out in terms of the key space, key sensitivity, key generation time, histogram, and correlation coefficient. The results indicate that the proposed method is reliable and secure with great potential to be further use in the image encryption applications.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125150356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Gaussian Reciprocal Sequences from the Viewpoint of Conditionally Markov Sequences","authors":"R. Rezaie, X. Rong Li","doi":"10.1145/3271553.3271587","DOIUrl":"https://doi.org/10.1145/3271553.3271587","url":null,"abstract":"The conditionally Markov (CM) sequence contains several classes, including the reciprocal sequence. Reciprocal sequences have been widely used in many areas of engineering, including image processing, acausal systems, intelligent systems, and intent inference. In this paper, the reciprocal sequence is studied from the CM sequence point of view, which is different from the viewpoint of the literature and leads to more insight into the reciprocal sequence. Based on this viewpoint, new results, properties, and easily applicable tools are obtained for the reciprocal sequence. The nonsingular Gaussian (NG) reciprocal sequence is modeled and characterized from the CM viewpoint. It is shown that a NG sequence is reciprocal if and only if it is both CML and CMF (two special classes of CM sequences). New dynamic models are presented for the NG reciprocal sequence. These models (unlike the existing one, which is driven by colored noise) are driven by white noise and are easily applicable. As a special reciprocal sequence, the Markov sequence is also discussed. Finally, it can be seen how all CM sequences, including Markov and reciprocal, are unified.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125496626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}