Lan Liu, Chengfan Li, Yong-mei Lei, Junjuan Zhao, Xian-kun Sun
{"title":"Thematic information extraction in high-resolution remote sensing image based on weighted PCA and VBICA","authors":"Lan Liu, Chengfan Li, Yong-mei Lei, Junjuan Zhao, Xian-kun Sun","doi":"10.1109/ICALIP.2016.7846612","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846612","url":null,"abstract":"The thematic information extraction has been a difficult problem in high-resolution remote sensing application. Principal component analysis (PCA) is able to extract data's independent features on the basis of the second-order statistics, the variational Bayesian independent component analysis (VBICA) not only overcome the inconsistency between the standard ICA model and remote sensing image but also decrease the computational complexity. In view of the characteristics of high-resolution remote sensing, a thematic information extraction method based on weighted PCA and VBICA is presented in this article, and IKONOS high-resolution remote sensing image experiments are performed. The result shows that the classification accuracy of proposed method reaches 78.30% under certain conditions with the suitable number of eigenvectors and weighted values.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127030553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Oil depots detection from high resolution remote sensing images based on salient region extraction","authors":"Chaoyang Li, H. Huo, T. Fang","doi":"10.1109/ICALIP.2016.7846574","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846574","url":null,"abstract":"The traditional methods of detecting oil depots usually use Hough transform and template matching, which often have lower detection rates and are difficult to implement. An efficient two-step detection framework is proposed in this paper to detect oil depots in high resolution remote sensing images. In the first stage, LC saliency model is used to detect the salient regions and shows a good performance on highlighting oil depots. In the second stage, task related targets from these salient regions are extracted by removing the irrelevant salient areas according to the special properties of the targets. According to the final shape, the area and distribution of oil depots, using image threshold segmentation and the graph-based clustering procedure, oil depots are detected with fairly good accuracy and efficiency.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132070606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhou Lei, Hongguang Du, Shengbo Chen, C. Zhu, Xianyang Liu
{"title":"DCSPARK: Virtualizing spark using Docker containers","authors":"Zhou Lei, Hongguang Du, Shengbo Chen, C. Zhu, Xianyang Liu","doi":"10.1109/ICALIP.2016.7846626","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846626","url":null,"abstract":"As MapReduce has become a popular model for large-scale data procession in recent years, companies and researchers take advantage of this model to solve their problems. The applications may run on the same MapReduce cluster, with their own system-wide configure settings and library dependencies, respectively. Sometimes, their configure settings and library dependencies are conflicted with each other. How to ensure these applications to run together correctly without mutual interference and achieve high resources utilization gives a challenge to the researchers. In this paper, we propose DCSpark, a framework that leverages the power of Docker containers that allows users to run Spark applications which have conflicting configurations and library dependencies in one physical cluster. In addition, it's presented an implementation of our framework called DCM which is aimed at managing the physical cluster, processing scheduling problem and building the container-based Spark cluster images automatically according to the dependence environment of the applications. Our experimental evaluation shows that DCSpark introduces negligible overhead for CPU and memory performance compared with the native Spark cluster.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130808968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The effects of language similarity on bilinguals' speech production","authors":"Zhanling Cui, Xunbing Shen","doi":"10.1109/ICALIP.2016.7846540","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846540","url":null,"abstract":"The authors carried out three experiments exploring the influence of language similarity on language selection mechanisms. In Experiment 1, The participants were asked to perform the task of language-switching between their two dissimilar but highly proficient languages (Tibetan-Mandarin) ,in which they had to name the pictures quickly and accurately by using the cued language in the picture-word interference paradigm. In Experiment 2 and 3, the partcipants finished the same tasks except that they switched the languages between a more-proficient language (Tibetan or Mandarin) and dissimilar and less proficient language (English). The results showed there was no asymmetrical switching cost between Tibetan and Mandarin; and there was asymmetrical switching cost between non-fluent and proficient languages; meanwhile, language similarity affected speech production for non-proficient bilinguals. The results suggested that language similarity may play a role in the lexical selection mechanisms used by highly proficient bilinguals.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127949951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"3D point cloud matching based on principal component analysis and iterative closest point algorithm","authors":"Chi Yuan, Xiaoqing Yu, Ziyue Luo","doi":"10.1109/ICALIP.2016.7846655","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846655","url":null,"abstract":"Point cloud matching is one of the key technologies of optical three-dimensional contour measurement. Most of the point cloud matching without landmark used the iterative closest point algorithm. In order to improve the performance of the iterative closest point algorithm, the two-step iterative closest point algorithm was proposed. The improved algorithm is divided into a rough matching step and accurate matching step. Rough matching used the principal component analysis algorithm, while the fine matching used the improved iterative closest point algorithm. Compared with the classic iterative closest point algorithm, the improved algorithm can match the partial coincident point cloud. At the same time, the experiment can validate the effectiveness of the proposed algorithm.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"146 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131730701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Rate-distoriton analysis for Compressive Sensing based coding","authors":"Wei Jiang, Junjie Yang","doi":"10.1109/ICALIP.2016.7846546","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846546","url":null,"abstract":"Compressive Sensing (CS) is an emerging technology which samples a sparse signal at a rate corresponding to its actual information content rather than to its bandwidth. Different from traditional coding schemes in which distortion mainly comes from quantizer, distortion are related to quantization and compressive sampling in compressive sensing based coding schemes. Since the total coding bits are often constrained in the practical application, it is a great challenge to balance the number of measurements and quantization parameter to minimization the distortion. In this paper, a source rate and distortion model is proposed. The accuracy of the proposed R-D model is verified through experiments. Based on the R-D model, the optimal number of measurements and quantization step size are determined according to the rate-distortion criteria. Experimental results show that the proposed algorithm improves coding performances substantially.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"4 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129228065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Feng Tian, Pan Wang, Chunyan Dong, Jie Ying Gao, Haifeng Tang, Li Qian, Haojun Xu
{"title":"3D design tools for equipment manufacturing and exhibition based on internet","authors":"Feng Tian, Pan Wang, Chunyan Dong, Jie Ying Gao, Haifeng Tang, Li Qian, Haojun Xu","doi":"10.1109/ICALIP.2016.7846539","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846539","url":null,"abstract":"To realize fast and universal 3D interactive media production for equipment exhibition, the internet 3D platform for equipment manufacturing and exhibition is designed based on WebGL graphics-engine. The main function of the plat-form includes sequential content creation, panoramic content creation, 3D animation creation. Meanwhile, fast cable simulation method based on rigid body link is proposed, cable simulation function is achieved on all kinds of platform. The internet authoring platform supports assistant 3D, the naked-eye 3D, helmet VR 3D, kinect, and other hardware peripheral equipment. Trials by multiple users, designers can create internet-based interactive panoramic content, sequence content, 3D animated content, cable simulation independently and rapidly without assistance of programmer. The 3D platform provides a simple design method and a new design mode for equipment manufacturing and exhibition. On the basis of the 3D authoring platform, the application of other 3D function can be upgraded. The 3D authoring platform is a platform foundation for the equipment manufacturing, maintenance, sales in the whole process of 3D content information.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126260319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A non-data-aided frequency offset estimation algorithm of two co-frequency 16QAM signals","authors":"Peiqiang Wang, Kai Liu","doi":"10.1109/ICALIP.2016.7846586","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846586","url":null,"abstract":"For single-channel of 16QAM mixed signals, a non-data-aided (NDA) frequency offset estimation (FOE) algorithm is proposed. The frequency offset of signals after down conversion affects the phase of the signal and make the signal rotate so that the extremum of the in-phase components and the quadrature components of signals are changed. Calculation of the frequency offset by optimizing the object functions which is obtained from the extremums of the I-channel signals and the Q-channel signals, and the optimized process is accomplished by utilizing hierarchical search. This method does not require any prior information and it is insensitive for unknown parameters of mixed signals, for example, amplitude and timing offset. The simulation-results verify the effectiveness and practicality of the proposed algorithm.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116568094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A scheme discriminating between synthetic speech and normal speech","authors":"Jilun Chen, Weiqiang Zhang, Jia Liu","doi":"10.1109/ICALIP.2016.7846613","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846613","url":null,"abstract":"This paper develops a system to automatically distinguish natural speech from synthetic speech. The issue of feature selection is considered. We take commonly used feature Mel-Frequency Cepstrum Coefficient (MFCC) in consideration, as well as other features such as Relative Phase Shift (RPS) and pitch tuned for Automatically Speech Recognition (ASR). We found some features are complimentary in the task of discriminating synthetic and natural speech. Gaussian Mixture Model Support Vector Machine (GMM-SVM) system is applied as classifier with feature input modified and compared to that of feature is applied in speaker recognition. Experiment on Librespeech versus online Text-to-Speech (TTS) speech synthesis platforms data set verified the effectiveness of the combination of these features.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"136 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115911706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A 600BPS MELP vocoder with voice activity detection","authors":"Qiuyun Hao, Ye Li, Peng Zhang, Yanhong Fan, Xiaofeng Ma, Jingsai Jiang","doi":"10.1109/ICALIP.2016.7846549","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846549","url":null,"abstract":"In the underwater communication, satellite communication, secure communication and other channels, the channel bandwidth is narrow and the channel condition is relatively poor. Therefore, higher quality and lower rate speech coding is needed. In order to improve the synthetic speech quality and save channel bandwidth, voice activity detection (VAD) technique is introduced to Mixed Excitation Linear Prediction (MELP) vocoder at 600bps in this paper. It can save channel bandwidth and reduce noise, coding rate and power consumption. In order to improve the accuracy of speech endpoint detection at low signal-to-noise ratio (SNR), noise reduction is adopted to improve SNR, and the VAD algorithm based on statistical model (STAT-VAD) is used. The MELP vocoder with VAD and noise reduction not only has good anti-noise ability and can improve robustness in random channel, but also can reduce the average coding rate and save channel bandwidth. The quality of synthetic speech can achieve the desired results at low SNR. In addition, the vocoder at 600bps with VAD can run in real-time on TMS320VC5510 DSP platform.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125365858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}