2016 International Conference on Audio, Language and Image Processing (ICALIP)最新文献

筛选
英文 中文
Thematic information extraction in high-resolution remote sensing image based on weighted PCA and VBICA 基于加权PCA和VBICA的高分辨率遥感图像主题信息提取
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846612
Lan Liu, Chengfan Li, Yong-mei Lei, Junjuan Zhao, Xian-kun Sun
{"title":"Thematic information extraction in high-resolution remote sensing image based on weighted PCA and VBICA","authors":"Lan Liu, Chengfan Li, Yong-mei Lei, Junjuan Zhao, Xian-kun Sun","doi":"10.1109/ICALIP.2016.7846612","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846612","url":null,"abstract":"The thematic information extraction has been a difficult problem in high-resolution remote sensing application. Principal component analysis (PCA) is able to extract data's independent features on the basis of the second-order statistics, the variational Bayesian independent component analysis (VBICA) not only overcome the inconsistency between the standard ICA model and remote sensing image but also decrease the computational complexity. In view of the characteristics of high-resolution remote sensing, a thematic information extraction method based on weighted PCA and VBICA is presented in this article, and IKONOS high-resolution remote sensing image experiments are performed. The result shows that the classification accuracy of proposed method reaches 78.30% under certain conditions with the suitable number of eigenvectors and weighted values.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127030553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Oil depots detection from high resolution remote sensing images based on salient region extraction 基于显著区提取的高分辨率遥感影像油库检测
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846574
Chaoyang Li, H. Huo, T. Fang
{"title":"Oil depots detection from high resolution remote sensing images based on salient region extraction","authors":"Chaoyang Li, H. Huo, T. Fang","doi":"10.1109/ICALIP.2016.7846574","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846574","url":null,"abstract":"The traditional methods of detecting oil depots usually use Hough transform and template matching, which often have lower detection rates and are difficult to implement. An efficient two-step detection framework is proposed in this paper to detect oil depots in high resolution remote sensing images. In the first stage, LC saliency model is used to detect the salient regions and shows a good performance on highlighting oil depots. In the second stage, task related targets from these salient regions are extracted by removing the irrelevant salient areas according to the special properties of the targets. According to the final shape, the area and distribution of oil depots, using image threshold segmentation and the graph-based clustering procedure, oil depots are detected with fairly good accuracy and efficiency.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132070606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
DCSPARK: Virtualizing spark using Docker containers DCSPARK:使用Docker容器虚拟化spark
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846626
Zhou Lei, Hongguang Du, Shengbo Chen, C. Zhu, Xianyang Liu
{"title":"DCSPARK: Virtualizing spark using Docker containers","authors":"Zhou Lei, Hongguang Du, Shengbo Chen, C. Zhu, Xianyang Liu","doi":"10.1109/ICALIP.2016.7846626","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846626","url":null,"abstract":"As MapReduce has become a popular model for large-scale data procession in recent years, companies and researchers take advantage of this model to solve their problems. The applications may run on the same MapReduce cluster, with their own system-wide configure settings and library dependencies, respectively. Sometimes, their configure settings and library dependencies are conflicted with each other. How to ensure these applications to run together correctly without mutual interference and achieve high resources utilization gives a challenge to the researchers. In this paper, we propose DCSpark, a framework that leverages the power of Docker containers that allows users to run Spark applications which have conflicting configurations and library dependencies in one physical cluster. In addition, it's presented an implementation of our framework called DCM which is aimed at managing the physical cluster, processing scheduling problem and building the container-based Spark cluster images automatically according to the dependence environment of the applications. Our experimental evaluation shows that DCSpark introduces negligible overhead for CPU and memory performance compared with the native Spark cluster.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130808968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
The effects of language similarity on bilinguals' speech production 语言相似性对双语者言语产生的影响
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846540
Zhanling Cui, Xunbing Shen
{"title":"The effects of language similarity on bilinguals' speech production","authors":"Zhanling Cui, Xunbing Shen","doi":"10.1109/ICALIP.2016.7846540","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846540","url":null,"abstract":"The authors carried out three experiments exploring the influence of language similarity on language selection mechanisms. In Experiment 1, The participants were asked to perform the task of language-switching between their two dissimilar but highly proficient languages (Tibetan-Mandarin) ,in which they had to name the pictures quickly and accurately by using the cued language in the picture-word interference paradigm. In Experiment 2 and 3, the partcipants finished the same tasks except that they switched the languages between a more-proficient language (Tibetan or Mandarin) and dissimilar and less proficient language (English). The results showed there was no asymmetrical switching cost between Tibetan and Mandarin; and there was asymmetrical switching cost between non-fluent and proficient languages; meanwhile, language similarity affected speech production for non-proficient bilinguals. The results suggested that language similarity may play a role in the lexical selection mechanisms used by highly proficient bilinguals.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127949951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
3D point cloud matching based on principal component analysis and iterative closest point algorithm 基于主成分分析和迭代最近点算法的三维点云匹配
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846655
Chi Yuan, Xiaoqing Yu, Ziyue Luo
{"title":"3D point cloud matching based on principal component analysis and iterative closest point algorithm","authors":"Chi Yuan, Xiaoqing Yu, Ziyue Luo","doi":"10.1109/ICALIP.2016.7846655","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846655","url":null,"abstract":"Point cloud matching is one of the key technologies of optical three-dimensional contour measurement. Most of the point cloud matching without landmark used the iterative closest point algorithm. In order to improve the performance of the iterative closest point algorithm, the two-step iterative closest point algorithm was proposed. The improved algorithm is divided into a rough matching step and accurate matching step. Rough matching used the principal component analysis algorithm, while the fine matching used the improved iterative closest point algorithm. Compared with the classic iterative closest point algorithm, the improved algorithm can match the partial coincident point cloud. At the same time, the experiment can validate the effectiveness of the proposed algorithm.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"146 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131730701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Rate-distoriton analysis for Compressive Sensing based coding 基于压缩感知编码的速率失真分析
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846546
Wei Jiang, Junjie Yang
{"title":"Rate-distoriton analysis for Compressive Sensing based coding","authors":"Wei Jiang, Junjie Yang","doi":"10.1109/ICALIP.2016.7846546","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846546","url":null,"abstract":"Compressive Sensing (CS) is an emerging technology which samples a sparse signal at a rate corresponding to its actual information content rather than to its bandwidth. Different from traditional coding schemes in which distortion mainly comes from quantizer, distortion are related to quantization and compressive sampling in compressive sensing based coding schemes. Since the total coding bits are often constrained in the practical application, it is a great challenge to balance the number of measurements and quantization parameter to minimization the distortion. In this paper, a source rate and distortion model is proposed. The accuracy of the proposed R-D model is verified through experiments. Based on the R-D model, the optimal number of measurements and quantization step size are determined according to the rate-distortion criteria. Experimental results show that the proposed algorithm improves coding performances substantially.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"4 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129228065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
3D design tools for equipment manufacturing and exhibition based on internet 基于互联网的装备制造和展览三维设计工具
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846539
Feng Tian, Pan Wang, Chunyan Dong, Jie Ying Gao, Haifeng Tang, Li Qian, Haojun Xu
{"title":"3D design tools for equipment manufacturing and exhibition based on internet","authors":"Feng Tian, Pan Wang, Chunyan Dong, Jie Ying Gao, Haifeng Tang, Li Qian, Haojun Xu","doi":"10.1109/ICALIP.2016.7846539","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846539","url":null,"abstract":"To realize fast and universal 3D interactive media production for equipment exhibition, the internet 3D platform for equipment manufacturing and exhibition is designed based on WebGL graphics-engine. The main function of the plat-form includes sequential content creation, panoramic content creation, 3D animation creation. Meanwhile, fast cable simulation method based on rigid body link is proposed, cable simulation function is achieved on all kinds of platform. The internet authoring platform supports assistant 3D, the naked-eye 3D, helmet VR 3D, kinect, and other hardware peripheral equipment. Trials by multiple users, designers can create internet-based interactive panoramic content, sequence content, 3D animated content, cable simulation independently and rapidly without assistance of programmer. The 3D platform provides a simple design method and a new design mode for equipment manufacturing and exhibition. On the basis of the 3D authoring platform, the application of other 3D function can be upgraded. The 3D authoring platform is a platform foundation for the equipment manufacturing, maintenance, sales in the whole process of 3D content information.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126260319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A non-data-aided frequency offset estimation algorithm of two co-frequency 16QAM signals 两个同频16QAM信号的非数据辅助频偏估计算法
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846586
Peiqiang Wang, Kai Liu
{"title":"A non-data-aided frequency offset estimation algorithm of two co-frequency 16QAM signals","authors":"Peiqiang Wang, Kai Liu","doi":"10.1109/ICALIP.2016.7846586","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846586","url":null,"abstract":"For single-channel of 16QAM mixed signals, a non-data-aided (NDA) frequency offset estimation (FOE) algorithm is proposed. The frequency offset of signals after down conversion affects the phase of the signal and make the signal rotate so that the extremum of the in-phase components and the quadrature components of signals are changed. Calculation of the frequency offset by optimizing the object functions which is obtained from the extremums of the I-channel signals and the Q-channel signals, and the optimized process is accomplished by utilizing hierarchical search. This method does not require any prior information and it is insensitive for unknown parameters of mixed signals, for example, amplitude and timing offset. The simulation-results verify the effectiveness and practicality of the proposed algorithm.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116568094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A scheme discriminating between synthetic speech and normal speech 一种区分合成语音和正常语音的方案
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846613
Jilun Chen, Weiqiang Zhang, Jia Liu
{"title":"A scheme discriminating between synthetic speech and normal speech","authors":"Jilun Chen, Weiqiang Zhang, Jia Liu","doi":"10.1109/ICALIP.2016.7846613","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846613","url":null,"abstract":"This paper develops a system to automatically distinguish natural speech from synthetic speech. The issue of feature selection is considered. We take commonly used feature Mel-Frequency Cepstrum Coefficient (MFCC) in consideration, as well as other features such as Relative Phase Shift (RPS) and pitch tuned for Automatically Speech Recognition (ASR). We found some features are complimentary in the task of discriminating synthetic and natural speech. Gaussian Mixture Model Support Vector Machine (GMM-SVM) system is applied as classifier with feature input modified and compared to that of feature is applied in speaker recognition. Experiment on Librespeech versus online Text-to-Speech (TTS) speech synthesis platforms data set verified the effectiveness of the combination of these features.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"136 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115911706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 600BPS MELP vocoder with voice activity detection 具有语音活动检测的600BPS MELP声码器
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846549
Qiuyun Hao, Ye Li, Peng Zhang, Yanhong Fan, Xiaofeng Ma, Jingsai Jiang
{"title":"A 600BPS MELP vocoder with voice activity detection","authors":"Qiuyun Hao, Ye Li, Peng Zhang, Yanhong Fan, Xiaofeng Ma, Jingsai Jiang","doi":"10.1109/ICALIP.2016.7846549","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846549","url":null,"abstract":"In the underwater communication, satellite communication, secure communication and other channels, the channel bandwidth is narrow and the channel condition is relatively poor. Therefore, higher quality and lower rate speech coding is needed. In order to improve the synthetic speech quality and save channel bandwidth, voice activity detection (VAD) technique is introduced to Mixed Excitation Linear Prediction (MELP) vocoder at 600bps in this paper. It can save channel bandwidth and reduce noise, coding rate and power consumption. In order to improve the accuracy of speech endpoint detection at low signal-to-noise ratio (SNR), noise reduction is adopted to improve SNR, and the VAD algorithm based on statistical model (STAT-VAD) is used. The MELP vocoder with VAD and noise reduction not only has good anti-noise ability and can improve robustness in random channel, but also can reduce the average coding rate and save channel bandwidth. The quality of synthetic speech can achieve the desired results at low SNR. In addition, the vocoder at 600bps with VAD can run in real-time on TMS320VC5510 DSP platform.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125365858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信