2016 International Conference on Audio, Language and Image Processing (ICALIP): Latest Papers

A method of single channel blind source separation of co-frequency 16QAM signals
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846587
Zhu Wengui, Zhang Yu-ren
{"title":"A method of single channel blind source separation of co-frequency 16QAM signals","authors":"Zhu Wengui, Zhang Yu-ren","doi":"10.1109/ICALIP.2016.7846587","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846587","url":null,"abstract":"With the rapid development of wireless communication, a receiver equipped with a single antenna often receives more than one signal in the same frequency band simultaneously. For this reason, single channel blind source separation (BSS) has become a hot topic in the signal processing field. Nevertheless, the study of single channel blind source separation faces great challenges, since the problem is usually ill-posed. Many scholars have worked hard to solve this ill-posed problem. In this paper, a method is proposed to separate two overlapped co-frequency 16QAM signals based on the maximum likelihood (ML) method, exploiting the fact that communication signals have a finite set of symbols and that some preamble symbols are already known. Computer simulations of the method's performance show its reliability.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"275 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125144784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Six dimensional clustering segmentation of color point cloud
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846670
Z. Ximin, Wan Wanggen
{"title":"Six dimensional clustering segmentation of color point cloud","authors":"Z. Ximin, Wan Wanggen","doi":"10.1109/ICALIP.2016.7846670","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846670","url":null,"abstract":"This paper focuses on the clustering segmentation of 3D color point cloud. We extend the mean shift algorithm to the 3D xyz space, and what's more, we also consider the rgb color information, so the 6 dimensional data is adopted in the algorithm. The cluster center converges to the joint position of the local maximum density and the minimum gradient change of color, so our clustering segmentation not only considers the local geometrical features, but also utilizes the color information. The experiments show that our segmentation has better region consistency and has clear segmenting border in different color neighbors.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130641784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
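The six-dimensional clustering idea in the abstract above (mean shift over xyz position plus rgb color) can be illustrated with a minimal sketch. This is not the authors' implementation: the flat kernel, the single shared `bandwidth` for position and color, the `merge_radius` used to group converged modes, and the function names `mean_shift_6d` and `label_modes` are all assumptions made for illustration. Colors are assumed to be scaled to [0, 1].

```python
import math

def mean_shift_6d(points, bandwidth, n_iter=30, tol=1e-4):
    """Shift every 6-D point (x, y, z, r, g, b) toward a local density mode.

    Assumption: a flat kernel. Each step moves a point to the mean of all
    input points within `bandwidth` of it (Euclidean distance in 6-D).
    """
    modes = [tuple(p) for p in points]
    for _ in range(n_iter):
        moved = 0.0
        new_modes = []
        for m in modes:
            neighbours = [p for p in points if math.dist(m, p) <= bandwidth]
            mean = tuple(sum(c) / len(neighbours) for c in zip(*neighbours))
            moved = max(moved, math.dist(m, mean))
            new_modes.append(mean)
        modes = new_modes
        if moved < tol:  # stop once no point moves noticeably
            break
    return modes

def label_modes(modes, merge_radius):
    """Give the same label to modes that lie within `merge_radius` of a known center."""
    centers, labels = [], []
    for m in modes:
        for i, c in enumerate(centers):
            if math.dist(m, c) <= merge_radius:
                labels.append(i)
                break
        else:
            centers.append(m)
            labels.append(len(centers) - 1)
    return labels, centers

# Six points that are spatially adjacent but split into a red and a blue group.
points = [
    (0.0, 0.0, 0.0, 1.0, 0.0, 0.0),
    (0.1, 0.0, 0.0, 1.0, 0.0, 0.0),
    (0.0, 0.1, 0.0, 0.9, 0.0, 0.0),
    (0.1, 0.1, 0.0, 0.0, 0.0, 1.0),
    (0.2, 0.1, 0.0, 0.0, 0.0, 1.0),
    (0.2, 0.0, 0.0, 0.0, 0.1, 1.0),
]
modes = mean_shift_6d(points, bandwidth=0.5)
labels, centers = label_modes(modes, merge_radius=0.25)
print(labels)  # prints [0, 0, 0, 1, 1, 1]
```

In this toy input the two groups overlap in space, so position alone would not separate them; the color coordinates are what keep the clusters apart, which is the point the abstract makes about using color information.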
Music boundary detection with multiple features
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846614
Weiyao Xue, Shutao Sun, Fengyan Wu, Yongbin Wang
{"title":"Music boundary detection with multiple features","authors":"Weiyao Xue, Shutao Sun, Fengyan Wu, Yongbin Wang","doi":"10.1109/ICALIP.2016.7846614","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846614","url":null,"abstract":"Music structural analysis tasks have an important position in the field of Music information retrieval which require an understanding of how humans process music internally, such as music indexing, music summarization, and similarity analysis. Many schemes have been proposed to analyze the structure of recorded music, however they usually use single feature to detect boundaries of songs and the results are not satisfactory. In this paper, we present a method which is based on novelty detection and combines multiple features to the task of music boundaries detection. We extract peaks of novelty function derived from various features as potential boundaries, then eliminate non-boundaries from potential boundaries derived from distinct feature sets. Three types of features, including intensity, timbre, and harmony are employed to represent the characteristics of a music clip. On our testing database composed of 175 entire songs, the best accuracy of boundary detection with tolerance ±3 seconds achieves up to 65.7%.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133442395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
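The abstract above describes taking peaks of a novelty function as candidate boundaries and scoring detections with a ±3 second tolerance. A minimal sketch of those two steps follows. The threshold rule, the frame hop of 5 seconds, the example values, and the function names `pick_peaks` and `hit_rate` are assumptions for illustration; the paper's own peak-picking rule, its cross-feature elimination step, and its exact accuracy definition are not given in the abstract and are not reproduced here.

```python
def pick_peaks(novelty, threshold):
    """Return indices of local maxima of `novelty` that exceed `threshold`."""
    return [
        i for i in range(1, len(novelty) - 1)
        if novelty[i] > threshold
        and novelty[i] > novelty[i - 1]
        and novelty[i] > novelty[i + 1]
    ]

def hit_rate(detected, reference, tolerance=3.0):
    """Fraction of reference boundaries (in seconds) matched within +/- tolerance."""
    hits = sum(any(abs(d - r) <= tolerance for d in detected) for r in reference)
    return hits / len(reference)

# Hypothetical novelty curve, one value per 5-second frame.
novelty = [0.0, 0.2, 0.9, 0.3, 0.1, 0.6, 0.2, 0.1]
hop_seconds = 5.0

peak_frames = pick_peaks(novelty, threshold=0.5)
detected_times = [i * hop_seconds for i in peak_frames]

# Hypothetical annotated boundaries in seconds.
reference_times = [12.0, 40.0]
score = hit_rate(detected_times, reference_times, tolerance=3.0)
print(peak_frames, detected_times, score)  # prints [2, 5] [10.0, 25.0] 0.5
```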
Scene recognition algorithm based on multi-feature and weighted minimum distance classifier for digital hearing aids
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846557
Ru-wei Li, Shuang Zhang, Xiaoqun Yi
{"title":"Scene recognition algorithm based on multi-feature and weighted minimum distance classifier for digital hearing aids","authors":"Ru-wei Li, Shuang Zhang, Xiaoqun Yi","doi":"10.1109/ICALIP.2016.7846557","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846557","url":null,"abstract":"The recognition precision of the existing auditory scene recognition algorithms is relatively satisfactory, but they can only be applied to several noise scenarios, and it can't meet the performance requirements of digital hearing aids in complex environment. In order to solve the above problems, scene recognition algorithm based on multi-feature and weighted minimum distance classifier is proposed in this paper. In this algorithm, the speech endpoint detection algorithm based on the band-partitioning spectral entropy and spectral energy is used to divide the noisy speech into speech segment and noise segment. Then the characteristics such as Critical Band Ratio and band-partitioning spectral entropy as well as adaptive short-time zero crossing rate of each segment are extracted for the weighted minimum distance classifier to recognize the noise scenario. The experiments result shows that the proposed algorithm has strong robustness and high accuracy. It's suitable to be applied in digital hearing aids.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"42 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114101051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
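A weighted minimum distance classifier, as named in the abstract above, assigns a feature vector to the class whose mean is nearest under a per-feature weighted distance. The sketch below shows the general technique only. The weighted Euclidean form, the scene labels, the feature values, and the weights are hypothetical; the abstract does not say how the paper chooses its weights or normalizes its features.

```python
import math

def weighted_min_distance_classify(feature, class_means, weights):
    """Return the label whose mean vector is closest to `feature`.

    Distance is a weighted Euclidean distance: sqrt(sum_k w_k * (x_k - m_k)^2).
    """
    def weighted_distance(a, b):
        return math.sqrt(sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b)))

    return min(class_means, key=lambda label: weighted_distance(feature, class_means[label]))

# Hypothetical class means over three features (for example a band ratio,
# a spectral entropy, and a zero crossing rate), all scaled to [0, 1].
class_means = {
    "babble": (0.8, 0.3, 0.1),
    "traffic": (0.2, 0.7, 0.4),
}
weights = (2.0, 1.0, 0.5)  # assumed: larger weight = more trusted feature

print(weighted_min_distance_classify((0.7, 0.4, 0.2), class_means, weights))  # prints babble
```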
A precise evaluation method of prosodic quality of non-native speakers using average voice and prosody substitution
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846620
Hafiyan Prafianto, Takashi Nose, A. Ito
{"title":"A precise evaluation method of prosodic quality of non-native speakers using average voice and prosody substitution","authors":"Hafiyan Prafianto, Takashi Nose, A. Ito","doi":"10.1109/ICALIP.2016.7846620","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846620","url":null,"abstract":"We propose a method to improve the consistency of human evaluation of non-native speaker's utterance, with a capability to evaluate features such as accent and rhythm. In this method, human evaluators evaluate the accent and the rhythm independently by using average voice model and prosody substitution. We also investigated the advantages of evaluating those features independently. We found that, when the prosodic features are not evaluated independently, the accent scores are affected by the goodness of the rhythm and vice versa. The correlation coefficient of the accent score and the rhythm score of identical utterances was 0.23 using the conventional method and −0.026 using the proposed method. This also leads to greater disagreement between the scores given by different evaluators. Using the conventional method, 23% of the pairs between evaluators have their inter-evaluator correlation of the rhythm score more than 0.5, while using this proposed method, 67% of the pairs have the inter-evaluator correlation more than 0.5.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123926467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
Movie audio scene recognition based on WFST
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846543
Jichen Yang, Min Cai, Yanxiong Li, Hai Jin
{"title":"Movie audio scene recognition based on WFST","authors":"Jichen Yang, Min Cai, Yanxiong Li, Hai Jin","doi":"10.1109/ICALIP.2016.7846543","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846543","url":null,"abstract":"To improve movie audio scene (MAS) recognition accuracy, this paper proposes using a weighted finite-state transducer (WFST) to recognize MAS. The WFST is introduced first, followed by how to construct it; the WFST is then used to recognize MAS with FBANK, MFCC and PLPCC features separately. Experimental results on twenty MASs using the three features show that WFST recognizes MAS well, and that the FBANK feature performs better than MFCC and PLPCC, reaching 79.9%.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122394428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
Computer virtual reconstruction of a three dimensional scene in integral imaging
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846529
Min Guo, Yu-juan Si, Shigang Wang, Yuan-zhi Lyu, Bowen Jia, Wei Wu
{"title":"Computer virtual reconstruction of a three dimensional scene in integral imaging","authors":"Min Guo, Yu-juan Si, Shigang Wang, Yuan-zhi Lyu, Bowen Jia, Wei Wu","doi":"10.1109/ICALIP.2016.7846529","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846529","url":null,"abstract":"To solve the problem of rapid perception of the collected content and non-contact measurement in integral imaging, computer virtual reconstruction algorithm of a three dimensional scene is proposed. Firstly, calculate the combined disparity map of an elemental image array using the region-based iterative matching algorithm according to the distribution characteristics of the homologous pixels in the elemental image array; then calculate the spatial coordinates of the reconstructed object points in line with the triangulation principle; and at last, delete error points and reduce the data redundancy using the data simplification algorithm based on the scanning line, and then the reconstructed three dimensional scene is obtained. The experimental results indicate that the method can not only reconstruct a clear and complete three dimensional scene, restore the relative position of the objects, but also measure the objects' sizes in the three dimensional scene.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124857862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Supervised Feature Learning Network Based on the Improved LLE for face recognition
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846591
Dan Meng, Guitao Cao, W. Cao, Zhihai He
{"title":"Supervised Feature Learning Network Based on the Improved LLE for face recognition","authors":"Dan Meng, Guitao Cao, W. Cao, Zhihai He","doi":"10.1109/ICALIP.2016.7846591","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846591","url":null,"abstract":"Deep neural networks (DNNs) have been successfully applied in the fields of computer vision and pattern recognition. One drawback of DNNs is that most existing DNN models and their variants usually need to learn a very large set of parameters. Another drawback is that DNNs do not fully take the class label and local structure into account during the training stage. To address these issues, this paper proposes a novel approach, called Supervised Feature Learning Network Based on the Improved LLE (SFLNet), for face recognition. The goal of SFLNet is to extract features efficiently. Thus SFLNet consists of learning kernels based on the improved Locally Linear Embedding (LLE) and multiscale feature analysis. Instead of taking image pixels as the input of the LLE algorithm, the improved LLE uses linear discriminant kernel distance (LDKD). Besides, the outputs of the improved LLE are convolutional kernels, not dimensionality-reduced features. Multiscale feature analysis enhances insensitivity to complex changes caused by large pose, expression, or illumination variations. So SFLNet has better discrimination and is more suitable for the face recognition task. Experimental results on the Extended Yale B and AR datasets show the impressive improvement of the proposed method and its robustness to occlusion when compared with other state-of-the-art methods.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122143732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 6
Corrupted old film sequences restoration using improved PatchMatch and low-rank matrix recovery
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846589
Ting Yu, Youdong Ding, Xi Huang, Bing Wu
{"title":"Corrupted old film sequences restoration using improved PatchMatch and low-rank matrix recovery","authors":"Ting Yu, Youdong Ding, Xi Huang, Bing Wu","doi":"10.1109/ICALIP.2016.7846589","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846589","url":null,"abstract":"Taking corrupted old film as the research object, this paper proposes a new video restoration method based on improved PatchMatch and low-rank matrix recovery. Our method is divided into three steps. Firstly, we divide each frame in the video sequence into overlapping image patches, and similar interframe patches are found using the proposed improved PatchMatch algorithm. Then, low-rank matrix recovery is used to separate the patch group into a low-rank matrix component and a sparse error component. Finally, each video frame is synthesized from the recorded locations of the patches, and multi-frame joint automatic restoration is completed frame by frame. The proposed method has been tested on a set of old film sequences. Experiments demonstrate that it is an effective method for corrupted old film restoration.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128965589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
Progressive compression and transmission of 3D model with WebGL
2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI: 10.1109/ICALIP.2016.7846665
Pengfei Li, Xiaoqing Yu, Jingjing Wang
{"title":"Progressive compression and transmission of 3D model with WebGL","authors":"Pengfei Li, Xiaoqing Yu, Jingjing Wang","doi":"10.1109/ICALIP.2016.7846665","DOIUrl":"https://doi.org/10.1109/ICALIP.2016.7846665","url":null,"abstract":"This paper presents a system for progressive compression and transmission of 3D models with WebGL. An algorithm based on edge collapse is chosen to perform the compression and is modified to suit the system. WebGL is used on the client side, so the system can be multi-platform as long as a browser on that platform supports HTML5, such as Chrome or Firefox. These browsers support almost all platforms. With the progressive compression method running on the server side and WebGL running in the browser, users can view 3D models much more quickly, even on a smartphone, while the quality of the models remains acceptable.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128742875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2