2012 8th International Symposium on Chinese Spoken Language Processing: Latest Papers, Page 2

Synthesized stereo-based stochastic mapping with data selection for robust speech recognition

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423542

Jun Du, Qiang Huo

Citations: 6

The lossless adaptive arithmetic coding based on context for ITU-T G.719 at variable rate

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423462

Xuan Ji, Jing Wang, Hailong He, Jingming Kuang

Citations: 0

Resonance-based spectral deformation in HMM-based speech synthesis

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423478

Jinfu Ni, Y. Shiga, H. Kawai, H. Kashioka

Citations: 1

mENUNCIATE: Development of a computer-aided pronunciation training system on a cross-platform framework for mobile, speech-enabled application development

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423507

Pengfei Liu, K. Yuen, Wai-Kim Leung, H. Meng

Citations: 8

Enhanced lengthening cancellation using bidirectional pitch similarity alignment for spontaneous speech

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423517

Po-Yi Shih, Bo-Wei Chen, Jhing-Fa Wang, Jhing-Wei Wu

Citations: 1

Text-Dependent Speaker Recognition with long-term features based on functional data analysis

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423461

Chenhao Zhang, T. Zheng, Ruxin Chen

Abstract: Text-Dependent Speaker Recognition (TDSR) is widely used. Short-term features such as Mel-Frequency Cepstral Coefficients (MFCC) have been the dominant features in traditional Dynamic Time Warping (DTW) based TDSR systems. Short-term features capture local temporal dynamics well but describe the overall statistical characteristics of a sentence poorly. Functional Data Analysis (FDA) has shown a significant advantage in exploring the statistical information of data, so this paper proposes a long-term feature extraction based on MFCC and FDA theory. The procedure consists of the following steps: first, FDA theory is applied after MFCC feature extraction; second, to compress redundant information, a new feature based on Functional Principal Component Analysis (FPCA) is generated; third, the distance between training features and test features is calculated for use in the recognition procedure. Compared with the existing MFCC plus DTW method, experimental results show that the new features extracted with the proposed method, combined with the cosine similarity measure, demonstrate better performance.
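The three steps named in the abstract above (functional representation of MFCC trajectories, FPCA-based compression, cosine scoring) can be sketched roughly as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the Fourier basis, the number of basis functions, the use of ordinary PCA on basis coefficients as a stand-in for FPCA, and all function names are hypothetical choices, and the input is assumed to be an already-computed MFCC matrix of shape (frames, coefficients).

```python
import numpy as np

def fourier_basis(n_frames, n_basis):
    """Evaluate a Fourier basis on n_frames points of normalized time [0, 1)."""
    t = np.arange(n_frames) / n_frames
    cols = [np.ones(n_frames)]
    k = 1
    while len(cols) < n_basis:
        cols.append(np.sin(2 * np.pi * k * t))
        if len(cols) < n_basis:
            cols.append(np.cos(2 * np.pi * k * t))
        k += 1
    return np.stack(cols, axis=1)  # shape (n_frames, n_basis)

def functional_coefficients(frames, n_basis=7):
    """Step 1 (assumed form): least-squares fit of every feature trajectory
    to the basis, giving a fixed-length vector for any utterance length."""
    basis = fourier_basis(frames.shape[0], n_basis)
    coefs, *_ = np.linalg.lstsq(basis, frames, rcond=None)  # (n_basis, n_dims)
    return coefs.ravel()

def fit_pca(vectors, n_components):
    """Step 2 (assumed stand-in for FPCA): PCA on the basis coefficients."""
    mean = vectors.mean(axis=0)
    _, _, vt = np.linalg.svd(vectors - mean, full_matrices=False)
    return mean, vt[:n_components]

def project(vector, mean, components):
    """Compress one coefficient vector onto the retained components."""
    return components @ (vector - mean)

def cosine_similarity(a, b):
    """Step 3: score a test feature against an enrolled (training) feature."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
```

Because every utterance maps to a vector of the same length, enrollment and test features can be compared directly with `cosine_similarity`, with no frame-level alignment such as DTW.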

Citations: 0

Context dependant phone mapping for cross-lingual acoustic modeling

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423496

Van Hai Do, Xiong Xiao, Chng Eng Siong, Haizhou Li

Citations: 10

Structured modeling based on generalized variable parameter HMMs and speaker adaptation

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423526

Yang Li, Xunying Liu, Lan Wang

Citations: 7

An improved steady segment based decoding algorithm by using response probability for LVCSR

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423525

Zhanlei Yang, Wenju Liu, Hao Chao

Citations: 1

An improved tone labeling and prediction method with non-uniform segmentation of F0 contour

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423467

Xingyu Na, Xiang Xie, Jingming Kuang, Yaling He

Citations: 0