International Symposium on Chinese Spoken Language Processing: Latest Publications

The modeling of tongue tip in Standard Chinese using MRI

International Symposium on Chinese Spoken Language Processing Pub Date : 2014-10-27 DOI: 10.1109/ISCSLP.2014.6936625

Gaowu Wang, J. Dang, Jiangping Kong

Citations: 1

Speaker adaptation of speaking rate-dependent hierarchical prosodic model for Mandarin TTS

International Symposium on Chinese Spoken Language Processing Pub Date : 2014-10-27 DOI: 10.1109/ISCSLP.2014.6936616

Po-Chun Wang, I-Bin Liao, Chen-Yu Chiang, Yih-Ru Wang, Sin-Horng Chen

Abstract: In this paper, a speaker adaptation method to adapt an existing speaking rate-dependent hierarchical prosodic model (SR-HPM) of an SR-controlled Mandarin TTS system to new speaker's data for realizing a new voice is proposed. Two main problems are addressed: data sparseness for few adaptation utterances existing only in a small range of normal speaking rate and no adaptation data in both ranges of fast and slow speaking rates. The proposed method follows the idea of SR-HPM training to firstly normalize the prosodic-acoustic features of the new speaker's speech data, to then train an HPM by the prosody labeling and modeling algorithm, and to lastly refine the HPM to an SR-dependent model. The MAP adaptation method with model parameter extrapolation is applied to cope with the above two problems. Experimental results on a male speaker's adaptation data confirmed that the resulting adaptive SR-HPM has reasonable parameters covering a wide range of speaking rates and hence can be used in the TTS system to generate prosodic-acoustic features for synthesizing the new speaker's voice of any given SR.

Citations: 8

How to describe speech emotion more completely - An investigation on Chinese broadcast news speech

International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423508

Yingying Gao, Weibin Zhu

Citations: 3

Acoustic modeling for native and non-native Mandarin speech recognition

International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423544

Xin Chen, Jian Cheng

Citations: 2

Development of an articulatory visual-speech synthesizer to support language learning

International Symposium on Chinese Spoken Language Processing Pub Date : 2010-11-01 DOI: 10.1109/ISCSLP.2010.5684832

Ka Ho WONG, Wai-Kim Leung, W. Lo, H. Meng

Citations: 8

Towards Automatic Tone Correction in Non-native Mandarin

International Symposium on Chinese Spoken Language Processing Pub Date : 2006-12-13 DOI: 10.1007/11939993_62

Mitchell Peabody, S. Seneff

Citations: 25

Nonlinear Emotional Prosody Generation and Annotation

International Symposium on Chinese Spoken Language Processing Pub Date : 2006-12-13 DOI: 10.1007/11939993_23

J. Tao, Jian Yu, Yongguo Kang

Citations: 0

Pitch Mean Based Frequency Warping

International Symposium on Chinese Spoken Language Processing Pub Date : 2006-12-13 DOI: 10.1007/11939993_13

Jian Liu, T. Zheng, Wenhu Wu

Citations: 16

Meeting Segmentation Using Two-Layer Cascaded Subband Filters

International Symposium on Chinese Spoken Language Processing Pub Date : 2006-12-13 DOI: 10.1007/11939993_68

M. Giuliani, T. Nwe, Haizhou Li

Citations: 2

Language Identification by Using Syllable-Based Duration Classification on Code-Switching Speech

International Symposium on Chinese Spoken Language Processing Pub Date : 2006-12-13 DOI: 10.1007/11939993_50

Dau-Cheng Lyu, Ren-Yuan Lyu, Yuang-Chin Chiang, Chun-Nan Hsu

Citations: 5