{"title":"Large vocabulary continuous Mandarin speech recognition using finite state machine","authors":"Yi-Cheng Pan, Chia-Hsing Yu, Lin-Shan Lee","doi":"10.1109/CHINSL.2004.1409572","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409572","url":null,"abstract":"The finite state transducer (FST), popularly used in the natural language processing (NLP) area to represent the grammar rules and the characteristics of a language, has been extensively used as the core in large vocabulary continuous speech recognition (LVCSR) in recent years. By means of FST, we can effectively compose the acoustic model, pronunciation lexicon, and language model to form a compact search space. In this paper, we present our approach to developing an LVCSR decoder using FST as the core. In addition, the traditional one-pass tree-copy search algorithm is also described for comparison in terms of speed, memory requirements and achieved character accuracy.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"331 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134100626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis of Shanghainese F0 contours based on the command-response model","authors":"Wentao Gu, K. Hirose, H. Fujisaki","doi":"10.1109/CHINSL.2004.1409591","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409591","url":null,"abstract":"As one of the major Chinese dialects, Shanghainese is well known for its complex tone sandhi system. This paper applies the command-response model to represent F0 contours of Shanghainese speech. Analysis-by-synthesis is conducted both on carrier sentences with monosyllabic target words and on isolated polysyllabic words, from which a set of appropriate tone command patterns is derived for words of different lengths and different initial citation tones. By incorporating the effects of tone coarticulation, word accentuation and phrase intonation, the model gives high accuracy of approximations to F0 contours of Shanghainese utterances, and hence provides a more efficient means to quantitatively represent F0 contours and to describe the tone sandhi system of Shanghainese than the traditional 5-level tone code system.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"230 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134223899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Hearer model based stress prediction for Chinese TTS system","authors":"Guoping Hu, Qingfeng Liu, Yu Hu, Ren-Hua Wang","doi":"10.1109/CHINSL.2004.1409611","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409611","url":null,"abstract":"People often feel tired if they listen to synthesized speech for a long time. This is mainly because synthesized speech is too flat and never stresses the focus. Unlike traditional TTS research approaches of speaker simulation, this paper investigates stress prediction from the point of view of the hearer. An ideal hearer model is first proposed to predict the stress distribution based on the following hypothesis: people speak with limited stress effort and distribute that limited effort to ensure that the hearer can understand the speaker easily. Then, given the limited research resources available, we modify the ideal hearer model and present a practical model. Experiments show that the stress prediction achieves an acceptable accuracy of 87.36%.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134469165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Emotion recognition from Mandarin speech signals","authors":"T. Pao, Yu-Te Chen, Jun-Heng Yeh","doi":"10.1109/CHINSL.2004.1409646","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409646","url":null,"abstract":"In this paper, a Mandarin speech based emotion classification method is presented. Five primary human emotions including anger, boredom, happiness, neutral and sadness are investigated. In emotion classification of speech signals, the conventional features are statistics of fundamental frequency, loudness, duration and voice quality. However, the recognition accuracy of systems employing these features degrades substantially when more than two valence emotion categories are invoked. For speech emotion recognition, we select 16 LPC coefficients, 12 LPCC components, 16 LFPC components, 16 PLP coefficients, 20 MFCC components and jitter as the basic features to form the feature vector. A Mandarin corpus recorded by 12 non-professional speakers is employed. The recognizer presented in this paper is based on three recognition techniques: LDA, K-NN, and HMMs. Experimental results show that the selected features are robust and effective for emotion recognition, not only in the arousal dimension but also in the valence dimension.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116739197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An investigation into subspace rapid speaker adaptation","authors":"Michael Zhang, Jun Xu","doi":"10.1109/CHINSL.2004.1409639","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409639","url":null,"abstract":"Speaker adaptation is an essential part of any state-of-the-art automatic speech recognizer (ASR). Recently, more and more application requirements have appeared for embedded ASR. For these cases, a more compact speech model, the subspace distribution clustering hidden Markov model (SDCHMM), is used instead of the continuous density hidden Markov model (CDHMM). In previous studies on SDCHMM adaptation, the subspace Gaussian pools of the SDCHMM are the parameters adjusted for speaker variations. Alternatively, we try to employ the link table parameters of the SDCHMM, which define the tying structure in subspaces, to model the inter-speaker mismatch, with the Gaussian parameters maintained. Since the variation range for these parameters is highly limited, this method is potentially faster than conventional Gaussian pool adaptation. A comparative study on a continuous digital dialing (CDD) task shows that when data is seriously insufficient, link table adaptation is more effective than conventional methods, with 17% relative improvement in utterance accuracy rate, compared to 14% improvement by previous Gaussian adaptation. However, further improvement with more data is limited. When the data size is doubled, this method gives 21% improvement, compared to 30% improvement by the conventional method.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133254460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An information gain and grammar complexity based approach to attribute selection in speech enabled information retrieval dialogs","authors":"Haiping Li, Haixin Chai","doi":"10.1109/CHINSL.2004.1409657","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409657","url":null,"abstract":"An effective dialog driven method is required for today's speech enabled information retrieval systems, such as name dialers. Similar to the dynamic sales dialog for electronic commerce scenarios, information gain measure based approaches are widely used for attribute selection and dialog length reduction. However, for speech enabled information retrieval systems, another important factor influencing attribute selection is speech recognition accuracy. Accuracy that is too low results in a failed dialog. Recognition accuracy varies with many issues, including acoustic model performance and grammar complexity. The acoustic model is fixed for a whole dialog, while the grammar is different for each interaction round, so grammar complexity influences which attribute is selected for the next question. An approach combining both information gain measurement and grammar complexity is presented for a dynamic dialog driven system. Offline evaluations show that this approach offers a trade-off between faster discrimination of the retrieval candidates and higher recognition accuracy.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125559704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dependence of correct pronunciation of Chinese aspirated sounds on power during voice onset time","authors":"A. Hoshino, Akio Yasuda","doi":"10.1109/CHINSL.2004.1409601","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409601","url":null,"abstract":"The length of voice onset time (VOT) in uttering Chinese aspirated sounds, which are difficult for Japanese to pronounce, is an important factor in evaluating the quality of pronunciation. In this paper, both the length of the VOT and the power used during the VOT for 21 single-vowel syllables of six different Chinese aspirates were measured for 40 Japanese students and nine native speakers of Chinese. The quality of the students' pronunciation was evaluated using a hearing test judged by eight native Chinese. The results indicated that the correlation between the quality of the students' pronunciation and the power used in uttering a sound was stronger than that with the VOT length, within a certain range of VOT that varied for different syllables. Thus, we conclude that power is also an important factor in evaluating the quality of pronunciation.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"140 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129111723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Energy contour enhancement for noisy speech recognition","authors":"Tai-Hwei Hwang, Sen-Chia Chang","doi":"10.1109/CHINSL.2004.1409633","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409633","url":null,"abstract":"Environmental noise, known as an additive noise, not only corrupts the spectra of a speech signal but also blurs the shape of its energy contour. The corruption of the energy contour can distort the energy derived feature and degrade the pattern classification performance on noisy speech. To reduce the distortion of the energy feature, the energy bias in the energy contour has to be removed before feature extraction. For this purpose, we propose two methods to estimate the noise energy: one estimates it from the speech-inactive period, and the other from the noisy speech itself. The methods are evaluated by the connected digit recognition of TIDigits, in which the test speech is corrupted with white noise, babble, factory noise, and in-car noise. As shown in the experiments, the energy enhancement can provide an additional improvement when applied jointly with spectral subtraction.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129937891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Chinese-English mixed-lingual keyword spotting","authors":"Shan-Ruei You, Shih-Chieh Chien, Chih-Hsing Hsu, Ke-Shiu Chen, Jia-Jang Tu, Jeng-Shien Lin, Sen-Chia Chang","doi":"10.1109/CHINSL.2004.1409630","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409630","url":null,"abstract":"Based on our previous experience in the \"ITRI 104 Auto Attendant System\" of using keyword spotting for Mandarin speech recognition (W.-C. Shieh et al., CCL Technical Journal, vol. 96), a Chinese-English mixed-lingual keyword spotting system, which caters for the Taiwanese speaking style, is presented. Detailed descriptions and discussions for developing the mixed-lingual auto attendant system are included, especially for reconciling the different scoring scales of the two languages in the decoding phase and the re-scoring phase. In the decoding phase, we propose a bias-compensation method to bridge the score gap in the likelihood calculation when using Chinese and English acoustic models. To select the most probable result from the recognized hypotheses, a method is also presented for normalizing the combined scores when different scoring mechanisms are used in the re-scoring phase.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"8 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117047038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"MCE-based training of subspace distribution clustering HMM","authors":"Xiao-Bing Li, Lirong Dai, Ren-Hua Wang","doi":"10.1109/CHINSL.2004.1409599","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409599","url":null,"abstract":"For resource-limited platforms, the subspace distribution clustering hidden Markov model (SDCHMM) is preferable to the continuous density hidden Markov model (CDHMM) because of its smaller storage and lower computation requirements, while maintaining decent recognition performance. However, the usual method of obtaining an SDCHMM does not ensure optimality in classifier design. In order to obtain an optimal classifier, a new SDCHMM training algorithm that adjusts the parameters of the SDCHMM according to the minimum classification error (MCE) criterion is proposed in this paper. Our experimental results on the TiDigits and RM tasks show that the MCE-based SDCHMM training algorithm provides 15-80% word error rate reduction (WERR) compared with the normal SDCHMM converted from a CDHMM.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121053330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}