{"title":"On noise robustness of dynamic and static features for continuous Cantonese digit recognition","authors":"Chen Yang, F. Soong, Tan Lee","doi":"10.1109/CHINSL.2004.1409640","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409640","url":null,"abstract":"It has been shown previously that augmented spectral features (static and dynamic cepstra) are effective for improving ASR performance in a clean environment. In this paper we investigate the noise robustness of static and dynamic cepstral features in a speaker-independent, continuous recognition task, using a noise-added Cantonese digit database (CUDigit). We found that the dynamic cepstrum is more robust to additive background noise than its static counterpart. The results are consistent across different types of noise and under various SNRs. Exponential weights, which can exploit the unequal robustness of the two features, are optimally trained on a development set. A relative word error rate reduction of 41.9%, mainly from a significant reduction in insertions, is obtained on the test data under various noise and SNR conditions.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127246867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis and synthesis of Cantonese F/sub 0/ contours based on the command-response model","authors":"Wentao Gu, K. Hirose, H. Fujisaki","doi":"10.1109/CHINSL.2004.1409617","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409617","url":null,"abstract":"Cantonese is a well-known Chinese dialect with a quite complex tone system. We have applied the command-response model to represent F/sub 0/ contours of Cantonese speech by defining a set of appropriate tone command patterns. In this paper, the analysis is extended to Cantonese utterances at three different speech rates. By incorporating the effects of tone coarticulation, word accentuation and phrase intonation, the model gives high accuracy of approximations to Cantonese speech F/sub 0/ contours, and hence provides a much better means to quantitatively describe the F/sub 0/ contours than the traditional 5-level tone code system. The distributions of timing and amplitudes of commands are investigated, based on which a set of rules is used for synthesis of Cantonese F/sub 0/ contours. The validity of the current approach is confirmed by perceptual evaluation of Cantonese synthetic speech.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131322954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Use of direct modeling in natural language generation for Chinese and English translation","authors":"Fu-hua Liu, Yuqing Gao","doi":"10.1109/CHINSL.2004.1409650","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409650","url":null,"abstract":"This paper proposes a new direct-modeling-based approach to improve the maximum entropy based natural language generation (NLG) in the IBM MASTOR system, an interlingua-based speech translation system. Due to the intrinsic disparity between Chinese and English sentences, the previous method employed only linguistic constituents from output language sentences to train the NLG model. The new algorithm exploits a direct-modeling scheme to admit linguistic constituent information from both source and target languages into the training process seamlessly when incorporating a concept padding scheme. When concept sequences from the top level of semantic parse trees are considered, the concept error rate (CER) is significantly reduced to 14.3%, compared to 23.9% in the baseline NLG. Similarly, when concept sequences from all levels of semantic parse trees are tested, the direct-modeling scheme yields a CER of 10.8% compared to 17.8% in the baseline. A sensible improvement on the overall translation is made when the direct-modeling scheme improves the BLEU score from 0.252 to 0.294.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115310358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enabling natural computing","authors":"Xuedong Huang","doi":"10.1109/CHINSL.2004.1409565","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409565","url":null,"abstract":"Summary form only given. We are entering the third generation of the human-computer interface. In contrast to the first and second generations, where human users had to learn arcane command languages or graphical icons to operate computers in the ways the computers were designed, the third-generation interface will allow users to express their intents naturally by shifting the burden of understanding what it takes to interact from the human to the computer. Natural computing will become mainstream in the near future and could dramatically improve the quality of our daily lives. Spoken language technologies play a central role in natural computing. Spoken language is a modality that can offer a consistent means of interaction across a variety of computer form factors and a wide range of hands-free, eyes-free environments. Technology advancements in this area have made impressive progress, so that the prevalence of the spoken language interface is no longer a question of \"whether\" but \"when\". In this paper, the author summarizes the recent progress of industry and academia in bringing natural computing to the mass market.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127796704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A New Two-Layer Approach for Spoken Language Translation","authors":"Jhing-Fa Wang, Shun-Chieh Lin, Hsueh-Wei Yang","doi":"10.1109/CHINSL.2004.1409651","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409651","url":null,"abstract":"The paper proposes a new two-layer approach for spoken language translation. First, we develop translated examples and transform them into speech signals. Second, to retrieve a translated example properly by analyzing speech signals, we expand the translated example into two layers: an intention layer and an object layer. The intention layer is used to examine the intention similarity between the speech input and the translated example. The object layer is used to identify the objective components of the examined intention. Experiments were conducted with Chinese and English. The results revealed that our proposed approach achieves understandable translation rates of about 86% and 76% for Chinese-to-English and English-to-Chinese translation, respectively.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134403396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Tone recognition for Chinese speech: a comparative study of Mandarin and Cantonese","authors":"Gang Peng, Hongying Zheng, William S-Y. Wang","doi":"10.1109/CHINSL.2004.1409629","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409629","url":null,"abstract":"The paper presents a comparative study on automatic continuous tone recognition for Mandarin and Cantonese. Compared with Mandarin, Cantonese has a much more complex tone system. The effects of F/sub 0/ normalization on the tone recognition of Mandarin and Cantonese are studied. Furthermore, the two tone systems are compared from an engineering point of view. Tone recognition accuracies of 71.50% and 83.06% have been obtained for Cantonese and Mandarin respectively. These results compare favorably with results reported for other tone recognition experiments on the same (for Cantonese) and similar (for Mandarin) databases.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115330671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Discriminative transform for confidence estimation in Mandarin speech recognition","authors":"Gang Guo, Ren-Hua Wang","doi":"10.1109/CHINSL.2004.1409638","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409638","url":null,"abstract":"In automatic speech recognition (ASR) applications, the log likelihood ratio test (LRT) is one of the most popular techniques for obtaining a confidence measure (CM). Unlike traditional log likelihood ratio (LLR) based methods, we apply nonlinear transformations to the LLR before computing the string-level CM. Different phonemes may have different transformation functions. Through suitable LLR transformations, the verification performance of the string-level CM may increase. The transformation functions are implemented by a multilayer perceptron (MLP). Two algorithms are used to optimize the parameters of the MLP: one is the minimum verification error (MVE) algorithm; the other is the figure-of-merit (FOM) training algorithm. In our Mandarin command recognition system, the two methods remarkably improve the performance of confidence measures for out-of-vocabulary word rejection compared with the standard LRT-based CM, and we obtain a best relative reduction of 45.5% in equal error rate (EER). In addition, in our Mandarin command recognition experiments, the FOM training algorithm outperforms the MVE algorithm even though the two achieve approximately the same best performance; owing to the limited experimental setups, which algorithm is better still needs to be explored.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114509451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Minimum classification error rate pattern recognition approach for speech and language processing","authors":"W. Chou","doi":"10.1109/CHINSL.2004.1409568","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409568","url":null,"abstract":"Summary form only given. Minimum classification error (MCE) rate pattern recognition approach is a fast moving research area and broadly applied to pattern recognition problems in speech and language processing. We give an overview of the basic MCE classifier design algorithms as well as the more advanced extensions of the MCE approach. We differentiate the classifier design by way of distribution estimation and by way of the discriminant function methods according to the minimum classification error rate paradigm. We study the practical issues in system implementation and highlight the application perspectives of applying MCE classifier design to practical speech and language processing systems.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123891015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Quantization of SEW and REW magnitude for 2 kb/s waveform interpolation speech coding","authors":"Jing Li, C. Bao","doi":"10.1109/CHINSL.2004.1409606","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409606","url":null,"abstract":"The paper presents quantization schemes for the magnitude spectra of the slowly evolving waveform (SEW) and rapidly evolving waveform (REW) components in a 2 kb/s waveform interpolation (WI) coder. The SEW magnitude spectrum is quantized using a DCT-based predictive vector quantization approach. The REW magnitude spectrum is quantized using a matrix quantizer based on the combined dimension conversion method. Objective measures and subjective results indicate that the proposed quantization schemes are effective in achieving good quantization accuracy.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123891911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A framework for fast segment model by avoidance of redundant computation on segment","authors":"Yun Tang, Wenju Liu, Yiyan Zhang, Bo Xu","doi":"10.1109/CHINSL.2004.1409600","DOIUrl":"https://doi.org/10.1109/CHINSL.2004.1409600","url":null,"abstract":"The segment model (SM) is a family of methods that use segmental distributions rather than frame-based features (as in the HMM) to represent the underlying characteristics of the observation sequence. It has been shown to be more precise than the HMM. However, its high complexity prevents these models' use in practical systems. In this paper we present a framework to reduce the computational complexity of the segment model by fixing the number of basic units in a segment so that intermediate computation results can be shared. Our work is twofold. First, we compare the complexity of the SM with that of the HMM and propose a fast SM framework based on the comparison. Second, we use two examples to illustrate this framework. The fast SM achieves better performance than the HMM-based system while keeping the computational complexity of the SM at the same level as the HMM.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125880703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}