2008 6th International Symposium on Chinese Spoken Language Processing最新文献_第5页

A Two-Stage Multi-Feature Integration Approach to Unsupervised Speaker Change Detection in Real-Time News Broadcasting 实时新闻广播中无监督说话人变化检测的两阶段多特征集成方法

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.99

Lei Xie, Guangsen Wang

引用次数: 6

Word Order Correction for Language Transfer Using Relative Position Language Modeling 基于相对位置语言模型的语言迁移词序校正

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.20

Chao-Hong Liu, Chung-Hsien Wu, Matthew Harris

引用次数: 7

A Sample and Feature Selection Scheme for GMM-SVM Based Language Recognition 基于GMM-SVM的语言识别样本和特征选择方案

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.93

Yan Song, Lirong Dai

引用次数: 0

A New Similarity Measure Between HMMS 一种新的hmm间相似性度量方法

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.67

Yih-Ru Wang

引用次数: 1

Efficient System Combination for Syllable-Confusion-Network-Based Chinese Spoken Term Detection 基于音节混淆网络的汉语口语词汇检测系统组合

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.103

Jie Gao, Qingwei Zhao, Yonghong Yan, J. Shao

引用次数: 8

Microphone Array Post-Filter Based on Auditory Filtering 基于听觉滤波的麦克风阵列后置滤波

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.105

Peng Li, FengChai Liao, Ning Cheng, Bo Xu, Wenju Liu

引用次数: 0

Mandarin Tone Perception with Temporal Envelope and Periodicity Cues from Different Frequency Regions 基于时间包络和不同频率区域周期线索的普通话声调感知

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.96

Meng Yuan, Tan Lee, S. Soli

引用次数: 4

Mandarin Learning Using Speech and Language Technologies: A Translation Game in the Travel Domain 使用语音和语言技术学习普通话:旅游领域的翻译游戏

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.19

Yushi Xu, S. Seneff

引用次数: 8

Heteronym Verification for Mandarin Speech Synthesis 普通话语音合成的异义词验证

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.46

Heng Lu, Zhenhua Ling, Si Wei, Yu Hu, Lirong Dai, Ren-Hua Wang

引用次数: 2

Improved Semi-Parametric Mean Trajectory Model Using Discriminatively Trained Centroids 基于判别训练质心的改进半参数平均轨迹模型

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.63

Ran Xu, Jielin Pan, Yonghong Yan

{"title":"Improved Semi-Parametric Mean Trajectory Model Using Discriminatively Trained Centroids","authors":"Ran Xu, Jielin Pan, Yonghong Yan","doi":"10.1109/CHINSL.2008.ECP.63","DOIUrl":"https://doi.org/10.1109/CHINSL.2008.ECP.63","url":null,"abstract":"In order to alleviate the limitation of \"state output probability conditional independence\" assumption held by Hidden Markov models (HMMs) in speech recognition, a discriminative semi-parametric trajectory model was proposed in recent years, in which both means and variances in the acoustic models are modeled as time-varying variables. The time- varying information is modeled as a weighted contribution from all the \"centroids\", which can be viewed as the representation of the acoustic space. In previous literatures, such centroids are often obtained by clustering the Gaussians in the baseline acoustic models to some reasonable number or by training a baseline model with fewer Gaussian components. The centroids obtained in this way are maximum likelihood estimation of the acoustic space, which are relatively weak in discriminability compared to the discriminatively trained acoustic models. In this paper, we proposed an improved semi-parametric mean trajectory model training framework, in which the centroids are first discriminatively trained by minimum phone error criterion to provide a more discriminative representation of the acoustic space. This method was evaluated on the Mandarin digit string recognition task. The experimental result shows that our proposed method improves the recognition performance by a relative string error rate reduction of 7.5% compared to the traditional discriminative semi-parametric trajectory model, and it outperforms the baseline acoustic model trained with maximum likelihood criterion by a relative string error rate reduction of 28.6%.","PeriodicalId":291958,"journal":{"name":"2008 6th International Symposium on Chinese Spoken Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130443317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0