2008 6th International Symposium on Chinese Spoken Language Processing: Latest Papers, Page 8

Pronunciation Space Models for Pronunciation Evaluation

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.17

Si Wei, Yi-Qian Pan, Guoping Hu, Yu Hu, Ren-Hua Wang

Citations: 4

Word Alignment Based on Multi-Grain Model

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.79

Yanqing He, Yu Zhou, Chengqing Zong

Citations: 2

Frequency Modulation Technique for Prosodic Modification

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.41

Jinfu Ni, S. Sakai, Tohru Shimizu, Satoshi Nakamura

Citations: 4

PLSA Based Topic Mixture Language Modeling Approach

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.58

Shuanhu Bai, Haizhou Li

Citations: 0

The Use of Dynamic Deformable Templates for Lip Tracking in an Audio-Visual Corpus with Large Variations in Head Pose, Face Illumination and Lip Shapes

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.104

Zhiyong Wu, Jiying Wu, H. Meng

Abstract: This paper describes an approach for lip tracking using dynamic deformable templates. The objective is to track lip parameters from an audio-visual corpus recording a voice talent who is reading text prompts in a natural and expressive way. The corpus presents challenges to the conventional method of lip tracking with deformable templates. This is because natural and expressive speech includes relatively large motions of the head and the lips. The head motions lead to changes in the illumination of the face region and changes in the observed lip shape. In addition, emphatic pronunciations lead to large changes in the lip shape. Video frames that are affected by face illumination changes present additional difficulty in locating the mouth region (i.e. region of interest, ROI). Video frames that are affected by changes in lip shapes present additional deviations from the lip templates and hence lower tracking accuracies. Our proposed method incorporates "dynamicity" in the deformable templates to render them adaptive to changes in head pose, face illumination and lip shapes. Experiments show that dynamic deformable templates consistently outperform the conventional deformable templates in lip tracking.

Citations: 3

A Combined Task Analysis Method for Data Selection in Mandarin Isolated Word Recognition System

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.65

Z. He, Z. Wang, W. Li, J. Wu

Citations: 0

Pitch Tracking for Model-Based Speech Separation

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.48

Siu Wa Lee, F. Soong, P. Ching, Tan Lee

Citations: 4

Exploring Tone Variations in Chinese Dialects Using Context Dependent Tone Models

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.106

Wei Guo, Min Chu

Citations: 1

Pronunciation Error Detection for Computer Assisted Pronunciation Teaching in Mandarin

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/CHINSL.2008.ECP.98

Min-Siong Liang, Jian-Yung Hung, Ren-Yuan Lyu, Yuang-Chin Chiang

Abstract: This paper presents a strategy for pronunciation error detection and applies it to computer-assisted pronunciation teaching (CAPT), especially for Mandarin language learning. The system has two parts: sentence verification (SV) and syllable identification (SI). SV is used to reject out-of-task sentences. It uses a likelihood ratio test, computed between the maximum probabilities of a result under two different hypotheses (the null hypothesis and alternative hypothesis models), to measure the degree of deviation and decide whether the student's pronunciation is out-of-task. In the SV part, the experimental results were significant, with an F-score of 91.0%. SI is applied to recognize the content of the speech read by the speaker. The recognition network is built in a sausage shape, with a pronunciation confusion table corresponding to confusion error patterns. The system can then find the mispronounced syllables and give appropriate feedback to correct the user's pronunciation. In the SI stage, the best detection result was an F-score of 77.2%.
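The two quantities named in the abstract above, a likelihood ratio decision and an F-score, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the use of log-likelihoods, and the threshold value are assumptions made only for illustration, and the paper's actual models and decision threshold are not given in this listing.

```python
def is_out_of_task(loglik_null: float, loglik_alt: float,
                   threshold: float = 0.0) -> bool:
    """Hypothetical sentence-verification decision.

    loglik_null: log-likelihood of the utterance under the null hypothesis
                 (the expected, in-task sentence).
    loglik_alt:  log-likelihood under the alternative hypothesis model.
    threshold:   assumed decision threshold on the log-likelihood ratio.
    """
    log_likelihood_ratio = loglik_null - loglik_alt
    # A low ratio means the alternative model explains the speech better,
    # so the utterance is rejected as out-of-task.
    return log_likelihood_ratio < threshold


def f_score(true_pos: int, false_pos: int, false_neg: int) -> float:
    """Balanced F-score: the harmonic mean of precision and recall."""
    predicted = true_pos + false_pos
    actual = true_pos + false_neg
    precision = true_pos / predicted if predicted else 0.0
    recall = true_pos / actual if actual else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)
```

For example, `is_out_of_task(-150.0, -120.0)` rejects the utterance because the alternative model scores higher, and `f_score(8, 2, 2)` combines a precision and recall of 0.8 each into a single figure.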

Citations: 2

A Pitch Synchronous Method for Speech Modification

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI: 10.1109/chinsl.2008.ecp.73

Chih-Ting Kuo, Hsiao-Chuan Wang

Citations: 1