Proceedings : ICSLP. International Conference on Spoken Language Processing: latest articles

F0 declination in read-aloud and spontaneous speech

Proceedings : ICSLP. International Conference on Spoken Language Processing Pub Date : 1996-10-03 DOI: 10.21437/ICSLP.1996-387

M. Swerts, E. Strangert, M. Heldner

Citations: 47

Modeling segmental duration in German text-to-speech synthesis

Proceedings : ICSLP. International Conference on Spoken Language Processing Pub Date : 1996-10-03 DOI: 10.21437/ICSLP.1996-601

Bernd Möbius, J. V. Santen

Citations: 56

Distinctions between [t] and [tch] using electropalatography data

Proceedings : ICSLP. International Conference on Spoken Language Processing Pub Date : 1996-10-03 DOI: 10.21437/ICSLP.1996-410

S. Mair, C. Scully, C. Shadle

Citations: 10

Dynamic control of a production model

Proceedings : ICSLP. International Conference on Spoken Language Processing Pub Date : 1996-10-03 DOI: 10.21437/ICSLP.1996-583

L. Candille, H. Meloni

Citations: 1

Evaluation of the Telefónica I+D natural numbers recognizer over different dialects of Spanish from Spain and America

Proceedings : ICSLP. International Conference on Spoken Language Processing Pub Date : 1996-10-03 DOI: 10.21437/ICSLP.1996-515

C. D. L. Torre, Francisco Javier Caminero Gil, J. Alvarez-Cercadillo, C. M. D. Alamo, L. A. H. Gómez

Citations: 2

Effects of auditory feedback on F0 trajectory generation

Proceedings : ICSLP. International Conference on Spoken Language Processing Pub Date : 1996-10-03 DOI: 10.21437/ICSLP.1996-97

Hideki Kawahara, H. Kato, J. C. Williams

Citations: 12

Speaker adaptation by modeling the speaker variation in a continuous speech recognition system

Proceedings : ICSLP. International Conference on Spoken Language Processing Pub Date : 1996-10-03 DOI: 10.21437/ICSLP.1996-249

Nikko Ström

Abstract: A method for unsupervised instantaneous speaker adaptation is presented and evaluated on a continuous speech recognition task in a man-machine dialogue system. The method is based on modeling of the systematic speaker variation. The variation is modeled by a low-dimensional speaker space and the classification of speech segments is conditioned by the position in the speaker space. Because the effect of the speaker space position on the classification is determined in an off-line training procedure using the speakers in a training database, complex systematic speaker variation can be modeled. Speaker adaptation is achieved only by the constraint that the position in the speaker space is constant over each utterance. Therefore, no separate adaptation session is needed and the adaptation is present from the first utterance. Consequently, for a user there is no noticeable difference between this system and a speaker-independent system. The speaker model and the phonetic classification are implemented in the ANN part of a hybrid ANN/HMM system. In experiments with a pilot system, word accuracy is improved for utterances longer than three words and utterance level results are improved for utterances of all lengths.

Pages: 989-992

Citations: 21

Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds

Proceedings : ICSLP. International Conference on Spoken Language Processing Pub Date : 1996-10-03 DOI: 10.21437/ICSLP.1996-556

J. Köhler

Abstract: The aim of the work is to exploit the acoustic-phonetic similarities between several languages. In recent work cross-language HMM-based phoneme models have been used only for bootstrapping the language-dependent models and the multi-lingual approach has been investigated only on very small speech corpora. The author introduces a statistical distance measure to determine the similarities of sounds. Further, he presents a new technique to model multi-lingual phonemes. The experiments are conducted with the OGI Multi-Language Telephone Speech Corpus for the languages American English, German and Spanish. In the first experiment phoneme recognition rates between 39.0% and 53.9% are achieved using language-dependent models. Using cross-language models yields improvement for some phonemes, but on average a degradation of recognition performance is observed. However, cross-language models speed up the cross-language transfer and reduce the size of the phoneme inventory of multi-lingual speech recognition systems. Finally, a new method of modelling multi-lingual phonemes, which can be used for a variety of languages, is presented. This technique reduces the number of phoneme-based units in a multi-lingual speech recognition system.

Pages: 2195-2198

Citations: 101

Clinical applications of computer-based speech training for children with hearing impairment

Proceedings : ICSLP. International Conference on Spoken Language Processing Pub Date : 1996-10-03 DOI: 10.21437/ICSLP.1996-40

Anne-Marie Öster

Citations: 17

The natural language processing module for a voice assisted operator at Telefónica I+D

Proceedings : ICSLP. International Conference on Spoken Language Processing Pub Date : 1996-01-01 DOI: 10.21437/ICSLP.1996-265

J. Alvarez-Cercadillo, Francisco Javier Caminero Gil, C. Crespo-Casas, D. Merino

Citations: 0