5th International Conference on Spoken Language Processing (ICSLP 1998)最新文献

A novel method of formant analysis and glottal inverse filtering 一种新的形成峰分析和声门反滤波方法

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI: 10.21437/ICSLP.1998-543

Steve Pearson

引用次数: 1

High-speed speaker adaptation using phoneme dependent tree-structured speaker clustering 基于音素相关树形说话人聚类的高速说话人自适应

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI: 10.21437/ICSLP.1998-745

Motoyuki Suzuki, T. Abe, H. Mori, S. Makino, H. Aso

引用次数: 2

Toward on-line learning of Chinese continuous speech recognition system 汉语连续语音识别系统的在线学习研究

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI: 10.21437/ICSLP.1998-748

Rong Zheng, Zuoying Wang

引用次数: 0

Unsupervised training of a speech recognizer using TV broadcasts 使用电视广播对语音识别器进行无监督训练

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI: 10.21437/ICSLP.1998-632

T. Kemp, A. Waibel

{"title":"Unsupervised training of a speech recognizer using TV broadcasts","authors":"T. Kemp, A. Waibel","doi":"10.21437/ICSLP.1998-632","DOIUrl":"https://doi.org/10.21437/ICSLP.1998-632","url":null,"abstract":"Current speech recognition systems require large amounts of transcribed data for parameter estimation. The transcription, however, is tedious and expensive. In this work we describe our experiments which are aimed at training a speech recognizer without transcriptions. The experiments were carried out with TV newscasts, that were recorded using a satellite receiver and a simple MPEG coding hardware. The newscasts were automatically segmented into segments of similar acoustic background condition. This material is inexpensive and can be made available in large quantities, but there are no transcriptions available. We develop a training scheme, where a recognizer is boot-strapped using very little transcribed data and is improved using new, untranscribed speech. We show that it is neces-sary to use a con(cid:12)dence measure to judge the initial transcriptions of the recognizer before using them. Higher im-provements can be achieved if the number of parameters in the system is increased when more data becomes available. We show, that the bene(cid:12)cial e(cid:11)ect of unsupervised training is not compensated by MLLR adaptation on the hypothesis. In a (cid:12)nal experiment, the e(cid:11)ect of untranscribed data is compared with the e(cid:11)ect of transcribed speech. Using the described methods, we found that the untranscribed data gives roughly one third of the improvement of the transcribed material.","PeriodicalId":117113,"journal":{"name":"5th International Conference on Spoken Language Processing (ICSLP 1998)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115497599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 37

SIVHA, visual speech synthesis system 视觉语音合成系统

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI: 10.21437/ICSLP.1998-777

Y. Blanco, Maria Cuellar, A. Villanueva, Fernando Lacunza, R. Cabeza, B. Marcotegui

引用次数: 3

Convergence of fundamental frequencies in conversation: if it happens, does it matter? 对话中基本频率的收敛:如果发生了，有什么关系吗?

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI: 10.21437/ICSLP.1998-111

Belinda Collins

引用次数: 9

Multi-Span statistical language modeling for large vocabulary speech recognition 面向大词汇量语音识别的多跨度统计语言建模

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI: 10.21437/ICSLP.1998-640

J. Bellegarda

引用次数: 13

A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using k-toBI system 利用k-toBI系统开发了一种基于节奏标记数据库的韩文F0轮廓生成计算算法

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI: 10.21437/ICSLP.1998-34

Yong-Ju Lee, Sook-Hyang Lee, Jong-Jin Kim, Hyun-Ju Ko, Young-Il Kim, Sanghun Kim, Jung-Cheol Lee

引用次数: 4

The effect of fundamental frequency on Mandarin speech recognition 基频对普通话语音识别的影响

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI: 10.21437/ICSLP.1998-761

Sharlene A. Liu, S. Doyle, Allen Morris, Farzad Ehsani

引用次数: 10

ToBI accent type recognition ToBI重音类型识别

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI: 10.21437/ICSLP.1998-126

Arman Maghbouleh

引用次数: 9