{"title":"A method of generating uniformly distributed sequences over [0,K], where K+1 is not a power of two","authors":"R. Kuehnel, Yuke Wang","doi":"10.1109/ICASSP.2003.1202488","DOIUrl":"https://doi.org/10.1109/ICASSP.2003.1202488","url":null,"abstract":"A new methodology has been recently proposed for the efficient generation of multiple pseudo-random bit sequences that are statistically uncorrelated [1]. Random sequences that are uniformly distributed over a range [0,K], where K+1 is a power of 2, can be constructed by forming a vector of M independent bit sequences, where M=log_2(K+1). We demonstrate that this method of construction represents a special case of a more generalized approach in which K can be any positive integer. The procedures described here can be used to efficiently generate multiple independent random sequences that are uniformly distributed over any range.","PeriodicalId":104473,"journal":{"name":"2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123669697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Time-domain method for tracking dispersive channels in MIMO OFDM systems","authors":"T. Roman, M. Enescu, V. Koivunen","doi":"10.1109/ICASSP.2003.1202662","DOIUrl":"https://doi.org/10.1109/ICASSP.2003.1202662","url":null,"abstract":"In this paper we address the problem of channel estimation for multiple-input multiple-output OFDM systems for mobile users. A channel tracking and equalization method stemming from Kalman filtering is proposed for time-frequency selective channels. Tracking of the MIMO channel matrix is performed in the time domain and equalization in the frequency domain. The computational complexity is significantly reduced by applying the matrix inversion lemma. Simulation results are presented using a realistic channel model in typical urban scenarios.","PeriodicalId":104473,"journal":{"name":"2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125756902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Schemes for error resilient streaming of perceptually coded audio","authors":"J. Korhonen, Ye-Kui Wang","doi":"10.1109/ICASSP.2003.1200077","DOIUrl":"https://doi.org/10.1109/ICASSP.2003.1200077","url":null,"abstract":"This paper presents novel extensions to our earlier system for streaming perceptually coded audio over error prone channels such as Mobile IP. To improve error robustness while maintaining bandwidth efficiency, the new extensions combine the strength of an error resilient coding scheme in the sender, a prioritized packet transport scheme in the network and a compressed domain error concealment strategy in the terminal. Different concealment methods are used for each part of the coded audio data according to their perceptual importance and statistical characteristics. In our current implementation, we employed MPEG-2 Advanced Audio Coding (AAC) encoded bitstreams and an RTP/UDP-based test system for performance evaluation. Simulation results have shown that our improved streaming system is more robust against packet losses in comparison with conventional methods.","PeriodicalId":104473,"journal":{"name":"2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127967914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A comparison of subspace analysis for face recognition","authors":"Jian Li, S. Zhou, C. Shekhar","doi":"10.1109/ICASSP.2003.1199122","DOIUrl":"https://doi.org/10.1109/ICASSP.2003.1199122","url":null,"abstract":"We report the results of a comparative study on subspace analysis methods for face recognition. In particular, we have studied four different subspace representations and their 'kernelized' versions if available. They include both unsupervised methods such as principal component analysis (PCA) and independent component analysis (ICA), and supervised methods such as Fisher discriminant analysis (FDA) and probabilistic PCA (PPCA) used in a discriminative manner. The 'kernelized' versions of these methods provide subspaces of high-dimensional feature spaces induced by non-linear mappings. To test the effectiveness of these subspace representations, we experiment on two databases with three typical variations of face images, i.e., pose, illumination and facial expression changes. The comparison of these methods applied to different variations in face images offers a comprehensive view of all the subspace methods currently used in face recognition.","PeriodicalId":104473,"journal":{"name":"2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116087203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"HMM-neural network monophone models for computer-based articulation training for the hearing impaired","authors":"M. Devarajan, Fansheng Meng, P. Hix, S. Zahorian","doi":"10.1109/ICASSP.2003.1202373","DOIUrl":"https://doi.org/10.1109/ICASSP.2003.1202373","url":null,"abstract":"A visual speech training aid for persons with hearing impairments has been developed using a Windows-based multimedia computer. Previous papers (Zahorian, S. et al., Int. Conf. on Spoken Language Processing, 2002; Zahorian and Nossair, Z.B., IEEE Trans. on Speech and Audio Processing, vol.7, no.4, p.414-25, 1999; Zimmer, A. et al., ICASSP, vol.6, p.3625-8, 1998; Zahorian and Jagharghi, A., J. Acoust. Soc. Amer., vol.94, no.4, p.1966-82, 1993) have described the signal processing steps and display options for giving real-time feedback about the quality of pronunciation for 10 steady-state American English monophthong vowels (/aa/, /iy/, /uw/, /ae/, /er/, /ih/, /eh/, /ao/, /ah/, and /uh/). This vowel training aid is thus referred to as a vowel articulation training aid (VATA). We now describe methods to develop a monophone-based hidden Markov model/neural network recognizer such that real-time visual feedback can be given about the quality of pronunciation of short words and phrases. Experimental results are reported which indicate a high degree of accuracy for labeling and segmenting the CVC (consonant-vowel-consonant) database developed for \"training\" the display.","PeriodicalId":104473,"journal":{"name":"2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128067532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Watermarking of 3D models using principal component analysis","authors":"Andreas Kalivas, A. Tefas, I. Pitas","doi":"10.1109/ICASSP.2003.1200061","DOIUrl":"https://doi.org/10.1109/ICASSP.2003.1200061","url":null,"abstract":"A novel method for 3D model watermarking, robust to geometric distortions such as rotation, translation and scaling, is proposed. A ternary watermark is embedded in the vertex topology of a 3D model. A transformation of the model to an invariant space is proposed prior to watermark embedding. Simulation results indicate the ability of the proposed method to deal with the aforementioned attacks giving very good results.","PeriodicalId":104473,"journal":{"name":"2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131899850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using phone and diphone based acoustic models for voice conversion: a step towards creating voice fonts","authors":"Arun Kumar, Ashish Verma","doi":"10.1109/ICASSP.2003.1198882","DOIUrl":"https://doi.org/10.1109/ICASSP.2003.1198882","url":null,"abstract":"Voice conversion techniques attempt to modify the speech signal so that it is perceived as if spoken by another speaker, different from the original speaker. In this paper, we present a novel approach to perform voice conversion. Our approach uses acoustic models based on units of speech, like phones and diphones, for voice conversion. These models can be computed and used independently for a given speaker without being concerned about the source or target speaker. It avoids the use of a parallel speech corpus in the voices of source and target speakers. It is shown that by using the proposed approach, voice fonts can be created and stored which represent individual characteristics of a particular speaker, to be used for customization of synthetic speech. We also show through objective and subjective tests, that voice conversion quality is comparable to other approaches that require a parallel speech corpus.","PeriodicalId":104473,"journal":{"name":"2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124944972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A probabilistic approach for blind source separation of underdetermined convolutive mixtures","authors":"J. M. Peterson, S. Kadambe","doi":"10.1109/ICASSP.2003.1201748","DOIUrl":"https://doi.org/10.1109/ICASSP.2003.1201748","url":null,"abstract":"There are very few techniques that can separate signals from the convolutive mixture in the underdetermined case. We have developed a method that uses overcomplete expansion of the signal created with a time-frequency transform and that also uses the property of sparseness and a Laplacian source density model to obtain the source signals from the instantaneously mixed signals in the underdetermined case. This technique has been extended here to separate signals (a) in the case of underdetermined convolutive mixtures, and (b) in the general case of more than 2 mixtures. Here, we also propose a geometric-constraint-based search approach to significantly reduce the computational time of our original \"dual update\" algorithm. Several examples are provided. The results of signal separation from the convolutive mixtures indicate that an average signal to noise ratio improvement of 5.3 dB can be obtained.","PeriodicalId":104473,"journal":{"name":"2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122169947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Oscillatory gestures and discourse","authors":"Francis K. H. Quek, Yingen Xiong","doi":"10.1109/ICASSP.2003.1200090","DOIUrl":"https://doi.org/10.1109/ICASSP.2003.1200090","url":null,"abstract":"Gesture and speech are part of a single human language system. They are co-expressive and complementary channels in the act of speaking. While speech carries the major load of symbolic presentation, gesture provides the imagistic content. Proceeding from the established contemporality of gesture and speech, we discuss our work on oscillatory gestures and speech. We present our wavelet-based approach in gestural oscillation extraction as geodesic ridges in frequency-time space. We motivate the potential of such computational cross-modal language analysis by performing a micro analysis of a video dataset in which a subject describes her living space. We demonstrate the ability of our algorithm to extract gestural oscillations and show how oscillatory gestures reveal portions of the discourse structure.","PeriodicalId":104473,"journal":{"name":"2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123924764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unconstrained motion compensated temporal filtering (UMCTF) framework for wavelet video coding","authors":"M. Schaar, D. Turaga","doi":"10.1109/ICASSP.2003.1199112","DOIUrl":"https://doi.org/10.1109/ICASSP.2003.1199112","url":null,"abstract":"The paper presents a new framework for adaptive temporal filtering in wavelet interframe codecs, called unconstrained motion compensated temporal filtering (UMCTF). This framework allows flexible and efficient temporal filtering by combining the best features of motion compensation, used in predictive coding, with the advantages of interframe scalable wavelet video coding schemes. UMCTF provides higher coding efficiency, improved visual quality, flexibility of temporal and spatial scalability, and lower decoding delay than conventional MCTF schemes. Furthermore, UMCTF can also be employed in alternative open-loop scalable coding frameworks using DCT for the texture coding.","PeriodicalId":104473,"journal":{"name":"2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).","volume":"257 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123965254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}