IEEE Trans. Speech Audio Process.: latest articles, page 9

The multimode transform predictive coding paradigm

IEEE Trans. Speech Audio Process. Pub Date: 2003-04-15 DOI: 10.1109/TSA.2003.809195

S. Ramprashad

Citations: 17

Iterated partitioned block frequency-domain adaptive filtering for acoustic echo cancellation

IEEE Trans. Speech Audio Process. Pub Date: 2003-04-15 DOI: 10.1109/TSA.2003.809194

K. Eneman, M. Moonen

Citations: 5

Optimizing feature extraction for speech recognition

IEEE Trans. Speech Audio Process. Pub Date: 2003-02-19 DOI: 10.1109/TSA.2002.805644

Chulhee Lee, Donghoon Hyun, E. Choi, Jinwook Go, Chungyong Lee

Citations: 56

A formant filtered physical model for wind instruments

IEEE Trans. Speech Audio Process. Pub Date: 2003-02-19 DOI: 10.1109/TSA.2002.807351

A. Nackaerts, B. Moor, R. Lauwereins

Citations: 2

A robust online secondary path modeling method with auxiliary noise power scheduling strategy and norm constraint manipulation

IEEE Trans. Speech Audio Process. Pub Date: 2003-02-19 DOI: 10.1109/TSA.2003.805643

Ming Zhang, H. Lan, W. Ser

Abstract: In many practical cases for active noise control (ANC), the online secondary path modeling methods that use auxiliary noise are often applied. However, the auxiliary noise contributes to residual noise, and thus deteriorates the noise control performance of ANC systems. Moreover, a sudden and large change in the secondary path leads to easy divergence of the existing online secondary path modeling methods. To mitigate these problems, this paper proposes a new online secondary path modeling method with auxiliary noise power scheduling and adaptive filter norm manipulation. The auxiliary noise power is scheduled based on the convergence status of an ANC system with consideration of the variation of the primary noise. The purpose is to alleviate the increment of the residual noise due to the auxiliary noise. In addition, the norm manipulation is applied to adaptive filters in the ANC system. The objective is to avoid over-updates of adaptive filters due to the sudden large change in the secondary path and thus prevent the ANC system from diverging. Computer simulations show the effectiveness and robustness of the proposed method.

Pages: 45-53
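
As a rough illustration of the norm-manipulation idea mentioned in the abstract above, the sketch below applies one plain LMS step to a filter's weights and then rescales them if their Euclidean norm exceeds a bound. This is not the paper's algorithm: the function name `lms_update_with_norm_clip`, the parameters `mu` and `max_norm`, and the choice of simple rescaling are all assumptions made for illustration.

```python
import math


def lms_update_with_norm_clip(w, x, e, mu, max_norm):
    """One LMS step, then rescale the weights if their norm exceeds max_norm.

    w        : current filter weights (list of floats)
    x        : current input vector, same length as w
    e        : current error sample
    mu       : step size (hypothetical parameter)
    max_norm : upper bound on the weight norm (hypothetical parameter)
    """
    # Standard LMS update: w <- w + mu * e * x
    w_new = [wi + mu * e * xi for wi, xi in zip(w, x)]

    # Norm constraint: shrink the whole vector back onto the bound
    # so that a single large error cannot cause an over-update.
    norm = math.sqrt(sum(wi * wi for wi in w_new))
    if norm > max_norm:
        scale = max_norm / norm
        w_new = [wi * scale for wi in w_new]
    return w_new
```

Rescaling keeps the direction of the update while limiting its size, which is one simple way to stop a sudden change in the modeled path from driving the weights far off in a single step.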

Citations: 86

Noise reduction and echo cancellation front-end for speech codecs

IEEE Trans. Speech Audio Process. Pub Date: 2003-02-19 DOI: 10.1109/TSA.2002.807350

F. Basbug, K. Swaminathan, S. Nandkumar

Citations: 24

On the computation of the Kullback-Leibler measure for spectral distances

IEEE Trans. Speech Audio Process. Pub Date: 2003-02-19 DOI: 10.1109/TSA.2002.805641

R. Veldhuis, E. Klabbers

Citations: 29

Discriminative training of natural language call routers

IEEE Trans. Speech Audio Process. Pub Date: 2003-02-19 DOI: 10.1109/TSA.2002.807352

H. Kuo, Chin-Hui Lee

Abstract: This paper shows how discriminative training can significantly improve classifiers used in natural language processing, using as an example the task of natural language call routing, where callers are transferred to desired departments based on natural spoken responses to an open-ended "How may I direct your call?" prompt. With vector-based natural language call routing, callers are transferred using a routing matrix trained on statistics of occurrence of words and word sequences in a training corpus. By re-training the routing matrix parameters using a minimum classification error criterion, a relative error rate reduction of 10-30% was achieved on a banking task. Increased robustness was demonstrated in that with 10% rejection, the error rate was reduced by 40%. Discriminative training also improves portability; we were able to train call routers with the highest known performance using as input only text transcription of routed calls, without any human intervention or knowledge about what terms are important or irrelevant for the routing task. This strategy was validated with both the banking task and a more difficult task involving calls to operators in the UK. The proposed formulation is applicable to algorithms addressing a broad range of speech understanding, information retrieval, and topic identification problems.

Pages: 24-35
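
The vector-based routing step described in the abstract above can be sketched as follows: build one term-count row per destination from transcribed calls, then send a new request to the destination whose row has the highest cosine similarity with the request's term vector. This covers only a baseline routing matrix, not the minimum classification error re-training the paper proposes. The function names, the whitespace tokenization, the raw-count weighting, and the toy training data are all assumptions.

```python
import math


def build_routing_matrix(training):
    """Build a term index and one term-count row per destination.

    training: list of (transcript, destination) pairs (toy data, hypothetical).
    """
    vocab = sorted({w for text, _ in training for w in text.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    matrix = {}
    for text, dest in training:
        row = matrix.setdefault(dest, [0.0] * len(vocab))
        for w in text.lower().split():
            row[index[w]] += 1.0
    return index, matrix


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def route(query, index, matrix):
    """Return the destination whose row is most similar to the query vector."""
    q = [0.0] * len(index)
    for w in query.lower().split():
        if w in index:  # words unseen in training are ignored
            q[index[w]] += 1.0
    return max(matrix, key=lambda dest: cosine(q, matrix[dest]))


training = [
    ("check my account balance", "accounts"),
    ("report a lost credit card", "cards"),
]
index, matrix = build_routing_matrix(training)
destination = route("lost card", index, matrix)
```

Discriminative training, as described in the abstract, would then adjust the entries of `matrix` to reduce classification errors on the training calls rather than leaving them as raw counts.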

Citations: 65

Filter bank design for subband adaptive microphone arrays

IEEE Trans. Speech Audio Process. Pub Date: 2003-02-19 DOI: 10.1109/TSA.2002.807353

Jan Mark de Haan, N. Grbic, I. Claesson, S. Nordholm

Citations: 69

Linear regression based Bayesian predictive classification for speech recognition

IEEE Trans. Speech Audio Process. Pub Date: 2003-02-19 DOI: 10.1109/TSA.2002.805640

Jen-Tzung Chien

Abstract: The uncertainty in parameter estimation due to the adverse environments deteriorates the classification performance for speech recognition. It becomes crucial to incorporate the parameter uncertainty into decision so that the classification robustness can be assured. We propose a novel linear regression based Bayesian predictive classification (LRBPC) for robust speech recognition. This framework is constructed under the paradigm of linear regression adaptation of speech hidden Markov models (HMMs). Because the regression mapping between HMMs and adaptation data is ill posed, we properly characterize the uncertainty of regression parameters using a joint Gaussian distribution. A closed-form predictive distribution can be derived to set up the LRBPC decision for speech recognition. Such decision is robust compared to the plug-in maximum a posteriori (MAP) decision adopted in the maximum likelihood linear regression (MLLR) and MAP linear regression (MAPLR). Since the specified distribution belongs to the conjugate prior family, the evolutionary hyperparameters are established. With the statistically rich hyperparameters, the LRBPC achieves decision robustness. In the experiments, we find that LRBPC decision in cases of general linear regression as well as single variable linear regression attains significantly better recognition performance than MLLR and MAPLR adaptation.

Pages: 70-79

Citations: 28