Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing最新文献

筛选
英文 中文
Using multiple vector quantization and semicontinuous hidden Markov models for speech recognition 利用多矢量量化和半连续隐马尔可夫模型进行语音识别
A. Peinado, J. C. Segura, A. Rubio, M. C. Benítez
{"title":"Using multiple vector quantization and semicontinuous hidden Markov models for speech recognition","authors":"A. Peinado, J. C. Segura, A. Rubio, M. C. Benítez","doi":"10.1109/ICASSP.1994.389355","DOIUrl":"https://doi.org/10.1109/ICASSP.1994.389355","url":null,"abstract":"Although the continuous HMM (CHMM) technique seems to be the most flexible and complete tool for speech modeling, it is not always used for the implementation of speech recognition systems due to several problems related to training and computational complexity. Besides, it is not clear the superiority of continuous models over other well-known types of HMMs, such as discrete (DHMM) or semicontinuous (SCHMM) models, or multiple vector quantization (MVQ) models, a new type of HMM modeling. The authors propose a new variant of HMM models, the SCMVQ, HMM models (semicontinuous multiple vector quantization HMM), that uses one VQ codebook per recognition unit and several quantization candidates, Formally, SCMVQ modeling is the closest one to CHMM, although requiring less computation than SCHMMs. Besides, the authors show that SCMVQs can obtain better recognition results than DHMMs, SCHMMs or MVQs.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133852969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
The automatic generation of 3D object model from range image 基于距离图像的三维物体模型自动生成
Wentao Zheng, H. Harashima
{"title":"The automatic generation of 3D object model from range image","authors":"Wentao Zheng, H. Harashima","doi":"10.1109/ICASSP.1994.389459","DOIUrl":"https://doi.org/10.1109/ICASSP.1994.389459","url":null,"abstract":"This paper presents an algorithmic procedure for the generation of a 3D wireframe model from range image. The basic idea is to represent a surface by dominant points which have important shape attributes. To determine dominant points, we propose a criterion based on 3D invariant characteristics of surfaces, which we call mean-square curvature. This quantity has some desirable properties and, compared to conventional Gaussian curvature or mean curvature, is more suitable for dominant point selection. It also allows a physical explanation. The 3D wireframe model is constructed by detecting dominant points followed by triangulating them in 3D space. The algorithms are tested on a real range image and the results are shown.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124114663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
"Whiter than white" noise “比白还白”的噪音
T. Durrani, A. R. Leyman, J. Soraghan
{"title":"\"Whiter than white\" noise","authors":"T. Durrani, A. R. Leyman, J. Soraghan","doi":"10.1109/ICASSP.1994.389789","DOIUrl":"https://doi.org/10.1109/ICASSP.1994.389789","url":null,"abstract":"This paper brings together two strands of current interest in signal processing. While second order techniques and minimum variance criteria are well understood, there is a growing requirement to study the performance criteria that involve higher order statistics in order to evaluate deviation from Gaussianity, linearity and stationarity of observed data. There is a complimentary requirement for the generation of random test sequences which have prescribed (or minimal) higher order statistics, to facilitate the analysis of systems in order to determine linearity/non-linearity, time invariance vs time varying parameters. This paper proposes a new method for minimising the third order cumulant spread of random sequences with symmetric pdf, and provides a closed form solution for the weightings required to achieve this. Numerous computed results are included to verify performance.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124117717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Demonstrations and applications of spoken language technology: highlights and perspectives from the 1993 ARPA Spoken Language Technology and Applications Day 口语技术的演示和应用:1993年ARPA口语技术和应用日的亮点和观点
C. Weinstein
{"title":"Demonstrations and applications of spoken language technology: highlights and perspectives from the 1993 ARPA Spoken Language Technology and Applications Day","authors":"C. Weinstein","doi":"10.1109/ICASSP.1994.389287","DOIUrl":"https://doi.org/10.1109/ICASSP.1994.389287","url":null,"abstract":"The ARPA Spoken Language Technology and Applications Day (SLTA'93) was a special workshop which presented a set of live, state-of-the-art demonstrations of speech recognition and spoken language understanding systems. The purpose of this paper is to provide perspective on current opportunities for applications of spoken language technology by summarizing the demonstrations and the related applications which they can enable, and reviewing the applications opportunities and needs cited by panelists and other members of the user community.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"144 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124397026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A constrained neural network with complex activation function: application to time-frequency analysis 复激活函数约束神经网络在时频分析中的应用
M. Ibnkahla, S. Puechmorel, F. Castanie
{"title":"A constrained neural network with complex activation function: application to time-frequency analysis","authors":"M. Ibnkahla, S. Puechmorel, F. Castanie","doi":"10.1109/ICASSP.1994.389598","DOIUrl":"https://doi.org/10.1109/ICASSP.1994.389598","url":null,"abstract":"Many signal processing problems need to be solved in an adaptive way under some constraints. The paper introduces a constrained complex-valued neural network (CCNN) model. It is composed of two sub networks: a master which gives the main energy function (the error power between the master's output and a desired output), and a slave which gives a secondary energy function (related to the constraints imposed by the problem). The sum of these energy functions gives the cost function to be minimized by the CCNN. An extension of the classical back propagation algorithm to the complex plane, under some inequality constraints, is used for the training process. This model finds a natural application in the time-frequency analysis as it gives direct access to the time-frequency signature.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"194 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114304191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multiple testing for seismic data using bootstrap 利用自举法对地震数据进行多次测试
D. Maiwald, J. Böhme
{"title":"Multiple testing for seismic data using bootstrap","authors":"D. Maiwald, J. Böhme","doi":"10.1109/ICASSP.1994.389900","DOIUrl":"https://doi.org/10.1109/ICASSP.1994.389900","url":null,"abstract":"Investigates the estimation of both the number of waves and of the wave parameters for wavefields in a geophysical application. A parametric method for wave parameter estimation in connection with a multiple test procedure is presented. The distribution of the corresponding test statistic for wideband data is approximated by the central limit theorem and alternatively by a bootstrap procedure. Finally the application of the algorithms to real seismic data is studied.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114439879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
A robust FLS algorithm for linearly-constrained adaptive filtering 线性约束自适应滤波的鲁棒FLS算法
L. Resende, J. Romano, M. Bellanger
{"title":"A robust FLS algorithm for linearly-constrained adaptive filtering","authors":"L. Resende, J. Romano, M. Bellanger","doi":"10.1109/ICASSP.1994.390010","DOIUrl":"https://doi.org/10.1109/ICASSP.1994.390010","url":null,"abstract":"A robust approach to implement the FLS algorithm for linearly constrained adaptive filtering is derived in this work. The robustness is provided by means of an additional correcting term which is also updated by a LS procedure. In fact, the novel algorithm works as the LS version of the classical LMS-based Frost algorithm. Simulation results with a long data input sequence show the performance of the proposed technique.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114557478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Noise immunization using neural net for speech recognition 基于神经网络的噪声免疫语音识别
R. Sankar, Shrenik Patravali
{"title":"Noise immunization using neural net for speech recognition","authors":"R. Sankar, Shrenik Patravali","doi":"10.1109/ICASSP.1994.389563","DOIUrl":"https://doi.org/10.1109/ICASSP.1994.389563","url":null,"abstract":"The multilayer perceptron (MLP) type of neural network classifiers using backpropagation has become increasingly popular for speech recognition. However, for the case of noisy speech, studies have not been very extensive. In this paper, a robust speech recognition system using a neural network is studied. Robustness is achieved by noise immunization, thereby enabling the system to maintain a high recognition accuracy for speech input at different signal-to-noise ratio (SNR) conditions. Noise immunization is achieved by gradual contamination of the signal with noise thereby creating a more reliable reference database in spite of low SNR. The learning is done by a modified backpropagation algorithm. Tenth order LPC coefficients are used to represent the data. The order or sequence in which the data is presented to the neural network for training to provide fast convergence and better performance is studied.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"180 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114531068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Robust signal extrapolation using wavelets 使用小波的鲁棒信号外推
Li-Chien Lin, C.-C. Jay Kuo
{"title":"Robust signal extrapolation using wavelets","authors":"Li-Chien Lin, C.-C. Jay Kuo","doi":"10.1109/ICASSP.1994.390044","DOIUrl":"https://doi.org/10.1109/ICASSP.1994.390044","url":null,"abstract":"A new approach for signal extrapolation based on wavelet representation: known as scale-time limited extrapolation and a denoising process is investigated in this research. We first examine a new signal modeling technique using wavelets and the corresponding scale-time limited signal extrapolation algorithm. Then, the sensitivity of the algorithm to noise is discussed, and a denoising algorithm based on the time-localization property of the wavelet transform is proposed. By integrating the denoising process and the iterative scale-time limited extrapolation algorithm, we obtain a very robust signal extrapolation algorithm for noisy data. A simulation result of signal extrapolation from noisy observed data is presented to illustrate the performance of the proposed robust signal extrapolation algorithm.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115040271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Maximum likelihood velocity estimation of multiple seismic wavefronts 多个地震波前的最大似然速度估计
Hui Liu, Fu Li
{"title":"Maximum likelihood velocity estimation of multiple seismic wavefronts","authors":"Hui Liu, Fu Li","doi":"10.1109/ICASSP.1994.389773","DOIUrl":"https://doi.org/10.1109/ICASSP.1994.389773","url":null,"abstract":"Presents a new approach to simultaneously estimate stacking velocity and zero-offset time of seismic wave propagation, which is specially designed for multiple seismic wavefronts while the traditional semblance approach and the subspace approach are not. Simulations show a good performance of the new approach.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115091946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信