6th International Conference on Signal Processing, 2002.最新文献

筛选
英文 中文
Noise reduction and echo cancellation system 降噪和回波消除系统
6th International Conference on Signal Processing, 2002. Pub Date : 2002-08-26 DOI: 10.1109/ICOSP.2002.1180036
S. Zoican
{"title":"Noise reduction and echo cancellation system","authors":"S. Zoican","doi":"10.1109/ICOSP.2002.1180036","DOIUrl":"https://doi.org/10.1109/ICOSP.2002.1180036","url":null,"abstract":"The paper presents an algorithm, based on the transform domain approach, used in order to reduce noise and to eliminate echo in telecommunications systems such as hands free telephony. The algorithm has the ability to suppress the noise from the microphone signal as well as the acoustic echo. The audio echo cancellation is based on transform domain LMS adaptive filter theory. A noise reduction filter is introduced in order to achieve better performance. The computational complexity is increased, but the system can be implemented in real time using a DSP microcomputer.","PeriodicalId":159807,"journal":{"name":"6th International Conference on Signal Processing, 2002.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116202913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A new phonetic model for continuous speech recognition systems 连续语音识别系统的新语音模型
6th International Conference on Signal Processing, 2002. Pub Date : 2002-08-26 DOI: 10.1109/ICOSP.2002.1181120
R. Fagundes, J. S. Correa, P. Dumouchel
{"title":"A new phonetic model for continuous speech recognition systems","authors":"R. Fagundes, J. S. Correa, P. Dumouchel","doi":"10.1109/ICOSP.2002.1181120","DOIUrl":"https://doi.org/10.1109/ICOSP.2002.1181120","url":null,"abstract":"The main goal of this work is to describe a new model for a large vocabulary continuous speech recognition system using a phonetic-phonological approach. This work proposes a statistical phonetic structure, applied at the phonetic-phonological level, to improve the speech recognition performance in systems with phonetic-phonological modeling. It is shown that the general likelihood scores are increased, indicating better recognition performance. This is due to the fact that the statistical phonetic structure leads to enhancement of some frequent phonetic combinations from the language itself. Such a structure should be considered as an additional knowledge base, containing information about the real language phonetic structure. Also this new phonetic-phonological approach should be strongly recommended for use in spontaneous speech recognition systems.","PeriodicalId":159807,"journal":{"name":"6th International Conference on Signal Processing, 2002.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116413052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A comprehensive understanding for radial basis probabilistic neural networks 对径向基概率神经网络的全面理解
6th International Conference on Signal Processing, 2002. Pub Date : 2002-08-26 DOI: 10.1109/ICOSP.2002.1180015
De-shuang Huang, Wen-Bo Zhao
{"title":"A comprehensive understanding for radial basis probabilistic neural networks","authors":"De-shuang Huang, Wen-Bo Zhao","doi":"10.1109/ICOSP.2002.1180015","DOIUrl":"https://doi.org/10.1109/ICOSP.2002.1180015","url":null,"abstract":"The paper makes a profound analysis on radial basis probabilistic neural networks (RBPNN) from the viewpoint of linear algebra. Specifically, the transformation properties and internal representations of the RBPNNs are investigated in alliance with the properties of the input samples so that one may understand and grasp the mechanisms for pattern classification and function approximation of the RBPNNs. In addition, we analyse the convergence behaviour of the output class weight vectors of the RBPNNs, which can be shown to be orthogonal as well. Finally, one example for classifying five kinds of different distribution patterns are given to further support our understandings and claims.","PeriodicalId":159807,"journal":{"name":"6th International Conference on Signal Processing, 2002.","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122099487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Efficient wavelet-based temporally scalable video coding 高效的基于小波的时间可伸缩视频编码
6th International Conference on Signal Processing, 2002. Pub Date : 2002-08-26 DOI: 10.1109/ICOSP.2002.1181197
K. Ho, D. Lun
{"title":"Efficient wavelet-based temporally scalable video coding","authors":"K. Ho, D. Lun","doi":"10.1109/ICOSP.2002.1181197","DOIUrl":"https://doi.org/10.1109/ICOSP.2002.1181197","url":null,"abstract":"A new temporally scalable video coding algorithm based on the interpolating wavelet transform is proposed. With the proposed approach, the input video frames are first applied to an interpolating wavelet transform which generates video frames with reduced temporal redundancy in its high pass branch and original video frames at lower rate in its low pass branch. We further propose the reversible rounding method to convert the floating point coefficients given by the interpolating wavelet transform into integers without loss of resolution. The proposed video codec shares the same advantage of the traditional temporal subband (TSB) approach in that it is very simple in nature since it does not require the complicated motion compensation process. It outperforms substantially the TSB approach in generating lower frame rate videos.","PeriodicalId":159807,"journal":{"name":"6th International Conference on Signal Processing, 2002.","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122118971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
The research and implementation of Mongolian text to speech system 蒙古语文本转语音系统的研究与实现
6th International Conference on Signal Processing, 2002. Pub Date : 2002-08-26 DOI: 10.1109/ICOSP.2002.1181094
Gao Guang Lai, Hee-Joo Min, Zhao Si Qin
{"title":"The research and implementation of Mongolian text to speech system","authors":"Gao Guang Lai, Hee-Joo Min, Zhao Si Qin","doi":"10.1109/ICOSP.2002.1181094","DOIUrl":"https://doi.org/10.1109/ICOSP.2002.1181094","url":null,"abstract":"In this paper, a Mongolian text to speech system using the PSOLA method is presented. According to the characteristics of Mongolian, the system uses a speech waveform concatenation method that is comparatively mature in text-to-speech synthesis. The author also set up a usable Mongolian diphone database, and the elements of the database are all marked accurately. At the same time, a speech synthesis dictionary is firstly set up. The experiment of prosodic modifications is also done and the result is comparatively satisfying. Mongolian is an influential language in the world, so our system will surely improve the Mongolian information processing.","PeriodicalId":159807,"journal":{"name":"6th International Conference on Signal Processing, 2002.","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117179632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Control and description of gesture of virtual human 虚拟人的手势控制与描述
6th International Conference on Signal Processing, 2002. Pub Date : 2002-08-26 DOI: 10.1109/ICOSP.2002.1179961
Xu Lin, Yuan Bao-zong, Gao Wen, Tang Xiao-fang
{"title":"Control and description of gesture of virtual human","authors":"Xu Lin, Yuan Bao-zong, Gao Wen, Tang Xiao-fang","doi":"10.1109/ICOSP.2002.1179961","DOIUrl":"https://doi.org/10.1109/ICOSP.2002.1179961","url":null,"abstract":"It's an advanced behavior for virtual humans to gesticulate. Control and description of this kind of behavior are among the most challenging problems in the field of computer graphics. In order to lower its complexity, a gesture description language is designed and developed in this paper, in which a fuzzy declarative description is used to specify the shape of hands to lower the difficulty of the description of gestures. That lays the foundation for synthesis and recognition of gesture in computer.","PeriodicalId":159807,"journal":{"name":"6th International Conference on Signal Processing, 2002.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129799261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A robust cepstrum-based algorithm for image registration using projections 一种鲁棒的基于倒谱的投影图像配准算法
6th International Conference on Signal Processing, 2002. Pub Date : 2002-08-26 DOI: 10.1109/ICOSP.2002.1181180
H. Sarnel
{"title":"A robust cepstrum-based algorithm for image registration using projections","authors":"H. Sarnel","doi":"10.1109/ICOSP.2002.1181180","DOIUrl":"https://doi.org/10.1109/ICOSP.2002.1181180","url":null,"abstract":"The 2D phase correlation and the 2D cepstrum technique are known as two solutions to the translation-based image registration problem. However, both methods have a large computational load. Satisfactory results were obtained in a smaller translation range by applying phase correlations to the 1D projections of images to reduce the computational load (Alliney, S. and Morandi, C., 1986). The paper presents a new algorithm based on applying the cepstrum technique to image projections in a similar way. An enhanced cepstrum technique is developed by subtracting the cepstrum of the projection differences from the cepstrum of the projection additions. A superior performance compared to that of the 1D phase correlation can be obtained by whitening the power spectrum by the square-root instead of the logarithm and using a high-pass filter afterwards. The algorithm yields larger translation ranges in registering noisy images and remarkably outperforms the phase correlation method in the case of different degrees of blurring between images.","PeriodicalId":159807,"journal":{"name":"6th International Conference on Signal Processing, 2002.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128408203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Study on detecting abnormal construction of geology with multi-scale edges detecting method 多尺度边缘检测方法在地质异常施工检测中的应用研究
6th International Conference on Signal Processing, 2002. Pub Date : 2002-08-26 DOI: 10.1109/ICOSP.2002.1180151
Xianghong Tang, Shuqing Xie, Qiliang Li
{"title":"Study on detecting abnormal construction of geology with multi-scale edges detecting method","authors":"Xianghong Tang, Shuqing Xie, Qiliang Li","doi":"10.1109/ICOSP.2002.1180151","DOIUrl":"https://doi.org/10.1109/ICOSP.2002.1180151","url":null,"abstract":"In a seismic record, the amplitude varies with different geological surfaces; this variation is very like the variation of gray levels in images. Based on the image edge detection method, we have studied thin-bed recognition methods. Experimental results show that our method is very effective.","PeriodicalId":159807,"journal":{"name":"6th International Conference on Signal Processing, 2002.","volume":"33 21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128471544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature difference classification method in fractal image coding 分形图像编码中的特征差分分类方法
6th International Conference on Signal Processing, 2002. Pub Date : 2002-08-26 DOI: 10.1109/ICOSP.2002.1181139
C. Yisong, Wang Guoping, D. Shihai
{"title":"Feature difference classification method in fractal image coding","authors":"C. Yisong, Wang Guoping, D. Shihai","doi":"10.1109/ICOSP.2002.1181139","DOIUrl":"https://doi.org/10.1109/ICOSP.2002.1181139","url":null,"abstract":"In this paper, we present a classification algorithm in fractal image coding. Based on the contraction characteristics of transformations in fractal image coding, the algorithm uses the notion of feature difference to speed up the domain-range matching routine of the coding. The algorithm can effectively exclude pseudo matches during the process of domain-range matching and result in significant improvement of the rate-distortion performance. It can also be easily realized in cooperation with many other speedup schemes.","PeriodicalId":159807,"journal":{"name":"6th International Conference on Signal Processing, 2002.","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128685112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A new and efficient audio compression based on EBCOT and RVLC 一种基于EBCOT和RVLC的新型高效音频压缩技术
6th International Conference on Signal Processing, 2002. Pub Date : 2002-08-26 DOI: 10.1109/ICOSP.2002.1181086
Zhanguo Wang, Fuzong Lin
{"title":"A new and efficient audio compression based on EBCOT and RVLC","authors":"Zhanguo Wang, Fuzong Lin","doi":"10.1109/ICOSP.2002.1181086","DOIUrl":"https://doi.org/10.1109/ICOSP.2002.1181086","url":null,"abstract":"The RVLC which is an EBCOT-based coding method has been utilized successfully recently in image compression as in JPEG 2000. In this paper we propose a new and efficient audio compression based on EBCOT and RVLC. Firstly the original audio signal is decomposed with a lifting wavelet packet transform, then RVLC and Huffman methods are used to process the coefficients of low and high subbands respectively. The psychoacoustic model and probability model are also taken into account to achieve high quality of reconstructed audio signal. The results of our experiments show that the method is efficient.","PeriodicalId":159807,"journal":{"name":"6th International Conference on Signal Processing, 2002.","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129622173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信