2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03): Latest Papers

A real-time curve evolution-based image fusion algorithm for multisensory image segmentation
Yuhua Ding, G. Vachtsevanos, A. Yezzi, W. Daley, Bonnie S. Heck-Ferri
DOI: https://doi.org/10.1109/ICASSP.2003.1199855
Published: 2003-07-06
Abstract: A partial differential equation (PDE)-based feature-level image fusion approach is proposed for multisensory image segmentation. The energy functional of the proposed fusion model is a weighted sum of several functionals, each constructed based on the characteristics of the sensor image. The weight selection decides the way that the model handles redundant, conflicting, or complementary information involved in the multisensory data. The method is implemented using level sets and is fast enough for real-time segmentation tasks. Finally the algorithm is applied to the segmentation of X-ray and visual images, and the results show that the fusion algorithm is efficient, accurate, and robust.
Citations: 1
Audio restoration by constrained audio texture synthesis
Lie Lu, Yi Mao, Wenyin Liu, HongJiang Zhang
DOI: https://doi.org/10.1109/ICASSP.2003.1200051
Published: 2003-07-06
Abstract: Audio texture, a new audio medium, is used to synthesize long audio streams according to a given short example audio clip. In this paper, we extend this idea to audio texture restoration, or constrained audio texture synthesis for restoring those missing parts in an audio clip. It is useful in many applications such as audio restoration and audio reconstruction. It can also be used in error concealment for audio/music delivery with packet loss on the Internet. A novel method is proposed for constrained audio texture synthesis. Preliminary results are provided for evaluation.
Citations: 2
Rapid prototyping of JPEG encoder using the ASIP development system: PEAS-III
Shinsuke Kobayashi, Kentaro Mita, Y. Takeuchi, M. Imai
DOI: https://doi.org/10.1109/ICASSP.2003.1202409
Published: 2003-07-06
Abstract: In this paper, the JPEG encoder application, one of the DSP applications, was implemented using the ASIP development system: PEAS-III. Instructions for the JPEG encoder, such as DCT instruction, and butterfly instructions, were added to the initial design. Area, performance, and execution cycles of the processors were calculated using the generated HDL description, compiler, and assembler by PEAS-III. From the experimental results, 12 architectures can be designed in 160 hours, and the designer can select an optimal architecture that satisfies design constraints considering the hardware cost, clock frequency and execution cycles.
Citations: 8
A measure of aperiodicity and periodicity in speech
Om Deshmukh, C. Espy-Wilson
DOI: https://doi.org/10.1109/ICASSP.2003.1198814
Published: 2003-07-06
Abstract: In this paper, we discuss a direct measure for aperiodic energy and periodic energy in speech signals. Most measures for aperiodicity have been indirect, such as zero crossing rate, high-frequency energy and the ratio of high-frequency energy to low-frequency energy. Such indirect measurements will usually fail in situations where there is both strong periodic and aperiodic energy in the speech signal, as in the case of some voiced fricatives or when there is a need to distinguish between high frequency periodic versus high frequency aperiodic energy. We propose an AMDF based temporal method to estimate directly the amount of periodic and aperiodic energy in the speech signal. The algorithm also gives an estimate of the pitch period in periodic regions.
Citations: 5
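The abstract above names the average magnitude difference function (AMDF) as the basis of its method. The following is a minimal sketch of the textbook AMDF and a lag search for the pitch period only. It is not the authors' algorithm, which also estimates periodic and aperiodic energy. The function names `amdf` and `estimate_period`, the lag bounds, and the synthetic 100 Hz test tone are all assumptions made for illustration.

```python
import math

def amdf(frame, max_lag):
    """Textbook AMDF: mean of |x[n] - x[n + k]| for each lag k in 0..max_lag."""
    n = len(frame)
    return [
        sum(abs(frame[i] - frame[i + k]) for i in range(n - k)) / (n - k)
        for k in range(max_lag + 1)
    ]

def estimate_period(frame, min_lag, max_lag):
    """Return the lag (in samples) with the smallest AMDF value in [min_lag, max_lag]."""
    d = amdf(frame, max_lag)
    return min(range(min_lag, max_lag + 1), key=lambda k: d[k])

# Hypothetical test signal: a 100 Hz sine sampled at 8000 Hz (80 samples per period).
fs = 8000
tone = [math.sin(2 * math.pi * 100 * i / fs) for i in range(400)]
period = estimate_period(tone, min_lag=20, max_lag=120)
print(period, fs / period)  # period in samples and the implied frequency in Hz
```

The search range excludes lag 0, where the AMDF is trivially zero, and stops below twice the true period so that the first minimum is selected. On real speech the minimum is rarely this clean, and a depth or threshold test on the dip would be needed.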
Joint optimization of short-term and long-term predictors in CELP speech coders
H. Zarrinkoub, P. Mermelstein
DOI: https://doi.org/10.1109/ICASSP.2003.1202318
Published: 2003-07-06
Abstract: The objective of this work is to investigate whether joint optimization of short-term and long-term predictors manifests significant advantages over the sequential optimization in speech coding. We propose a new joint optimization method based on Wiener filtering. The proposed analysis model resolves the pitch-bias problem of classical LPC analysis by considering the contribution of the long-term predictor while optimizing the short-term predictor. Our approach to joint optimization is based on analysis-by-synthesis and guarantees the synthesis filter stability. By applying our proposed joint optimization approach to CELP coding we obtain superior objective and subjective performance relative to CELP coding with sequential optimization. To provide voice quality equivalent to that of sequentially optimized CELP, the jointly optimized coder needs fewer FCB pulses and requires a reduced bit budget for LPC quantization. Our listening tests suggest that the JCELP coder at 4.25 kbps is equivalent in quality to the G.729 at 8 kbps.
Citations: 4
Structural risk minimization using nearest neighbor rule
A. Hamza, H. Krim, Bilge Karaçali
DOI: https://doi.org/10.1109/ICASSP.2003.1201643
Published: 2003-07-06
Abstract: We present a novel nearest neighbor rule-based implementation of the structural risk minimization principle to address a generic classification problem. We propose a fast reference set thinning algorithm on the training data set similar to a support vector machine approach. We then show that the nearest neighbor rule based on the reduced set implements the structural risk minimization principle, in a manner which does not involve selection of a convenient feature space. Simulation results on real data indicate that this method significantly reduces the computational cost of the conventional support vector machines, and achieves a nearly comparable test error performance.
Citations: 0
Rapid prototyping for an optimized MPEG-4 decoder implementation over a parallel heterogenous architecture
N. Ventroux, J. Nezan, M. Raulet, O. Déforges
DOI: https://doi.org/10.1109/ICASSP.2003.1202393
Published: 2003-07-06
Abstract: Sequential MPEG-4 solutions actually developed for single processors try to integrate the most functionalities as possible in an unique software, and are generally oversized compared with the actual service requirement. Moreover, they can hardly be projected onto multiprocessors targets, leading to an extra load of source code and calculations, but also to a sub-optimal use of the architecture parallelism. This paper introduces a distributed MPEG-4 application, where the system part is hosted by a standard PC, and the video decoder is supported by a multi-DSPs board. In particular, we present our AVSynDEx methodology allowing both an incremental building, an easy update on the video decoder description, and a quasi-automatic implementation onto a multi-C6x platform. We also define a global scheduler managing the parallel execution of the video and system applications.
Citations: 3
Adaptive skin segmentation in color images
S. L. Phung, D. Chai, A. Bouzerdoum
DOI: https://doi.org/10.1109/ICASSP.2003.1199483
Published: 2003-07-06
Abstract: A new skin segmentation technique for color images is proposed. The proposed technique uses a human skin color model that is based on the Bayesian decision theory and developed using a large training set of skin colors and nonskin colors. The proposed technique is novel and unique in that texture characteristics of the human skin are used to select appropriate skin color thresholds for skin segmentation. Two homogeneity measures for skin regions that take into account both global and local image features are also proposed. Experimental results showed that the proposed technique can achieve good skin segmentation performance (false detection rate of 4.5% and false rejection rate of 4.0%).
Citations: 48
A necessary and sufficient condition for the BIBO stability of general-order Bode-type variable-amplitude wave-digital equalizers
B. Nowrouzian, A. Fuller, M. Swamy
DOI: https://doi.org/10.1109/ICASSP.2003.1201696
Published: 2003-07-06
Abstract: Recently, the authors developed a new synthesis technique for the design of higher-order Bode-type variable-amplitude (VA) wave-digital (WD) equalizers. The salient feature of the resulting VA WD equalizers is that they permit the continuous variation of the WD equalizer transfer function from a shaping transfer function to its inverse by changing the value of a single variable digital multiplier only. The proposed design technique was based on the WD realization of the corresponding positive-real analog prototype shaping impedance function, and on the realization of the equalizer transfer function as the reflectance of the shaping impedance function with respect to the constituent variable digital multiplier. This paper is concerned with an investigation of the bounded-input bounded-output (BIBO) stability of general-order VA WD equalizers. It is shown that the resulting conditions are both necessary and sufficient for the BIBO stability of the VA WD equalizers for the entire range of values for the variable digital multiplier. These conditions can be checked in a straightforward fashion in terms of the characteristics of the shaping transfer function alone. An application example is given to illustrate the main results.
Citations: 0
Scalable encryption for multimedia content access control
H. H. Yu
DOI: https://doi.org/10.1109/ICASSP.2003.1202388
Published: 2003-07-06
Abstract: Traditional cryptography systems treat every portion of a message (a video, an image, or an audio) equally and encrypt the entire message as a whole. As a result, those security systems often have only two states: access authorization and access denial. To facilitate multi-level access control with interoperability, scalable security mechanism is needed. Further, the availability of varying network bandwidth and diverse receiver device capabilities demand scalable and flexible approaches that are capable of adapting to changing network conditions as well as device capabilities. We describe a multimedia encryption scheme that supports scalability. One desirable feature of the proposed scheme is its simplicity and flexibility in supporting scalable content access control.
Citations: 9