2010 IEEE International Conference on Acoustics, Speech and Signal Processing最新文献_第4页

Glottal features for speech-based cognitive load classification 基于语音认知负荷分类的声门特征

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5494987

T. Yap, J. Epps, E. Choi, E. Ambikairajah

引用次数: 17

Particle filtering based recovery of noisy GARCH processes 基于粒子滤波的GARCH过程恢复

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495789

T. Michaeli, I. Cohen

引用次数: 2

Image retargeting using a bandelet-based similarity measure 使用基于频带的相似度度量的图像重定位

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495291

A. Maalouf, M. Larabi

{"title":"Image retargeting using a bandelet-based similarity measure","authors":"A. Maalouf, M. Larabi","doi":"10.1109/ICASSP.2010.5495291","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5495291","url":null,"abstract":"Media content retargeting aims to adapt images/ videos to displays of large or small sizes. In this work, we propose a bandelet-based image retargeting algorithm for summarizing image data into smaller sizes. First, we define a multi-scale bandelet-based perceptual similarity measure which measures the geometric and perceptual similarities between two images at different bandelet scales. Two images are said to be geometrically similar if they have approximately the same geometric flow and quadtree structure. After determining the geometric similarity, a perceptual similarity measure based on the properties of the human visual system is defined to assess the perceptual difference between the original image and the retargeted one. Then, the problem of image retargeting is considered as a geometric optimization problem based on the bandelet-based geometric and perceptual similarity measures. That is, for an image S we search for a retargeted image T that contains as much as possible of geometric and perceptual information from S and, consequently, preserves visual coherence. The proposed retargeting algorithm outperforms the state-of-the-art methods in terms of the visual quality of the retargeted image.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123550466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

A small dodecahedral microphone array for blind source separation 用于盲源分离的小型十二面体传声器阵列

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5496003

Motoki Ogasawara, Takanori Nishino, K. Takeda

{"title":"A small dodecahedral microphone array for blind source separation","authors":"Motoki Ogasawara, Takanori Nishino, K. Takeda","doi":"10.1109/ICASSP.2010.5496003","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5496003","url":null,"abstract":"A sound source separation method based on frequency-domain independent component analysis (FD-ICA) is proposed. This method fully utilizes the dodecahedral microphone array (DHMA), which has several merits: 1) the size of the array is very small and thus easy to handle; 2) the amplitude difference among microphones on the different surfaces is large; and 3) it is less affected by spatial aliasing in the higher frequency region. In the proposed method, in order to solve the permutation problem in FD-ICA through clustering acoustic transfer functions, amplitude and phase differences are optimally combined as a function of frequency. A DHMA of 8 cm in diameter with 60 microphones is used for the experiment, where up to twelve sound sources (speech/musical instruments) are separated using the proposed algorithm. The separation performance of the proposed method attains 24 dB in the signal-to-interference ratio (SIR) improvement score for the case of twelve sources. Since the performance is better by up to 10 dB in comparison to the conventional method, our results confirm the effectiveness of the proposed method.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125279815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Circulant space-time codes for integration with beamforming 与波束形成集成的循环空时码

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5496288

Yiyue Wu, A. Calderbank

引用次数: 4

A new Fractional Fourier Transform based monopulse tracking radar processor 一种新的基于分数阶傅立叶变换的单脉冲跟踪雷达处理器

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5496208

S. Elgamel, J. Soraghan

引用次数: 4

Development of digital watermarking application technologies for newspapers 报纸数字水印应用技术的发展

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495431

R. Ebisawa, Takaaki Yamada

引用次数: 0

Independent subspace analysis with prior information for fMRI data 基于先验信息的fMRI数据独立子空间分析

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495320

Sai Ma, Xi-Lin Li, N. Correa, T. Adalı, V. Calhoun

引用次数: 24

A new approach to cross-layer optimization of multimedia systems 多媒体系统跨层优化的新方法

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495995

Nicholas Mastronarde, M. Schaar

引用次数: 2

Using duration and pitch for mandarin digit string recognition 使用时长和音高进行中文数字字符串识别

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495128

Rui Zhao, Yusuke Kida, X. Yan, P. Ding, Lei He

引用次数: 4