2010 IEEE International Conference on Acoustics, Speech and Signal Processing最新文献

筛选
英文 中文
Estimating direct-to-reverberant energy ratio based on spatial correlation model segregating direct sound and reverberation 基于分离直声和混响的空间相关模型估计直混响能量比
2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5496103
Yusuke Hioka, K. Niwa, Sumitaka Sakauchi, K. Furuya, Y. Haneda
{"title":"Estimating direct-to-reverberant energy ratio based on spatial correlation model segregating direct sound and reverberation","authors":"Yusuke Hioka, K. Niwa, Sumitaka Sakauchi, K. Furuya, Y. Haneda","doi":"10.1109/ICASSP.2010.5496103","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5496103","url":null,"abstract":"A new approach for estimating the direct-to-reverberant energy ratio (DRR) using a microphone array is proposed. The method is based on amodel of a spatial correlation matrix that segregates direct sound and reverberation. It estimates DRR from the power spectra of both components, which are derived from the correlation matrix of the observed signal. In experiments performed in simulated and actual reverberant environments, the proposed method mostly succeeded in estimating DRR accurately. We also present speech enhancement using binary masking as an example of an application of the estimated DRR. By utilization of the DRR as a factor to discriminate the distances of speakers, separation of speech signals whose sources were located in the same direction but at different distances was achieved.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129627755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Multichannel-compressive estimation of doubly selective channels in MIMO-OFDM systems: Exploiting and enhancing joint sparsity MIMO-OFDM系统中双选择信道的多信道压缩估计:利用和增强联合稀疏性
2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5496098
Daniel Eiwen, G. Tauböck, F. Hlawatsch, H. Rauhut, N. Czink
{"title":"Multichannel-compressive estimation of doubly selective channels in MIMO-OFDM systems: Exploiting and enhancing joint sparsity","authors":"Daniel Eiwen, G. Tauböck, F. Hlawatsch, H. Rauhut, N. Czink","doi":"10.1109/ICASSP.2010.5496098","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5496098","url":null,"abstract":"We propose a compressive estimator of doubly selective channels within pulse-shaping multicarrier MIMO systems (including MIMO-OFDM as a special case). The use of multichannel compressed sensing exploits the joint sparsity of the MIMO channel for improved performance. We also propose a multichannel basis optimization for enhancing joint sparsity. Simulation results demonstrate significant advantages over channel-by-channel compressive estimation.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128260049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Tone injection with aggressive clipping projection for OFDM PAPR reduction 带有积极剪影投影的调音注入,用于降低OFDM的PAPR
2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5496028
Cagdas Tuna, Douglas L. Jones
{"title":"Tone injection with aggressive clipping projection for OFDM PAPR reduction","authors":"Cagdas Tuna, Douglas L. Jones","doi":"10.1109/ICASSP.2010.5496028","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5496028","url":null,"abstract":"The main drawback of Orthogonal Frequency Division Multiplexing (OFDM) systems is the high peak-to-average power ratio (PAPR), which leads to a significant reduction in performance and power efficiency. Tone injection (TI) is a promising PAPR reduction technique that cyclically extends QAM constellations to allow an alternative encoding with lower PAPR at the transmitter. We present a new efficient complex-baseband algorithm which performs TI by using aggressive clipping to combat large peaks. An approximated-analog PAPR reduction of up to 5.1 dB at a 10–5 symbol-clip probability is obtained for 64-channel 16-QAM OFDM. This is a very fast and practical peak-power reduction method for OFDM systems that essentially achieves the same PAPR as single-carrier modulation for large constellations.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128961774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A scalable H.264/AVC deblocking filter architecture using dynamic partial reconfiguration 采用动态部分重构的可扩展H.264/AVC去块滤波器架构
2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495525
Rakan Khraisha, Jooheung Lee
{"title":"A scalable H.264/AVC deblocking filter architecture using dynamic partial reconfiguration","authors":"Rakan Khraisha, Jooheung Lee","doi":"10.1109/ICASSP.2010.5495525","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5495525","url":null,"abstract":"This paper presents a scalable H.264/AVC deblocking filter architecture based on FPGA using dynamic partial reconfiguration. This desirable feature of FPGAs makes it possible for different hardware configurations to be implemented during run-time. Architectural scalability to adapt to different users' requirements intelligently is demonstrated through dynamic self-reconfiguration on the reconfigurable hardware fabric. When exploiting the full capability of the proposed design, filtering operations up to four different edges at the same time can be performed resulting in significant reduction of total processing time. The architecture can easily support the required computing capability for different resolutions and frame rates of video sequences. The implemented architecture has been evaluated using Xilinx Virtex-4 ML410 FPGA board. The design can operate at a maximum frequency of 103 MHz. The reconfiguration is done through Internal Configuration Access Port (ICAP) to achieve maximum performance needed by real time applications.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129248583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A scalable block cipher design using filter banks over finite fields 在有限域上使用滤波器组的可扩展分组密码设计
2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495404
S. Saraireh, M. Benaissa
{"title":"A scalable block cipher design using filter banks over finite fields","authors":"S. Saraireh, M. Benaissa","doi":"10.1109/ICASSP.2010.5495404","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5495404","url":null,"abstract":"A scalable block cipher based on a filter bank structure over GF(28) is proposed. The filter bank structure is used to introduce the diffusion during the circular convolution process between the filters coefficients (which are generated from the key) and the plaintext. The confusion is achieved by the mixing between the analysis filter bank and a novel addition mod 2n and XOR scheme. The proposed cipher is scalable in both block and key lengths. The cipher is shown to be secure against differential and linear cryptanalysis and of lesser complexity than the AES. The proposed cipher structure enables security versus complexity versus performance trade-offs to be made, an increasingly important aspect of security in communications systems.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124598979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Reduced-rank DOA estimation based on joint iterative subspace recursive optimization and grid search 基于联合迭代子空间递归优化和网格搜索的降阶DOA估计
2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5496256
Lei Wang, R. D. Lamare, M. Haardt
{"title":"Reduced-rank DOA estimation based on joint iterative subspace recursive optimization and grid search","authors":"Lei Wang, R. D. Lamare, M. Haardt","doi":"10.1109/ICASSP.2010.5496256","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5496256","url":null,"abstract":"In this paper, we propose a reduced-rank direction of arrival (DOA) estimation algorithm based on joint and iterative subspace optimization (JISO) with grid search . The reduced-rank scheme includes a rank reduction matrix and an auxiliary reduced-rank parameter vector. They are jointly and iteratively optimized with a recursive least squares algorithm (RLS) to calculate the output power spectrum. The proposed JISO-RLS DOA estimation algorithm provides an efficient way to iteratively estimate the rank reduction matrix and the auxiliary reduced-rank vector. It is suitable for DOA estimation with large arrays and can be extended to arbitrary array geometries. It exhibits an advantage over MUSIC and ESPRIT when many sources exist in the system. A spatial smoothing (SS) technique is employed for dealing with highly correlated sources. Simulation results show that the JISO-RLS has a better performance than existing Capon and subspace-based DOA estimation methods.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124669641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Parametric emotional singing voice synthesis 参数化情感歌声合成
2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495137
Younsung Park, Sungrack Yun, C. Yoo
{"title":"Parametric emotional singing voice synthesis","authors":"Younsung Park, Sungrack Yun, C. Yoo","doi":"10.1109/ICASSP.2010.5495137","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5495137","url":null,"abstract":"This paper describes an algorithm to control the expressed emotion of a synthesized song. Based on the database of various melodies sung neutrally with restricted set of words, hidden semi-Markov models (HSMMs) of notes ranging from E3 to G5 are constructed for synthesizing singing voice. Three steps are taken in the synthesis: (1) Pitch and duration are determined according to the notes indicated by the musical score; (2) Features are sampled from appropriate HSMMs with the duration set to the maximum probability; (3) Singing voice is synthesized by the mel-log spectrum approximation (MLSA) filter using the sampled features as parameters of the filter. Emotion of a synthesized song is controlled by varying the duration and the vibrato parameters according to the Thayer's mood model. Perception test is performed to evaluate the synthesized song. The results show that the algorithm can control the expressed emotion of a singing voice given a neutral singing voice database.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124698608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Analyzing grasping for inferring cognitive states of users 分析抓取来推断用户的认知状态
2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495795
Kotaro Ogino, T. Jitsuhiro, C. Miyajima, K. Takeda
{"title":"Analyzing grasping for inferring cognitive states of users","authors":"Kotaro Ogino, T. Jitsuhiro, C. Miyajima, K. Takeda","doi":"10.1109/ICASSP.2010.5495795","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5495795","url":null,"abstract":"We study the effect of cognitive states, feelings about tasks, on grasping behavior to estimate user's feelings from their motion. Since people solve the inverse kinematics problem of grasping based on their cognition for the task, when they grasp an object, the way to grasp the object reflects their cognitive states. We are analyzing the way of grasping a cup depending on whether a user is stressed. The physical properties of grasping, volume and entropy of Grasp Jacobian ellipsoids are analyzed. The volume of Grasp Jacobian ellipsoids, which indicates the possible size of object movement, was shrunk after learning the grasp motion. Also the volumes between the relaxed and the stressed cognitive conditions were significantly different. These results show that the user's cognition for tasks reflects the grasp forms and the possible size of object movement.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124757642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive score fusion using Weighted Logistic Linear Regression for spoken language recognition 基于加权Logistic线性回归的自适应分数融合语音识别
2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495069
K. Sim, Kong-Aik Lee
{"title":"Adaptive score fusion using Weighted Logistic Linear Regression for spoken language recognition","authors":"K. Sim, Kong-Aik Lee","doi":"10.1109/ICASSP.2010.5495069","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5495069","url":null,"abstract":"State-of-the-art spoken language recognition systems typically consist of a combination of sub-systems. These sub-systems generate language detection scores for each speech segment, which will be fused (combined) to yield the overall detection scores. Typically, score fusion is achieved using a linear model and Logistic Linear Regression (LLR) is commonly used to estimate the model parameters. This paper proposes an extension to the LLR model, known as the Weighted LLR (WLLR). WLLR is obtained using a weighted combination of multiple LLRs where the weights are obtained as a nonlinear function of the speech segments. Although the resultant score is still linear with respect to the scores of the individual sub-systems, the linear function depends on the speech segment. Hence, the overall score fusion model can be regarded as an adaptive model. Experimental results shows that WLLR outperforms LLR by approximately 10% relative for PPRLM system fusion on the NIST 2003 and 2005 language recognition evaluation sets.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129481316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Improvement of power analysis attacks using Kalman filter 利用卡尔曼滤波器改进功率分析攻击
2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495428
Youssef Souissi, S. Guilley, J. Danger, S. Mekki, Guillaume Duc
{"title":"Improvement of power analysis attacks using Kalman filter","authors":"Youssef Souissi, S. Guilley, J. Danger, S. Mekki, Guillaume Duc","doi":"10.1109/ICASSP.2010.5495428","DOIUrl":"https://doi.org/10.1109/ICASSP.2010.5495428","url":null,"abstract":"Power analysis attacks are non intrusive and easily mounted. As a consequence, there is a growing interest in efficient implementation of these attacks against block cipher algorithms such as Data Encryption Standard (DES) and Advanced Encryption Standard (AES). In our paper we propose a new technique based on the Kalman theory. We show how this technique could be useful for the cryptographic domain by making power analysis attacks faster. Moreover we prove that the Kalman filter is more powerful than the High Order Statistics technique.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129488941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信