2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献

筛选
英文 中文
Channel and sensing aware channel access policy for multi-channel cognitive radio networks 多信道认知无线网络的信道和感知信道接入策略
Shu-Hsien Wang, Chih-yu Hsu, Y. Hong
{"title":"Channel and sensing aware channel access policy for multi-channel cognitive radio networks","authors":"Shu-Hsien Wang, Chih-yu Hsu, Y. Hong","doi":"10.1109/ICASSP.2012.6288581","DOIUrl":"https://doi.org/10.1109/ICASSP.2012.6288581","url":null,"abstract":"We propose a reservation-based channel access policy for multi-channel cognitive radio networks. To enhance the throughput of secondary users (SUs), SUs are allowed to select channels opportunistically according to both the local channel state information (CSI) and the spectrum sensing outcomes. SUs will then compete for the right of transmission on the chosen channel by emitting reservation packets to the access point sequentially according to their local CSI. We further devise a proper threshold on channel gains such that only the SUs whose channel gains are sufficiently high can reserve channels and the interference from SUs to the licensed network can be limited. A channel aware splitting algorithm is adopted to schedule the SU with the highest channel gain to transmit at each time instant. From simulations, the proposed channel access policy outperforms the policies that take into consideration only CSI or sensing outcomes.","PeriodicalId":6443,"journal":{"name":"2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81492998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Joint spectral and temporal normalization of features for robust recognition of noisy and reverberated speech 联合频谱和时间归一化特征对噪声和混响语音的鲁棒识别
Xiong Xiao, Chng Eng Siong, Haizhou Li
{"title":"Joint spectral and temporal normalization of features for robust recognition of noisy and reverberated speech","authors":"Xiong Xiao, Chng Eng Siong, Haizhou Li","doi":"10.1109/ICASSP.2012.6288876","DOIUrl":"https://doi.org/10.1109/ICASSP.2012.6288876","url":null,"abstract":"In this paper, we propose a framework for joint normalization of spectral and temporal statistics of speech features for robust speech recognition. Current feature normalization approaches normalize the spectral and temporal aspects of feature statistics separately to overcome noise and reverberation. As a result, the interaction between the spectral normalization (e.g. mean and variance normalization, MVN) and temporal normalization (e.g. temporal structure normalization, TSN) is ignored. We propose a joint spectral and temporal normalization (JSTN) framework to simultaneously normalize these two aspects of feature statistics. In JSTN, feature trajectories are filtered by linear filters and the filters' coefficients are optimized by maximizing a likelihood-based objective function. Experimental results on Aurora-5 benchmark task show that JSTN consistently out-performs the cascade of MVN and TSN on test data corrupted by both additive noise and reverberation, which validates our proposal. Specifically, JSTN reduces average word error rate by 8-9% relatively over the cascade of MVN and TSN for both artificial and real noisy data.","PeriodicalId":6443,"journal":{"name":"2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81593817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Bias analysis of source localization using the maximum likelihood estimator 使用极大似然估计器的源定位偏差分析
Liyang Rui, K. C. Ho
{"title":"Bias analysis of source localization using the maximum likelihood estimator","authors":"Liyang Rui, K. C. Ho","doi":"10.1109/ICASSP.2012.6288450","DOIUrl":"https://doi.org/10.1109/ICASSP.2012.6288450","url":null,"abstract":"The nonlinear nature of the source localization problem creates bias to a location estimate. The bias could play a significant role in limiting the performance of localization and tracking when multiple measurements at different instants are available. This paper performs bias analysis of the source location estimate obtained by the maximum likelihood estimator, where the positioning measurements can be TOA, TDOA, or AOA. The effect of bias to the mean-square localization error is examined and the amounts of bias introduced by the three types of measurements are contrasted.","PeriodicalId":6443,"journal":{"name":"2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81944244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Expected-utility-based sensor selection for state estimation 基于期望效用的状态估计传感器选择
David M. Cohen, Douglas L. Jones, S. Narayanan
{"title":"Expected-utility-based sensor selection for state estimation","authors":"David M. Cohen, Douglas L. Jones, S. Narayanan","doi":"10.1109/ICASSP.2012.6288470","DOIUrl":"https://doi.org/10.1109/ICASSP.2012.6288470","url":null,"abstract":"Applications such as long-term environmental monitoring and large-scale surveillance demand reliable performance from sensor nodes while operating within strict energy constraints. There is often not enough power for sensors to make measurements all of the time. In these cases, one must decide when to run each sensor. To this end, we develop a one-step optimal sensor-scheduling algorithm based on expected-utility maximization. “Utility” is an application-specific measure of the benefit from a given sensor measurement. In sensing environments that can be modeled using a hidden Markov model, selecting the appropriate combination of sensors at each time instant enables maximization of the expected utility while operating within an energy budget. For some budgets, the utility-based algorithm shows more than 300% utility gains over a constant duty-cycle scheme designed to consume the same amount of energy. These benefits are dependent on the energy budget.","PeriodicalId":6443,"journal":{"name":"2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84344154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Model centroids for the simplification of Kernel Density estimators 简化核密度估计的模型质心
Olivier Schwander, F. Nielsen
{"title":"Model centroids for the simplification of Kernel Density estimators","authors":"Olivier Schwander, F. Nielsen","doi":"10.1109/ICASSP.2012.6287989","DOIUrl":"https://doi.org/10.1109/ICASSP.2012.6287989","url":null,"abstract":"Gaussian mixture models are a widespread tool for modeling various and complex probability density functions. They can be estimated using Expectation- Maximization or Kernel Density Estimation. Expectation- Maximization leads to compact models but may be expensive to compute whereas Kernel Density Estimation yields to large models which are cheap to build. In this paper we present new methods to get high-quality models that are both compact and fast to compute. This is accomplished with clustering methods and centroids computation. The quality of the resulting mixtures is evaluated in terms of log-likelihood and Kullback-Leibler divergence using examples from a bioinformatics application.","PeriodicalId":6443,"journal":{"name":"2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84397645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
User recommendation with tensor factorization in social networks 基于张量分解的社交网络用户推荐
Zhenlei Yan, Jie Zhou
{"title":"User recommendation with tensor factorization in social networks","authors":"Zhenlei Yan, Jie Zhou","doi":"10.1109/ICASSP.2012.6288758","DOIUrl":"https://doi.org/10.1109/ICASSP.2012.6288758","url":null,"abstract":"The rapid growth of population in social networks has posed a challenge to existing systems for recommending to a user new friends having similar interests. In this paper, we address this user recommendation problem in social networks by proposing a novel framework which utilizes users' tagging information with tensor factorization. This work brings two major contributions: (1) A tensor model is proposed to capture the potential association among user, user's interests and friends in social tagging systems; (2) A novel approach is proposed to recommend new friends based on this model. The experiments on a real-world dataset crawled from Last.fm show that the proposed method outperforms other state-of-the-art approaches.","PeriodicalId":6443,"journal":{"name":"2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84992023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Adaptive kernel principal components tracking 自适应核主成分跟踪
Toshihisa Tanaka, Y. Washizawa, A. Kuh
{"title":"Adaptive kernel principal components tracking","authors":"Toshihisa Tanaka, Y. Washizawa, A. Kuh","doi":"10.1109/ICASSP.2012.6288276","DOIUrl":"https://doi.org/10.1109/ICASSP.2012.6288276","url":null,"abstract":"Adaptive online algorithms for simultaneously extracting nonlinear eigenvectors of kernel principal component analysis (KPCA) are developed. KPCA needs all the observed samples to represent basis functions, and the same scale of eigenvalue problem as the number of samples should be solved. This paper reformulates KPCA and deduces an expression in the Euclidean space, where an algorithm for tracking generalized eigenvectors is applicable. The developed algorithm here is least mean squares (LMS)-type and recursive least squares (RLS)-type. Numerical example is then illustrated to support the analysis.","PeriodicalId":6443,"journal":{"name":"2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85021729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Lagrangian multiplier optimization using correlations in residues 利用残数相关性的拉格朗日乘子优化
Zhenyu Liu, Dongsheng Wang, Junwei Zhou, T. Ikenaga
{"title":"Lagrangian multiplier optimization using correlations in residues","authors":"Zhenyu Liu, Dongsheng Wang, Junwei Zhou, T. Ikenaga","doi":"10.1109/ICASSP.2012.6288099","DOIUrl":"https://doi.org/10.1109/ICASSP.2012.6288099","url":null,"abstract":"Rate distortion optimization (RDO) algorithm plays the vital role in the up to date hybrid video codec H.264/AVC. The RDO algorithm of H.264/AVC reference software is built up by assuming that the transformed residues are memoryless variables. However, our experiments reveal that, for some sequences, the strong temporal correlations exist in the prediction residues. This paper extends the Lagrangian optimization techniques by modeling the transformed residues as the first-order Markov source and calibrating the distortion model with the piecewise approximation function. The proposed algorithms adjust the Lagrangian multiplier dynamically to improve the overall coding quality. Comprehensive experiments testify that, as compared with the JM reference software, our optimizations can achieve up to 1.875dB coding gain. Moreover, our algorithms posses more robust coding performance and introduce less computational overhead than the Laplace distribution based methods. The inherent short process latency makes it possible to cooperate our algorithms with rate control operation. Last but not least, the proposed approach is also useful for the emerging standard, HEVC.","PeriodicalId":6443,"journal":{"name":"2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85233219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
GMM foreground segmentation processor based on address free pixel streams 基于地址自由像素流的GMM前景分割处理器
R. Yagi, Tomohito Kajimoto, T. Nishitani
{"title":"GMM foreground segmentation processor based on address free pixel streams","authors":"R. Yagi, Tomohito Kajimoto, T. Nishitani","doi":"10.1109/ICASSP.2012.6288213","DOIUrl":"https://doi.org/10.1109/ICASSP.2012.6288213","url":null,"abstract":"A compact implementation of a foreground segmentation processor in a multi-resolution transform domain has been proposed for HDTV signals. The proposed architecture is designed to simplify system controls by the hardware streaming and to reduce required memory capacities. It enables flowing pixels through all functional units in order, including multi-resolution spatial transform and temporal segmentation. The resultant architecture does not use memories except I/O buffers. Therefore, memory modules as well as complex address manipulation over the multiple global transforms and spatial/temporal interface is not required. The FPGA prototype chip dissipates 150 mW of power. This approach can be used for tablets and smart-phone by an ASIC implementation which will reduce the operation power to about 1/6.","PeriodicalId":6443,"journal":{"name":"2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85591359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A local intensity adaptive structural similarity index 一种局部强度自适应结构相似性指数
Zhengguo Li, Chuohao Yeo, Y. H. Tan, S. Rahardja
{"title":"A local intensity adaptive structural similarity index","authors":"Zhengguo Li, Chuohao Yeo, Y. H. Tan, S. Rahardja","doi":"10.1109/ICASSP.2012.6288090","DOIUrl":"https://doi.org/10.1109/ICASSP.2012.6288090","url":null,"abstract":"Existing structural similarity (SSIM) index comprises of one term on luminance comparison and the other term on contrast and structure comparison. In this paper, the SSIM index is first improved by introducing three weighting factors to the second term such that it is adaptive to local intensities of two images to be compared. The improved SSIM (iSSIM) index is further extended for two images with possibly different exposures. Experimental results show that the proposed indices are more robust to large intensity changes of two images from the same scene and more sensitive to two images from different scenes than the existing SSIM index.","PeriodicalId":6443,"journal":{"name":"2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85681993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信