2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT): latest papers

Quantifying parameters of a source-filter model for oesophageal speech
J. O’Toole, B. G. Zapirain
{"title":"Quantifying parameters of a source-filter model for oesophageal speech","authors":"J. O’Toole, B. G. Zapirain","doi":"10.1109/ISSPIT.2011.6151618","DOIUrl":"https://doi.org/10.1109/ISSPIT.2011.6151618","url":null,"abstract":"Signal processing methods can improve the quality and intelligibility of oesophageal speech. Current methods show only moderate improvement leaving potential for better results. Quantifying parameters of oesophageal speech relative to laryngeal (normal) speech would help in the design of future enhancement methods for oesophageal speech. We quantified parameters of a source-filter model on a database of sustained vowels in Spanish from 4 oesophageal and 4 normal speakers. A ten-parameter glottal waveform model was used as the source and an autoregressive model was used as the filter. Classification, using a log-spectral distance measure, showed that all oesophageal speech samples were classified as whisper voice types; a voice type with a signal to noise ratio of −20 dB. Filter parameters representing spectral amplitudes and bandwidths had a large degree of variation for oesophageal speech comparative to the degree of variation for normal speech (Brown-Forsythe test, F < 0.001). Source metrics, noise to harmonic ratio (NHR) and variation in fundamental frequency, were also significantly greater for oesophageal speech (t-test, P < 0.001). These results show a greater degree of nonstationarity, and a noisier glottal waveform, for oesophageal speech comparative to normal speech.","PeriodicalId":288042,"journal":{"name":"2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126368866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
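The abstract above classifies voice types with a log-spectral distance measure. A minimal sketch of that kind of measure follows, assuming the common RMS-in-dB definition over two power spectra of equal length; the paper's exact definition, the `classify` helper and the template labels are illustrative assumptions, not taken from the source.

```python
import math


def log_spectral_distance(power_a, power_b, floor=1e-12):
    """RMS difference, in dB, between two power spectra of equal length."""
    if len(power_a) != len(power_b) or not power_a:
        raise ValueError("spectra must be non-empty and of equal length")
    squared = []
    for a, b in zip(power_a, power_b):
        # Floor both values so that an empty bin does not produce log(0).
        diff_db = 10.0 * math.log10(max(a, floor) / max(b, floor))
        squared.append(diff_db ** 2)
    return math.sqrt(sum(squared) / len(squared))


def classify(spectrum, templates):
    """Return the label of the template spectrum closest to `spectrum`.

    `templates` is a hypothetical dict mapping a voice-type label to a
    reference power spectrum.
    """
    return min(
        templates,
        key=lambda label: log_spectral_distance(spectrum, templates[label]),
    )
```

A nearest-template rule is only one way to turn the distance into a class decision; the abstract does not say which rule the authors used.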
Low-complexity LSMR equalisation of FrFT-based multicarrier systems in doubly dispersive channels
A. Solyman, Stephan Weiss, J. Soraghan
{"title":"Low-complexity LSMR equalisation of FrFT-based multicarrier systems in doubly dispersive channels","authors":"A. Solyman, Stephan Weiss, J. Soraghan","doi":"10.1109/ISSPIT.2011.6151606","DOIUrl":"https://doi.org/10.1109/ISSPIT.2011.6151606","url":null,"abstract":"The discrete fractional Fourier transform (FrFT) has been suggested to enhance performance over DFT-based multicarrier systems when transmitting over doubly-dispersive channels. In this paper, we propose a novel low-complexity equaliser for inter-symbol and inter-carrier interference arising in such multicarrier transmission system. Due to a lower spreading in the FrFT-domain channel matrix as compared to the DFT domain, the equaliser can approximate the fractional-domain channel matrix by a band matrix. Further, we utilise the least squares minres (LSMR) algorithm in the calculation of the equalisation, which exhibits attractive numerical properties and low complexity. Simulation results demonstrate the superior performance of the proposed LSMR equaliser over benchmark schemes.","PeriodicalId":288042,"journal":{"name":"2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124567038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 17
Disparity energy model using a trained neuronal population
Jaime A. Martins, J. Rodrigues, J. du Buf
{"title":"Disparity energy model using a trained neuronal population","authors":"Jaime A. Martins, J. Rodrigues, J. du Buf","doi":"10.1109/ISSPIT.2011.6151575","DOIUrl":"https://doi.org/10.1109/ISSPIT.2011.6151575","url":null,"abstract":"Depth information using the biological Disparity Energy Model can be obtained by using a population of complex cells. This model explicitly involves cell parameters like their spatial frequency, orientation, binocular phase and position difference. However, this is a mathematical model. Our brain does not have access to such parameters, it can only exploit responses. Therefore, we use a new model for encoding disparity information implicitly by employing a trained binocular neuronal population. This model allows to decode disparity information in a way similar to how our visual system could have developed this ability, during evolution, in order to accurately estimate disparity of entire scenes.","PeriodicalId":288042,"journal":{"name":"2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121271715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 10
An Expectation-Maximization-based approach to the relative grid-locking problem
S. Fortunati, F. Gini, M. Greco, A. Farina, A. Graziano, S. Giompapa
{"title":"An Expectation-Maximization-based approach to the relative grid-locking problem","authors":"S. Fortunati, F. Gini, M. Greco, A. Farina, A. Graziano, S. Giompapa","doi":"10.1109/ISSPIT.2011.6151614","DOIUrl":"https://doi.org/10.1109/ISSPIT.2011.6151614","url":null,"abstract":"An important prerequisite for successful multisensory integration is that the data from the reporting sensors are transformed to a common reference frame free of systematic or registration bias errors. If not properly corrected, the registration errors can seriously degrade the global surveillance system performance. The relative sensor registration (or grid-locking) process aligns remote data to local data under the assumption that the local data are bias free and that all biases reside with the remote sensor. In this paper, we take into account all registration errors involved in the grid-locking problem. An EM-based estimator of these bias terms is derived and its statistical performance compared to the hybrid Cramér-Rao lower bound (HCRLB).","PeriodicalId":288042,"journal":{"name":"2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117117952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 5
An improved β-order WEDM spectral amplitude estimator for speech enhancement
Na Li, C. Bao, Bingyin Xia, Feng Deng
{"title":"An improved β-order WEDM spectral amplitude estimator for speech enhancement","authors":"Na Li, C. Bao, Bingyin Xia, Feng Deng","doi":"10.1109/ISSPIT.2011.6151552","DOIUrl":"https://doi.org/10.1109/ISSPIT.2011.6151552","url":null,"abstract":"This paper proposes an improved β-order Weighted Euclidean Distortion Measure (I-β-WEDM) spectral estimator for speech enhancement. The new β-order WEDM gain function is introduced, in which a novel method of computing order β is proposed. In this method, the input noisy signal is divided into several critical bands in the human auditory model. The value of order β is updated adaptively according to the signal-to-noise ratios (SNR) which are computed in different critical bands. The performance of the proposed estimator with different values of β has been evaluated by ITU-T G.160 under the white noise and the non-stationary noise, respectively. The experimental results show that, comparing with the reference algorithms, the proposed algorithm with adaptive β could provide large amount of noise reduction, and has a little impact on the level of speech signal. The quality of enhanced speech is improved at the same time.","PeriodicalId":288042,"journal":{"name":"2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130869495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
A tailor-made development for time domain data series pre-processing in the power industry
Juan J. Gude, L. Vázquez-Seisdedos, David Diaz Martinez
{"title":"A tailor-made development for time domain data series pre-processing in the power industry","authors":"Juan J. Gude, L. Vázquez-Seisdedos, David Diaz Martinez","doi":"10.1109/ISSPIT.2011.6151586","DOIUrl":"https://doi.org/10.1109/ISSPIT.2011.6151586","url":null,"abstract":"This paper introduces a tailor-made development, supported in Matlab, to handle a set of instrumental observations associated to energy production technologies. Original sequence of values can be obtained by a filtered one and/or by its curve of trends. Accuracy knowledge of its time scale signal is not strictly necessary because instead of applying signal processing theory and methods to a window of data values, the statement is translated into statistic methods applied to a data time series. A Graphical User Interface Development Environment (GUIDE) integrates data loading, filters configuration, visualization facilities to validate filtering and saving data, resulting in a powerful pre-processing tool. This software platform solution will facilitate the application of such techniques as modelling and identification, monitoring chemical processes, building tools for control strategy decisions, and failure diagnosing, among others.","PeriodicalId":288042,"journal":{"name":"2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131072004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
EM algorithm of spherical models for binned data
H. Hamdan, Jingwen Wu
{"title":"EM algorithm of spherical models for binned data","authors":"H. Hamdan, Jingwen Wu","doi":"10.1109/ISSPIT.2011.6151542","DOIUrl":"https://doi.org/10.1109/ISSPIT.2011.6151542","url":null,"abstract":"In cluster analysis, dealing with large quantity of data is computational expensive. And binning data can be efficient in solving this problem. In the former study, basing cluster analysis on Gaussian mixture models becomes a classical and powerful approach. EM and CEM algorithm are commonly used in mixture approach and classification approach respectively. According to the parametrization of the variance matrices (allowing some of the features of clusters be the same or different: orientation, shape and volume), 14 Gaussian parsimonious models can be generated. Choosing the right parsimonious model is important in obtaining a good result. According to the existing study, Binned-EM algorithm was performed for the most general and diagonal model. In this paper, we apply binned-EM algorithm on spherical models. Two spherical models are studied and their performances on simulated data are compared. The influence of the size of bins in binned-EM algorithm is analyzed. Practical application is shown by applying on Iris data.","PeriodicalId":288042,"journal":{"name":"2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114972456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 7
Mechanical vibration signal compression by LOT-based subband coding
M. Oltean, J. Picheral, E. Lahalle, H. Hamdan
{"title":"Mechanical vibration signal compression by LOT-based subband coding","authors":"M. Oltean, J. Picheral, E. Lahalle, H. Hamdan","doi":"10.1109/ISSPIT.2011.6151525","DOIUrl":"https://doi.org/10.1109/ISSPIT.2011.6151525","url":null,"abstract":"A novel compression method for the mechanical vibration signals is proposed in this paper. The vibration signal is first decomposed into subbands, by the intermediate of the Lapped Orthogonal Transform. Next, adaptive bit allocation is done on per subband basis and uniform quantization is performed in each subband. The method is applied on a large number of mechanical vibration signals issued by aircraft engines and it shows good results. Due to the quality of the decoding, the reconstructed signals are usable by post-compression treatments, such as fault detection for health monitoring purposes.","PeriodicalId":288042,"journal":{"name":"2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124726263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
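The abstract above describes adaptive bit allocation per subband followed by uniform quantisation in each subband. A minimal sketch of those two steps follows. It assumes the classic log-variance allocation rule and a mid-tread quantiser, since the abstract does not state which rule or quantiser the authors use. The function names, `mean_bits` and `peak` are illustrative, and the Lapped Orthogonal Transform stage is not shown.

```python
import math


def allocate_bits(variances, mean_bits):
    """Assumed rule: b_k = mean_bits + 0.5 * log2(var_k / geometric mean).

    All variances must be strictly positive.
    """
    log_vars = [math.log2(v) for v in variances]
    log_geo_mean = sum(log_vars) / len(log_vars)
    raw = [mean_bits + 0.5 * (lv - log_geo_mean) for lv in log_vars]
    # Round to whole bits and clip at zero; the rounded total can drift
    # slightly from the nominal budget of mean_bits per subband.
    return [max(0, round(b)) for b in raw]


def quantize(samples, bits, peak):
    """Uniform mid-tread quantiser over [-peak, peak]; returns reconstructions."""
    if bits == 0:
        return [0.0 for _ in samples]
    step = 2.0 * peak / (2 ** bits)
    lowest, highest = -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    out = []
    for x in samples:
        index = min(max(round(x / step), lowest), highest)  # clip to code range
        out.append(index * step)
    return out
```

Subbands with more energy receive more bits, so the quantisation step shrinks where the signal is strongest. Rescaling the allocation so it exactly meets a bit budget is left out of this sketch.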
A robust characterization of audio signals using the level of information content per Chroma
A. Manzo-Martinez, José Antonio Camarena Ibarrola
{"title":"A robust characterization of audio signals using the level of information content per Chroma","authors":"A. Manzo-Martinez, José Antonio Camarena Ibarrola","doi":"10.1109/ISSPIT.2011.6151562","DOIUrl":"https://doi.org/10.1109/ISSPIT.2011.6151562","url":null,"abstract":"In this paper we propose a new technique to characterize audio-signals. We use Shannon's Entropy to estimate the level of information content per chroma and we show that involving entropy contributes for a more robust audio characterization. A new audio-fingerprint (AFP) based on this feature is proposed in this paper which we have called Entropy-Chroma Fingerprint (ECFP). Two approaches were considered to estimate entropy; the first assumes the spectral coefficients distribute normally, while the second, estimates its probability density function (PDF) with the Parzen Windows Estimation method. We compared the robustness of the ECFP against the Chromagram-Based Audio-Fingerprint (CBFP) which is determined using the Constant Q Transform (CQT). Three thousand and five hundred AFPs were determined from songs of several genres. A subset of 350 songs were severely degraded and searched for using excerpts of 5 seconds for that matter. The ECFP determined assuming gaussianity on the PDF turned out to be much more robust than the CBFP. The ECFP determined assuming gaussianity is much faster to process than both, the CBFP and the ECFP determined with Parzen Windows and still more robust.","PeriodicalId":288042,"journal":{"name":"2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124808185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 5
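The first of the two entropy estimates in the abstract above assumes normally distributed spectral coefficients. Under that assumption the entropy has a closed form, h = 0.5 ln(2πeσ²), which may be why the authors report it as much faster than the Parzen-window variant. A minimal sketch follows, assuming a univariate Gaussian fitted to the coefficients of each chroma bin. The function names and the input layout are illustrative, and the paper may treat the coefficients differently.

```python
import math


def gaussian_entropy(values, floor=1e-12):
    """Differential entropy, in nats, of a Gaussian fitted to `values`."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    # Floor the variance so that a constant input does not produce log(0).
    return 0.5 * math.log(2.0 * math.pi * math.e * max(variance, floor))


def entropy_per_chroma(coeffs_by_chroma):
    """Map one list of spectral coefficients per pitch class to one entropy each."""
    return [gaussian_entropy(coeffs) for coeffs in coeffs_by_chroma]
```

Because only a variance is needed per chroma bin, the cost is linear in the number of coefficients, whereas a kernel density estimate has to evaluate a kernel for pairs of samples.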
Exponentiated enhancement for fundamental frequency extraction of noisy speech
Masatoshi Narita, T. Shimamura
{"title":"Exponentiated enhancement for fundamental frequency extraction of noisy speech","authors":"Masatoshi Narita, T. Shimamura","doi":"10.1109/ISSPIT.2011.6151585","DOIUrl":"https://doi.org/10.1109/ISSPIT.2011.6151585","url":null,"abstract":"The fundamental frequency extraction of speech is an important problem needed in many speech processing systems. In this paper, we propose a new method for extracting the fundamental frequency of noisy speech using correlation, difference, and exponent. The inverse Fourier transform of the exponentiated amplitude spectrum is weighted by the inverse of the average exponentiated magnitude difference function. By using an appropriate exponent constant determined from the noise amount, the signal component corresponding to the fundamental frequency is emphasized. We compare the proposed method with conventional ones. The simulation results obtained with the appropriate exponent at each signal-to-noise ratio show the effectiveness of the proposed method.","PeriodicalId":288042,"journal":{"name":"2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116142914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
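The abstract above weights the inverse Fourier transform of an exponentiated amplitude spectrum by the inverse of an average magnitude difference function (AMDF). A simplified sketch of that idea follows. It covers only the special case of exponent 2, where the inverse transform of the power spectrum is the autocorrelation, and computes that autocorrelation directly in the time domain. The noise-dependent choice of exponent is not reproduced here. The function name, the search range and `eps` are illustrative assumptions.

```python
import math


def estimate_f0(frame, sample_rate, f0_min=50.0, f0_max=400.0, eps=1e-6):
    """Pick the lag that maximises autocorrelation / (AMDF + eps)."""
    min_lag = max(1, int(sample_rate / f0_max))
    max_lag = min(int(sample_rate / f0_min), len(frame) - 1)
    best_lag, best_score = None, -math.inf
    for lag in range(min_lag, max_lag + 1):
        pairs = len(frame) - lag
        # Autocorrelation peaks at the pitch period...
        acf = sum(frame[n] * frame[n + lag] for n in range(pairs))
        # ...while the AMDF dips there, so dividing sharpens the peak.
        amdf = sum(abs(frame[n] - frame[n + lag]) for n in range(pairs)) / pairs
        score = acf / (amdf + eps)
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag
```

For example, a clean sinusoid with a 50-sample period sampled at 8000 Hz has its strongest weighted peak at lag 50, which corresponds to 160 Hz. The unnormalised autocorrelation shrinks with lag, which favours the first period over its multiples.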