{"title":"一种从噪声干扰语音中估计形成峰频率的倒频谱算法","authors":"S. Fattah, W. Zhu, M. Ahmad","doi":"10.1109/ICNNSP.2008.4590321","DOIUrl":null,"url":null,"abstract":"A new scheme for the estimation of formant frequencies from noise-corrupted speech signals is presented in this paper. In order to overcome the effect of noise, first, instead of conventional autocorrelation function (ACF), a once-repeated ACF of the observed data is employed. A ramp cosine cepstrum model of the ORACF of speech signal is developed, followed by a model-fitting based least-square optimization to extract the formants. For the purpose of implementation, the discrete cosine transform (DCT) is used which offers computational advantages for real signals and solves the phase unwrapping problem. Synthetic and natural vowels as well as some naturally spoken sentences in noisy environments are tested. The experimental results demonstrate a better performance obtained by the proposed scheme in comparison to some of the existing methods at low levels of signal-to-noise ratio.","PeriodicalId":250993,"journal":{"name":"2008 International Conference on Neural Networks and Signal Processing","volume":"693 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A cepstral domain algorithm for formant frequency estimation from noise-corrupted speech\",\"authors\":\"S. Fattah, W. Zhu, M. Ahmad\",\"doi\":\"10.1109/ICNNSP.2008.4590321\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A new scheme for the estimation of formant frequencies from noise-corrupted speech signals is presented in this paper. In order to overcome the effect of noise, first, instead of conventional autocorrelation function (ACF), a once-repeated ACF of the observed data is employed. A ramp cosine cepstrum model of the ORACF of speech signal is developed, followed by a model-fitting based least-square optimization to extract the formants. For the purpose of implementation, the discrete cosine transform (DCT) is used which offers computational advantages for real signals and solves the phase unwrapping problem. Synthetic and natural vowels as well as some naturally spoken sentences in noisy environments are tested. The experimental results demonstrate a better performance obtained by the proposed scheme in comparison to some of the existing methods at low levels of signal-to-noise ratio.\",\"PeriodicalId\":250993,\"journal\":{\"name\":\"2008 International Conference on Neural Networks and Signal Processing\",\"volume\":\"693 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-06-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 International Conference on Neural Networks and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNNSP.2008.4590321\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Conference on Neural Networks and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNNSP.2008.4590321","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A cepstral domain algorithm for formant frequency estimation from noise-corrupted speech
A new scheme for the estimation of formant frequencies from noise-corrupted speech signals is presented in this paper. In order to overcome the effect of noise, first, instead of conventional autocorrelation function (ACF), a once-repeated ACF of the observed data is employed. A ramp cosine cepstrum model of the ORACF of speech signal is developed, followed by a model-fitting based least-square optimization to extract the formants. For the purpose of implementation, the discrete cosine transform (DCT) is used which offers computational advantages for real signals and solves the phase unwrapping problem. Synthetic and natural vowels as well as some naturally spoken sentences in noisy environments are tested. The experimental results demonstrate a better performance obtained by the proposed scheme in comparison to some of the existing methods at low levels of signal-to-noise ratio.