Hongwei Wu, Yibiao Yu, Heming Zhao, Xueqin Chen, Chunjuan Wang
{"title":"噪声语音多锥度频谱中均值移位算法的基音估计","authors":"Hongwei Wu, Yibiao Yu, Heming Zhao, Xueqin Chen, Chunjuan Wang","doi":"10.1109/ICIST.2014.6920512","DOIUrl":null,"url":null,"abstract":"In this paper, we apply the mean shift algorithm to estimate pitches of noisy speech from its multitaper spectrum. The noisy speech is first transformed into the multitaper spectrum, which can reduce the stationary noise. The pitch is extracted from the multitaper spectrum using the mean shift algorithm. After all estimates are collected, dynamic programming is used to obtain a smoothed pitch contour. We compare the performance of our method with two well-known algorithms on the Keele pitch database and demonstrate that it performs much well even at SNR as low as -15dB.","PeriodicalId":306383,"journal":{"name":"2014 4th IEEE International Conference on Information Science and Technology","volume":"11 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Pitch estimation using mean shift algorithm on multitaper spectrum of noisy speech\",\"authors\":\"Hongwei Wu, Yibiao Yu, Heming Zhao, Xueqin Chen, Chunjuan Wang\",\"doi\":\"10.1109/ICIST.2014.6920512\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we apply the mean shift algorithm to estimate pitches of noisy speech from its multitaper spectrum. The noisy speech is first transformed into the multitaper spectrum, which can reduce the stationary noise. The pitch is extracted from the multitaper spectrum using the mean shift algorithm. After all estimates are collected, dynamic programming is used to obtain a smoothed pitch contour. We compare the performance of our method with two well-known algorithms on the Keele pitch database and demonstrate that it performs much well even at SNR as low as -15dB.\",\"PeriodicalId\":306383,\"journal\":{\"name\":\"2014 4th IEEE International Conference on Information Science and Technology\",\"volume\":\"11 2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-04-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 4th IEEE International Conference on Information Science and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIST.2014.6920512\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 4th IEEE International Conference on Information Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIST.2014.6920512","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Pitch estimation using mean shift algorithm on multitaper spectrum of noisy speech
In this paper, we apply the mean shift algorithm to estimate pitches of noisy speech from its multitaper spectrum. The noisy speech is first transformed into the multitaper spectrum, which can reduce the stationary noise. The pitch is extracted from the multitaper spectrum using the mean shift algorithm. After all estimates are collected, dynamic programming is used to obtain a smoothed pitch contour. We compare the performance of our method with two well-known algorithms on the Keele pitch database and demonstrate that it performs much well even at SNR as low as -15dB.