{"title":"Pitch detection algorithm using a wavelet correlation model","authors":"N. A. Kader","doi":"10.1109/NRSC.2000.838962","DOIUrl":null,"url":null,"abstract":"A new algorithm for pitch detection of the speech signal is introduced. The technique is based on the discrete wavelet transform to classify the speech signal into voiced and unvoiced segments. The wavelet parameters of the voiced segments in two frequency bands are extracted and crosscorrelation is performed to generate a correlation function. Then, a peak detection is applied to extract the pitch period. The algorithm is highly immunized to noise. A comparison between the ordinary methods and this new one is presented. The pitch contour is varying through the utterance period rather of being consider as constant through the analysis periods as in ordinary methods. The results are accurate for speech of signal to noise ratio equals to 2 dB.","PeriodicalId":211510,"journal":{"name":"Proceedings of the Seventeenth National Radio Science Conference. 17th NRSC'2000 (IEEE Cat. No.00EX396)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Seventeenth National Radio Science Conference. 17th NRSC'2000 (IEEE Cat. No.00EX396)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NRSC.2000.838962","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
A new algorithm for pitch detection of the speech signal is introduced. The technique is based on the discrete wavelet transform to classify the speech signal into voiced and unvoiced segments. The wavelet parameters of the voiced segments in two frequency bands are extracted and crosscorrelation is performed to generate a correlation function. Then, a peak detection is applied to extract the pitch period. The algorithm is highly immunized to noise. A comparison between the ordinary methods and this new one is presented. The pitch contour is varying through the utterance period rather of being consider as constant through the analysis periods as in ordinary methods. The results are accurate for speech of signal to noise ratio equals to 2 dB.