{"title":"Voiced/Unvoiced Detection of Speech Signals Using Empirical Mode Decomposition Model","authors":"K. I. Molla, K. Hirose, N. Minematsu, K. Hasan","doi":"10.1109/ICICT.2007.375400","DOIUrl":null,"url":null,"abstract":"This paper presents a new technique for voiced/unvoiced (V/UV) discrimination based on the extraction of pitch period. Empirical mode decomposition (EMD) is employed for multi-band representation of speech signal in time domain. The fundamental oscillation in a speech segment is determined in the autocorrelation function (ACF) of the EMD space. A damped cosine model is fitted using least squared method to extract the frequency of the fundamental oscillation. The mean fractional energy contributed by the oscillations with pitch period in different ACFs is used as the factor to classify a speech segment as V/UV. The experimental results show that the performance of the proposed method is noticeable as compared to other reported methods.","PeriodicalId":206443,"journal":{"name":"2007 International Conference on Information and Communication Technology","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 International Conference on Information and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICT.2007.375400","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
This paper presents a new technique for voiced/unvoiced (V/UV) discrimination based on the extraction of pitch period. Empirical mode decomposition (EMD) is employed for multi-band representation of speech signal in time domain. The fundamental oscillation in a speech segment is determined in the autocorrelation function (ACF) of the EMD space. A damped cosine model is fitted using least squared method to extract the frequency of the fundamental oscillation. The mean fractional energy contributed by the oscillations with pitch period in different ACFs is used as the factor to classify a speech segment as V/UV. The experimental results show that the performance of the proposed method is noticeable as compared to other reported methods.