Sishir Kalita, S. R. Mahadeva Prasanna, S. Dandapat
{"title":"Analysis of glottal stops using pitch synchronous integrated linear prediction residual","authors":"Sishir Kalita, S. R. Mahadeva Prasanna, S. Dandapat","doi":"10.1109/NCC.2016.7561143","DOIUrl":null,"url":null,"abstract":"This work analyzes excitation source to characterize glottal stops using integrated linear prediction (ILP) residual, derived by pitch-synchronous (PS) approach. The glottal stop consonant is produced due to laryngeal gesture in the form of constricted glottis. This pressed glottal configuration, leads to period to period irregularities, aperiodicity, and asymmetry. Normalized crosscorrelation coefficient (NCC) between two successive glottal cycles is computed to capture the dissimilarity between adjacent glottal cycles. Also to characterize the variation in the abruptness of glottal pulses, ratio between strength of excitation (SoE) at two consecutive epoch locations and temporal energy distribution using waveform peak factor (WPF) are computed. To capture the asymmetric behavior of each glottal cycle, higher order statistics (HOS) measures are evaluated. These features show discrimination between glottal stop and other adjacent sounds and can therefore be used for spotting glottal stops in continuous speech.","PeriodicalId":279637,"journal":{"name":"2016 Twenty Second National Conference on Communication (NCC)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Twenty Second National Conference on Communication (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2016.7561143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This work analyzes excitation source to characterize glottal stops using integrated linear prediction (ILP) residual, derived by pitch-synchronous (PS) approach. The glottal stop consonant is produced due to laryngeal gesture in the form of constricted glottis. This pressed glottal configuration, leads to period to period irregularities, aperiodicity, and asymmetry. Normalized crosscorrelation coefficient (NCC) between two successive glottal cycles is computed to capture the dissimilarity between adjacent glottal cycles. Also to characterize the variation in the abruptness of glottal pulses, ratio between strength of excitation (SoE) at two consecutive epoch locations and temporal energy distribution using waveform peak factor (WPF) are computed. To capture the asymmetric behavior of each glottal cycle, higher order statistics (HOS) measures are evaluated. These features show discrimination between glottal stop and other adjacent sounds and can therefore be used for spotting glottal stops in continuous speech.