{"title":"Analysis of Excitation Source Characteristics for Shouted and Normal Speech Classification","authors":"Shikha Baghel, S. Prasanna, P. Guha","doi":"10.1109/NCC48643.2020.9056079","DOIUrl":null,"url":null,"abstract":"The present work is aimed at analysing the excitation source characteristics of normal and shouted speech. In this context, we analyze the Differenced Electroglottogram (DEGG) signal corresponding to different vowels. This work proposes two novel excitation source features that are estimated from DEGG signal. These features are (a) Open Phase Triangle Area (OPTA) and (b) Flatness of Glottal Cycle (FoGC). OPTA captures the effect of open phase duration and slope of DEGG signal. FoGC measures the change in source characteristics due to strength of excitation (SoE) and pitch period. A practical issue in using the proposed features is the unavailability of DEGG signal in most speech processing applications. To overcome this problem, the integrated linear prediction residual (ILPR) signal estimated from speech is considered as an approximation of DEGG. We show that the proposed features can be computed from ILPR signal in the absence of DEGG. It is observed that the proposed features (estimated from either DEGG or ILPR) are successful in discriminating shouted from normal speech.","PeriodicalId":183772,"journal":{"name":"2020 National Conference on Communications (NCC)","volume":"216 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC48643.2020.9056079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The present work is aimed at analysing the excitation source characteristics of normal and shouted speech. In this context, we analyze the Differenced Electroglottogram (DEGG) signal corresponding to different vowels. This work proposes two novel excitation source features that are estimated from DEGG signal. These features are (a) Open Phase Triangle Area (OPTA) and (b) Flatness of Glottal Cycle (FoGC). OPTA captures the effect of open phase duration and slope of DEGG signal. FoGC measures the change in source characteristics due to strength of excitation (SoE) and pitch period. A practical issue in using the proposed features is the unavailability of DEGG signal in most speech processing applications. To overcome this problem, the integrated linear prediction residual (ILPR) signal estimated from speech is considered as an approximation of DEGG. We show that the proposed features can be computed from ILPR signal in the absence of DEGG. It is observed that the proposed features (estimated from either DEGG or ILPR) are successful in discriminating shouted from normal speech.