{"title":"Representation of speech signals using Hartley group delay function","authors":"K. Narendra, R. K. Swamy","doi":"10.1109/IC3I.2016.7917974","DOIUrl":null,"url":null,"abstract":"This paper presents an alternate representation of phase information in speech signals using Hartley transform. Hartley Group Delay Function (HGDF) is computed on similar lines of Fourier Group delay function. Cepstral smoothing is applied so as to reduce the spiky nature of the group delay functions. The smoothened HGDF (SHGDF) is reported to have better resolution in group delay spectrum. A speaker verification system is designed as an application for the proposed signal representation. SHGDF is then presented as input to feed forward neural network. Performance curves using MFCCs, MODGFs and proposed SHGDF as features for the neural network are compared. It is found that the SHGDF functions provide better average performance for the speaker recognition system.","PeriodicalId":305971,"journal":{"name":"2016 2nd International Conference on Contemporary Computing and Informatics (IC3I)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 2nd International Conference on Contemporary Computing and Informatics (IC3I)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3I.2016.7917974","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents an alternate representation of phase information in speech signals using Hartley transform. Hartley Group Delay Function (HGDF) is computed on similar lines of Fourier Group delay function. Cepstral smoothing is applied so as to reduce the spiky nature of the group delay functions. The smoothened HGDF (SHGDF) is reported to have better resolution in group delay spectrum. A speaker verification system is designed as an application for the proposed signal representation. SHGDF is then presented as input to feed forward neural network. Performance curves using MFCCs, MODGFs and proposed SHGDF as features for the neural network are compared. It is found that the SHGDF functions provide better average performance for the speaker recognition system.