Effectiveness of polarity detection for improved epoch extraction from speech

D. Govind, P. Hisham, D. Pravena
Published in: 2016 Twenty Second National Conference on Communication (NCC)
Publication date: 2016-03-04
DOI: 10.1109/NCC.2016.7561089 (https://doi.org/10.1109/NCC.2016.7561089)
Citations: 5

Abstract

This work demonstrates the significance of speech polarity detection in improving the accuracy of epochs estimated from speech. It also proposes a method for extracting speech polarity information using the properties of the Hilbert transform. The Hilbert transform of the speech signal is computed as the imaginary part of its complex analytic signal representation, and the Hilbert envelope (HE) is computed as the magnitude of the analytic signal. The average slopes of the speech signal and of its Hilbert transform around the peaks of the HE are observed to vary with the polarity of the speech signal. The effectiveness of the proposed approach is confirmed by a performance evaluation over 7 voices of the phonetically balanced CMU-Arctic database and the German emotional speech database. Its performance is also observed to be comparable to that of existing algorithms such as residual-skewness-based and Hilbert-phase-based speech polarity detection. Finally, as an application of speech polarity detection, a significant improvement is demonstrated in the identification accuracy of epochs estimated with the popular zero frequency filtering (ZFF) method.
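The quantities named in the abstract can be illustrated with a minimal NumPy sketch. It builds the analytic signal with an FFT, takes its imaginary part as the Hilbert transform and its magnitude as the HE, and then measures the average slope of both signals around the HE peaks. The function names, the peak-picking rule, and the `half_width` and `rel_threshold` parameters are assumptions made for illustration only. The abstract does not give the authors' peak selection or their polarity decision rule, so none is implemented here.

```python
import numpy as np


def analytic_signal(x):
    """Complex analytic signal of a real 1-D signal, built in the frequency domain."""
    x = np.asarray(x, dtype=float)
    n = x.size
    spectrum = np.fft.fft(x)
    # Keep DC (and Nyquist when n is even), double the positive
    # frequencies, and zero the negative frequencies.
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * weights)


def hilbert_features(x):
    """Return (Hilbert transform, Hilbert envelope) of x."""
    z = analytic_signal(x)
    return z.imag, np.abs(z)


def envelope_peaks(envelope, rel_threshold=0.5):
    """Indices of local maxima at or above rel_threshold * max (illustrative rule)."""
    e = np.asarray(envelope, dtype=float)
    is_local_max = (e[1:-1] > e[:-2]) & (e[1:-1] >= e[2:])
    is_strong = e[1:-1] >= rel_threshold * e.max()
    return np.flatnonzero(is_local_max & is_strong) + 1


def mean_slopes_at_peaks(x, half_width=4, rel_threshold=0.5):
    """Average slope of x and of its Hilbert transform around the HE peaks."""
    x = np.asarray(x, dtype=float)
    ht, he = hilbert_features(x)
    peaks = envelope_peaks(he, rel_threshold)
    # Discard peaks whose window would run past either end of the signal.
    peaks = peaks[(peaks >= half_width) & (peaks + half_width < x.size)]

    def mean_slope(sig):
        if peaks.size == 0:
            return 0.0
        rise = sig[peaks + half_width] - sig[peaks - half_width]
        return float(np.mean(rise / (2 * half_width)))

    return mean_slope(x), mean_slope(ht)
```

Because the Hilbert transform is linear, inverting the input (`-x`) leaves the HE and its peaks unchanged while negating both slopes. That sign dependence is the property the abstract describes, and a polarity detector would have to turn it into a decision.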