A Collelogram based Pitch and Voiced/Unvoiced Classification Method for Real-Time Speech Analysis in Noisy Environment

Md. Ekramul Hamid, M. I. Molla
{"title":"A Collelogram based Pitch and Voiced/Unvoiced Classification Method for Real-Time Speech Analysis in Noisy Environment","authors":"Md. Ekramul Hamid, M. I. Molla","doi":"10.1109/APWCONCSE.2017.00025","DOIUrl":null,"url":null,"abstract":"Pitch estimation is frequently used in voice quality analysis. Speech in a noisy environment, the accuracy of pitch extraction is poor due to the effect of noises. This paper presents a simple technique for robust pitch estimation as well as voiced/unvoiced classification based on correlogram of noisy speech signal. Like spectrogram, the short-time autocorrelation outputs can be displayed graphically as another image called correlogram is an alternative to short time spectral analysis. This technique operates frame-by-frame basis on normalized autocorrelation function (NACF) of signal. Initially, the noisy speech signal is low pass filtered within the pitch range 50-500 Hz to obtain the pre-filtered signal. Then a threshold function is derived from the NACF. We use this threshold value for pitch position indicator and voiced/unvoiced classifier. The accurate pitch period is obtained from the weighted correlogram. The proposed pitch estimation and voiced/unvoiced classification algorithm using correlogram is very simple, fast and easily implemented in computer. The performance of the proposed algorithm is compared with recently developed EMD based method. The experimental results show that the proposed one is useful in speech analysis research.","PeriodicalId":215519,"journal":{"name":"2017 4th Asia-Pacific World Congress on Computer Science and Engineering (APWC on CSE)","volume":"228 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 4th Asia-Pacific World Congress on Computer Science and Engineering (APWC on CSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APWCONCSE.2017.00025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Pitch estimation is frequently used in voice quality analysis. Speech in a noisy environment, the accuracy of pitch extraction is poor due to the effect of noises. This paper presents a simple technique for robust pitch estimation as well as voiced/unvoiced classification based on correlogram of noisy speech signal. Like spectrogram, the short-time autocorrelation outputs can be displayed graphically as another image called correlogram is an alternative to short time spectral analysis. This technique operates frame-by-frame basis on normalized autocorrelation function (NACF) of signal. Initially, the noisy speech signal is low pass filtered within the pitch range 50-500 Hz to obtain the pre-filtered signal. Then a threshold function is derived from the NACF. We use this threshold value for pitch position indicator and voiced/unvoiced classifier. The accurate pitch period is obtained from the weighted correlogram. The proposed pitch estimation and voiced/unvoiced classification algorithm using correlogram is very simple, fast and easily implemented in computer. The performance of the proposed algorithm is compared with recently developed EMD based method. The experimental results show that the proposed one is useful in speech analysis research.
噪声环境下实时语音分析的一种基于共性图的音高和浊音/浊音分类方法
音高估计是语音质量分析中常用的一种方法。在噪声环境下的语音,由于噪声的影响,基音提取的精度较差。本文提出了一种简单的基于噪声语音信号相关图的鲁棒基音估计和浊音/浊音分类技术。与谱图一样,短时自相关输出可以图形化地显示为另一种称为相关图的图像,是短时谱分析的替代方案。该技术基于信号的归一化自相关函数(NACF)逐帧处理。首先,对带噪声的语音信号在50- 500hz的基音范围内进行低通滤波,得到预滤波信号。然后从NACF中导出阈值函数。我们将这个阈值用于音高位置指示器和浊音/浊音分类器。通过加权相关图得到准确的基音周期。本文提出的基于相关图的音高估计和浊音/浊音分类算法简单、快速、易于在计算机上实现。将该算法的性能与最近发展的基于EMD的方法进行了比较。实验结果表明,该方法在语音分析研究中是有效的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信