A Modified Multi-Feature Voiced/Unvoiced Speech Classification Method

R. Cai
{"title":"A Modified Multi-Feature Voiced/Unvoiced Speech Classification Method","authors":"R. Cai","doi":"10.1109/APPED.2010.25","DOIUrl":null,"url":null,"abstract":"A modified multi-feature voiced/unvoiced speech classification method is presented. The method is based on statistical analysis of wavelet-based frequency distribution of the average energy, zero-crossing rate, and average energy of short-time segments of the speech signal. The method first classifies the input speech into voiced, unvoiced and uncertain parts by comparing features with predetermined thresholds. Then, the uncertain parts are treated in three conditions and the boundary between voiced and unvoiced speech parts is determined by the average energy feature. The performance of the method has been evaluated using a large speech database. The method is shown to perform well in the cases of both clean and noise-degraded speech.","PeriodicalId":129691,"journal":{"name":"2010 Asia-Pacific Conference on Power Electronics and Design","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Asia-Pacific Conference on Power Electronics and Design","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APPED.2010.25","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

A modified multi-feature voiced/unvoiced speech classification method is presented. The method is based on statistical analysis of wavelet-based frequency distribution of the average energy, zero-crossing rate, and average energy of short-time segments of the speech signal. The method first classifies the input speech into voiced, unvoiced and uncertain parts by comparing features with predetermined thresholds. Then, the uncertain parts are treated in three conditions and the boundary between voiced and unvoiced speech parts is determined by the average energy feature. The performance of the method has been evaluated using a large speech database. The method is shown to perform well in the cases of both clean and noise-degraded speech.
一种改进的多特征浊音/非浊音分类方法
提出了一种改进的多特征浊音/不浊音分类方法。该方法基于对语音信号短时间段的平均能量、过零率和平均能量的小波频率分布的统计分析。该方法首先通过特征与预定阈值的比较,将输入语音分为浊音部分、浊音部分和不确定部分。然后,将不确定部分分为三种情况进行处理,根据平均能量特征确定浊音部分和浊音部分的边界;利用大型语音数据库对该方法的性能进行了评价。结果表明,该方法在清晰语音和有噪声的语音中都表现良好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信