基于MFCC和声学参数的语音信号鲁棒分割

Zhandos Yessenbayev
{"title":"基于MFCC和声学参数的语音信号鲁棒分割","authors":"Zhandos Yessenbayev","doi":"10.1109/AMS.2012.26","DOIUrl":null,"url":null,"abstract":"In the current work, we investigate the effect of combining the mel-frequency cepstral coefficients (MFCC) with the acoustic parameters (AP) in the task of segmentation of continuous speech into sonorant and obstruent regions using Hidden Markov Models (HMM) with Gaussian Mixture Models (GMM). Along with the influence of APs to the performance of the model built, we analyze the set of acoustic features extracted for each phoneme to see how robust they are in the noise. All the experiments were conducted on TIMIT database. The results of the experiments show that there are APs, which have nice separating property and, therefore, improve the performance of a system if used with MFCCs, however, they are not robust to noise. On the other hand, there are APs, which do not have this property, but possess the intrinsic stability in noisy conditions and, as a result, add some robustness to a system.","PeriodicalId":407900,"journal":{"name":"2012 Sixth Asia Modelling Symposium","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Robust Segmentation of Speech Signal Using MFCC and Acoustic Parameters\",\"authors\":\"Zhandos Yessenbayev\",\"doi\":\"10.1109/AMS.2012.26\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the current work, we investigate the effect of combining the mel-frequency cepstral coefficients (MFCC) with the acoustic parameters (AP) in the task of segmentation of continuous speech into sonorant and obstruent regions using Hidden Markov Models (HMM) with Gaussian Mixture Models (GMM). Along with the influence of APs to the performance of the model built, we analyze the set of acoustic features extracted for each phoneme to see how robust they are in the noise. All the experiments were conducted on TIMIT database. The results of the experiments show that there are APs, which have nice separating property and, therefore, improve the performance of a system if used with MFCCs, however, they are not robust to noise. On the other hand, there are APs, which do not have this property, but possess the intrinsic stability in noisy conditions and, as a result, add some robustness to a system.\",\"PeriodicalId\":407900,\"journal\":{\"name\":\"2012 Sixth Asia Modelling Symposium\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 Sixth Asia Modelling Symposium\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AMS.2012.26\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Sixth Asia Modelling Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AMS.2012.26","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

在本工作中,我们研究了将mel-frequency倒谱系数(MFCC)与声学参数(AP)结合使用隐马尔可夫模型(HMM)和高斯混合模型(GMM)将连续语音分割成浊音和浊音区域的效果。随着ap对所建模型性能的影响,我们分析了为每个音素提取的声学特征集,以查看它们在噪声中的鲁棒性。所有实验均在TIMIT数据库上进行。实验结果表明,存在ap,它们具有良好的分离性能,因此,如果与mfc一起使用,可以提高系统的性能,但是它们对噪声的鲁棒性不强。另一方面,有些ap不具有这种性质,但在噪声条件下具有固有稳定性,从而为系统增加了一定的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Robust Segmentation of Speech Signal Using MFCC and Acoustic Parameters
In the current work, we investigate the effect of combining the mel-frequency cepstral coefficients (MFCC) with the acoustic parameters (AP) in the task of segmentation of continuous speech into sonorant and obstruent regions using Hidden Markov Models (HMM) with Gaussian Mixture Models (GMM). Along with the influence of APs to the performance of the model built, we analyze the set of acoustic features extracted for each phoneme to see how robust they are in the noise. All the experiments were conducted on TIMIT database. The results of the experiments show that there are APs, which have nice separating property and, therefore, improve the performance of a system if used with MFCCs, however, they are not robust to noise. On the other hand, there are APs, which do not have this property, but possess the intrinsic stability in noisy conditions and, as a result, add some robustness to a system.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信