来自剩余相位的特定于说话人的信息

2004 International Conference on Signal Processing and Communications, 2004. SPCOM '04. Pub Date : 2004-12-11 DOI:10.1109/SPCOM.2004.1458513

K. Sri Rama Murty, S. R. Mahadeva Prasanna, B. Yegnanarayana

{"title":"来自剩余相位的特定于说话人的信息","authors":"K. Sri Rama Murty, S. R. Mahadeva Prasanna, B. Yegnanarayana","doi":"10.1109/SPCOM.2004.1458513","DOIUrl":null,"url":null,"abstract":"This paper demonstrates the presence of speaker-specific information in the residual phase using autoassociative neural network (AANN) models. The residual phase is extracted from the speech signal after eliminating the vocal tract information by the linear prediction (LP) analysis. AANN models are used for capturing the speaker-specific information present in the residual phase. The speaker recognition studies infer that the residual phase contains significant speaker-specific information and it is indeed captured by the AANN models. In this study we also demonstrate that in voiced speech segments, regions around the instants of glottal closure are more speaker-specific compared to other regions.","PeriodicalId":424981,"journal":{"name":"2004 International Conference on Signal Processing and Communications, 2004. SPCOM '04.","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":"{\"title\":\"Speaker-specific information from residual phase\",\"authors\":\"K. Sri Rama Murty, S. R. Mahadeva Prasanna, B. Yegnanarayana\",\"doi\":\"10.1109/SPCOM.2004.1458513\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper demonstrates the presence of speaker-specific information in the residual phase using autoassociative neural network (AANN) models. The residual phase is extracted from the speech signal after eliminating the vocal tract information by the linear prediction (LP) analysis. AANN models are used for capturing the speaker-specific information present in the residual phase. The speaker recognition studies infer that the residual phase contains significant speaker-specific information and it is indeed captured by the AANN models. In this study we also demonstrate that in voiced speech segments, regions around the instants of glottal closure are more speaker-specific compared to other regions.\",\"PeriodicalId\":424981,\"journal\":{\"name\":\"2004 International Conference on Signal Processing and Communications, 2004. SPCOM '04.\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-12-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"22\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2004 International Conference on Signal Processing and Communications, 2004. SPCOM '04.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPCOM.2004.1458513\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2004 International Conference on Signal Processing and Communications, 2004. SPCOM '04.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPCOM.2004.1458513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 22

摘要

本文利用自关联神经网络(AANN)模型证明了残差相位中存在特定说话人的信息。对语音信号进行线性预测分析，剔除声道信息后提取残差相位。AANN模型用于捕获残差阶段中存在的特定于说话人的信息。说话人识别研究表明，残差相位包含重要的说话人特定信息，并且确实被AANN模型捕获。在这项研究中，我们还证明，在浊音段中，与其他区域相比，声门关闭瞬间周围的区域更具说话者特异性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Speaker-specific information from residual phase

This paper demonstrates the presence of speaker-specific information in the residual phase using autoassociative neural network (AANN) models. The residual phase is extracted from the speech signal after eliminating the vocal tract information by the linear prediction (LP) analysis. AANN models are used for capturing the speaker-specific information present in the residual phase. The speaker recognition studies infer that the residual phase contains significant speaker-specific information and it is indeed captured by the AANN models. In this study we also demonstrate that in voiced speech segments, regions around the instants of glottal closure are more speaker-specific compared to other regions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2004 International Conference on Signal Processing and Communications, 2004. SPCOM '04.

自引率

0.00%

发文量