K. Sri Rama Murty, S. R. Mahadeva Prasanna, B. Yegnanarayana
Speaker-specific information from residual phase
2004 International Conference on Signal Processing and Communications, 2004. SPCOM '04. Published 2004-12-11.
DOI: 10.1109/SPCOM.2004.1458513 (https://doi.org/10.1109/SPCOM.2004.1458513)
This paper demonstrates the presence of speaker-specific information in the residual phase using autoassociative neural network (AANN) models. The residual phase is extracted from the speech signal after removing the vocal tract information by linear prediction (LP) analysis. AANN models are then used to capture the speaker-specific information present in the residual phase. Speaker recognition studies indicate that the residual phase contains significant speaker-specific information and that the AANN models do capture it. In this study we also demonstrate that, in voiced speech segments, the regions around the instants of glottal closure are more speaker-specific than other regions.
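The extraction step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact definition of the residual phase, so the code below assumes one common formulation: the LP residual is obtained by inverse filtering with autocorrelation-method LP coefficients, and the residual phase is taken as the cosine of the phase of the analytic signal of that residual. The function names, the LP order of 10, and the small regularization constants are illustrative assumptions, not details taken from the paper.

```python
import numpy as np


def lp_coefficients(frame, order):
    """Autocorrelation-method LP (assumed variant): solve R a = r."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order
    r = np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])
    # Toeplitz autocorrelation matrix built from lags 0..order-1
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    # Tiny diagonal loading (an assumption) to keep the solve well conditioned
    R = R + 1e-9 * r[0] * np.eye(order)
    return np.linalg.solve(R, r[1:])  # predictor coefficients a_1..a_p


def lp_residual(frame, a):
    """Inverse filter: e(n) = s(n) - sum_k a_k s(n-k), zero initial state."""
    frame = np.asarray(frame, dtype=float)
    inverse_filter = np.concatenate(([1.0], -np.asarray(a, dtype=float)))
    return np.convolve(frame, inverse_filter)[: len(frame)]


def analytic_signal(x):
    """Analytic signal via the FFT (zero out negative frequencies)."""
    n = len(x)
    spectrum = np.fft.fft(x)
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1 : n // 2] = 2.0
    else:
        weights[1 : (n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * weights)


def residual_phase(frame, order=10):
    """Cosine of the analytic-signal phase of the LP residual (assumed definition)."""
    a = lp_coefficients(frame, order)
    e = lp_residual(frame, a)
    z = analytic_signal(e)
    envelope = np.abs(z)  # Hilbert envelope of the residual
    # Dividing by the envelope removes amplitude and leaves cos(theta(n))
    return np.real(z) / np.maximum(envelope, 1e-12)


if __name__ == "__main__":
    # Synthetic stand-in for a speech frame: a second-order AR process
    rng = np.random.default_rng(0)
    excitation = rng.standard_normal(400)
    signal = np.zeros(400)
    for i in range(2, 400):
        signal[i] = 1.3 * signal[i - 1] - 0.6 * signal[i - 2] + excitation[i]
    phase = residual_phase(signal, order=10)
    print(phase.shape, float(phase.min()), float(phase.max()))
```

In a speaker recognition setting, short blocks of such residual phase samples would be the kind of input an AANN could be trained on. The network architecture and the block length are not stated in the abstract and are left out of this sketch.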